Commit Graph

59 Commits

Author SHA1 Message Date
bac5ac0f48 Comment out newLine logic to move it to Extract 2019-07-25 01:08:35 +02:00
ffc7a38175 More refactoring of moving data between constructors
- move more data into PdfPage object
2019-07-25 00:49:23 +02:00
477539c527 VisitorText add more text handling methods placeholder
- remove stale debug constructor passing
 - add PdfPage object
 - change pdf page to pageData
 - add some more attributes to TextObject
2019-07-24 23:15:45 +02:00
b75b5c73c6
Update README.md 2019-07-24 21:21:37 +02:00
2211abb068 Rename page pdf to pageData and add PdfPage object 2019-07-24 20:58:01 +02:00
38b46e0b94 Visitor refactoring, simplify code, add Geometry
- create visitor on each page
- pass dependencies and page in constructor
- add Geometry for text position measurements
2019-07-24 18:35:47 +02:00
20d839f447 Fix character spacing use -250 value for now (need to measure glyphs) 2019-07-23 19:45:33 +02:00
ac8e1c9d01 Update README.md move documentation link 2019-07-23 08:59:59 +02:00
4c969b79ae Add example command to help 2019-07-23 08:58:34 +02:00
9b314bda4a
Update README.md 2019-07-23 08:56:13 +02:00
b58f364fa0
Update README.md 2019-07-23 08:55:39 +02:00
57e594402e Update README.md with new todo / done 2019-07-23 06:21:52 +02:00
068f56db5c Version 0.0.4 0.0.4 2019-07-23 06:18:00 +02:00
c2ebe7f526 Add todo to Extract 2019-07-23 06:12:55 +02:00
3810d2fcc0 Fix text extraction based on pdf.js samples 2019-07-23 06:04:55 +02:00
263b318029 Update README.md TODO list 2019-07-23 05:47:58 +02:00
30d673a455 Add missing OPS beginAnnotations, endAnnotations 2019-07-23 05:45:16 +02:00
9c2baab2a6 Fix Unimplmemented operator message 2019-07-23 05:44:02 +02:00
d0a2e44cdf Update README.md with correct package url 2019-07-23 05:34:59 +02:00
10443b009b Update package.json repository url 0.0.3 2019-07-23 05:28:39 +02:00
44e59dfe8f Version 0.0.2 0.0.2 2019-07-23 05:16:41 +02:00
c2f597d899 Update README.md with documentation location and package.json keywords 2019-07-23 05:16:28 +02:00
2dee872d40 Change output directory structure / Closes #2 0.0.1 2019-07-23 05:00:53 +02:00
cc2002078b Extract images with VisitorImage #2
- output to file based on format data.{format}
- more documentation
2019-07-23 04:53:17 +02:00
1b6bfbbe13 Documentation FormatterJSON 2019-07-23 04:35:41 +02:00
94c05cf064 Commandline change -i for file input -f for format 2019-07-23 04:05:48 +02:00
63710c2f8b Add demo.sh and test.sh for automating stuff 2019-07-23 01:59:55 +02:00
23c4586d28 VisitorBase for common visitors constructor 2019-07-23 01:43:57 +02:00
bf4698df59 dummy 2019-07-23 01:22:54 +02:00
ab4e0ffdae Update README.md 2019-07-23 01:22:16 +02:00
81f4de1c29 Documentation generate scripts 2019-07-23 01:19:17 +02:00
5cff7ee4c0 Fix after move lib to src 2019-07-23 01:05:11 +02:00
5b2c8eded3 Move lib to src 2019-07-23 01:04:28 +02:00
4a01a382cf Documentation 2019-07-23 00:58:58 +02:00
12536bbc21 Text elements move to separate files 2019-07-23 00:44:49 +02:00
ac54beceba Documentation for Visitor 2019-07-23 00:37:13 +02:00
4aea804772 dummy GoldDigger comment 2019-07-23 00:34:20 +02:00
263fe310f2 Update README.md with simplified usage information 2019-07-23 00:31:09 +02:00
cefca38fa9 Fix missing pdfdig shebang 2019-07-23 00:26:04 +02:00
b797a2f192 Fix package.json 2019-07-23 00:22:42 +02:00
c141efb7a6 Formatter move each formatter to separate file 2019-07-23 00:17:48 +02:00
b638f75bca FormatterText return empty output as print is handled elsewhere 2019-07-23 00:14:17 +02:00
a2b9bcb1d7 Fix Visitor dependency paths 2019-07-23 00:09:16 +02:00
e3d25d2e77 Visitor classes move to separate files make universal visit method 2019-07-23 00:07:23 +02:00
9f9126e2c0 Documentation for Visitor 2019-07-22 23:34:41 +02:00
2f4fa0c474 Rename Executor to Visitor, fix GoldDiger digPage 2019-07-22 23:32:51 +02:00
cbfdc29fde Documentation for Constraints, Extract, FontObject 2019-07-22 23:27:52 +02:00
f7fc939832 Add FONT_IDENTITY_MATRIX for calculating glyph x position 2019-07-22 23:20:37 +02:00
99bd258317 Documentation add to GoldDigger 2019-07-22 23:16:36 +02:00
53c053542d Formatter JSON refactoring move nested forEach to separate methods 2019-07-22 23:08:46 +02:00