Commit Graph

87 Commits

Author SHA1 Message Date
e4f44a5642 Add FormatterHTML for html output, closes #15 2019-07-28 19:25:37 +02:00
9b0574777f Update demo.sh with xml output and extract font 2019-07-28 17:34:16 +02:00
a115debd41 Update package.json keywords to match github project 2019-07-28 17:32:55 +02:00
fd9058c099 Update README.md 2019-07-28 17:28:08 +02:00
0519406a25 Add image data information to output xml, json formatters 2019-07-28 17:27:02 +02:00
e8685c78af Merge branch 'master' of github.com:vane/pdf-gold-digger 2019-07-28 16:53:26 +02:00
ca3071c0a0 dummy 2019-07-28 16:53:20 +02:00
af25b4ad9d Update README.md 2019-07-28 16:52:28 +02:00
9d4672ccb8 Update README.md 2019-07-28 16:46:31 +02:00
2434562da0 Save font files as ttf option, closes #3 2019-07-28 16:44:00 +02:00
f60eebb02d Update README.md 2019-07-28 15:38:50 +02:00
4fac64de45 Update README.md 2019-07-28 15:32:13 +02:00
2b0c5350eb Update README.md 2019-07-28 15:20:15 +02:00
8ae1cee785 Version 0.0.6 0.0.6 2019-07-28 15:12:38 +02:00
a28f9d997f Add more text information, closes #8 2019-07-28 15:10:06 +02:00
c74c037d2a Add xml formatter for xml output closes #1 2019-07-28 14:57:11 +02:00
255de1f3c2 Text information changes
- remove width, x from textline
- sort text lines before output
- add text font x,y position to json formatter
2019-07-28 14:36:31 +02:00
618500a269 Merge pull request #13 from vane/feature/bbox-calculate
Feature bbox calculate
2019-07-28 09:43:08 +02:00
dea317eda8 Add eslint standard with small modifications
- semi always
- comma-dangle always-multiple
2019-07-28 09:40:39 +02:00
4fd9b6024c Fix json formatter / add width calculation 2019-07-27 10:09:37 +02:00
5c83ef37d6 Version 0.0.5 0.0.5 2019-07-26 00:41:01 +02:00
892a8c7bb1 Documentation update / move formatter, visitor to formatters, visitors 2019-07-26 00:38:51 +02:00
d1400175a9 Merge pull request #12 from vane/feature/bbox
VisitorText add more text handling methods placeholder
2019-07-26 00:06:04 +02:00
14819dd714 Fix JSON formatter 2019-07-26 00:04:52 +02:00
1aae2e7425 Remove not used Geometry classes 2019-07-26 00:04:40 +02:00
c3dbfc0e8b Heuristic method to determine if difference between two points is space 2019-07-26 00:02:05 +02:00
877c14a6e1 Rewrite text extraction - calculate new line Closes #11 2019-07-25 23:00:04 +02:00
f5441748bf Add vertical for font object 2019-07-25 01:19:19 +02:00
bac5ac0f48 Comment out newLine logic to move it to Extract 2019-07-25 01:08:35 +02:00
ffc7a38175 More refactoring of moving data between constructors
- move more data into PdfPage object
2019-07-25 00:49:23 +02:00
477539c527 VisitorText add more text handling methods placeholder
- remove stale debug constructor passing
- add PdfPage object
- change pdf page to pageData
- add some more attributes to TextObject
2019-07-24 23:15:45 +02:00
b75b5c73c6 Update README.md 2019-07-24 21:21:37 +02:00
2211abb068 Rename page pdf to pageData and add PdfPage object 2019-07-24 20:58:01 +02:00
38b46e0b94 Visitor refactoring, simplify code, add Geometry
- create visitor on each page
- pass dependencies and page in constructor
- add Geometry for text position measurements
2019-07-24 18:35:47 +02:00
20d839f447 Fix character spacing use -250 value for now (need to measure glyphs) 2019-07-23 19:45:33 +02:00
ac8e1c9d01 Update README.md move documentation link 2019-07-23 08:59:59 +02:00
4c969b79ae Add example command to help 2019-07-23 08:58:34 +02:00
9b314bda4a Update README.md 2019-07-23 08:56:13 +02:00
b58f364fa0 Update README.md 2019-07-23 08:55:39 +02:00
57e594402e Update README.md with new todo / done 2019-07-23 06:21:52 +02:00
068f56db5c Version 0.0.4 0.0.4 2019-07-23 06:18:00 +02:00
c2ebe7f526 Add todo to Extract 2019-07-23 06:12:55 +02:00
3810d2fcc0 Fix text extraction based on pdf.js samples 2019-07-23 06:04:55 +02:00
263b318029 Update README.md TODO list 2019-07-23 05:47:58 +02:00
30d673a455 Add missing OPS beginAnnotations, endAnnotations 2019-07-23 05:45:16 +02:00
9c2baab2a6 Fix Unimplmemented operator message 2019-07-23 05:44:02 +02:00
d0a2e44cdf Update README.md with correct package url 2019-07-23 05:34:59 +02:00
10443b009b Update package.json repository url 0.0.3 2019-07-23 05:28:39 +02:00
44e59dfe8f Version 0.0.2 0.0.2 2019-07-23 05:16:41 +02:00
c2f597d899 Update README.md with documentation location and package.json keywords 2019-07-23 05:16:28 +02:00