|
e4f44a5642
|
Add FormatterHTML for html output, closes #15
|
2019-07-28 19:25:37 +02:00 |
|
|
9b0574777f
|
Update demo.sh with xml output and extract font
|
2019-07-28 17:34:16 +02:00 |
|
|
a115debd41
|
Update package.json keywords to match github project
|
2019-07-28 17:32:55 +02:00 |
|
|
fd9058c099
|
Update README.md
|
2019-07-28 17:28:08 +02:00 |
|
|
0519406a25
|
Add image data information to output xml, json formatters
|
2019-07-28 17:27:02 +02:00 |
|
|
e8685c78af
|
Merge branch 'master' of github.com:vane/pdf-gold-digger
|
2019-07-28 16:53:26 +02:00 |
|
|
ca3071c0a0
|
dummy
|
2019-07-28 16:53:20 +02:00 |
|
|
af25b4ad9d
|
Update README.md
|
2019-07-28 16:52:28 +02:00 |
|
|
9d4672ccb8
|
Update README.md
|
2019-07-28 16:46:31 +02:00 |
|
|
2434562da0
|
Save font files as ttf option, closes #3
|
2019-07-28 16:44:00 +02:00 |
|
|
f60eebb02d
|
Update README.md
|
2019-07-28 15:38:50 +02:00 |
|
|
4fac64de45
|
Update README.md
|
2019-07-28 15:32:13 +02:00 |
|
|
2b0c5350eb
|
Update README.md
|
2019-07-28 15:20:15 +02:00 |
|
|
8ae1cee785
|
Version 0.0.6
0.0.6
|
2019-07-28 15:12:38 +02:00 |
|
|
a28f9d997f
|
Add more text information, closes #8
|
2019-07-28 15:10:06 +02:00 |
|
|
c74c037d2a
|
Add xml formatter for xml output closes #1
|
2019-07-28 14:57:11 +02:00 |
|
|
255de1f3c2
|
Text information changes
- remove width, x from textline
- sort text lines before output
- add text font x,y position to json formatter
|
2019-07-28 14:36:31 +02:00 |
|
|
618500a269
|
Merge pull request #13 from vane/feature/bbox-calculate
Feature bbox calculate
|
2019-07-28 09:43:08 +02:00 |
|
|
dea317eda8
|
Add eslint standard with small modifications
- semi always
- comma-dangle always-multiple
|
2019-07-28 09:40:39 +02:00 |
|
|
4fd9b6024c
|
Fix json formatter / add width calculation
|
2019-07-27 10:09:37 +02:00 |
|
|
5c83ef37d6
|
Version 0.0.5
0.0.5
|
2019-07-26 00:41:01 +02:00 |
|
|
892a8c7bb1
|
Documentation update / move formatter, visitor to formatters, visitors
|
2019-07-26 00:38:51 +02:00 |
|
|
d1400175a9
|
Merge pull request #12 from vane/feature/bbox
VisitorText add more text handling methods placeholder
|
2019-07-26 00:06:04 +02:00 |
|
|
14819dd714
|
Fix JSON formatter
|
2019-07-26 00:04:52 +02:00 |
|
|
1aae2e7425
|
Remove not used Geometry classes
|
2019-07-26 00:04:40 +02:00 |
|
|
c3dbfc0e8b
|
Heuristic method to determine if difference between two points is space
|
2019-07-26 00:02:05 +02:00 |
|
|
877c14a6e1
|
Rewrite text extraction - calculate new line Closes #11
|
2019-07-25 23:00:04 +02:00 |
|
|
f5441748bf
|
Add vertical for font object
|
2019-07-25 01:19:19 +02:00 |
|
|
bac5ac0f48
|
Comment out newLine logic to move it to Extract
|
2019-07-25 01:08:35 +02:00 |
|
|
ffc7a38175
|
More refactoring of moving data between constructors
- move more data into PdfPage object
|
2019-07-25 00:49:23 +02:00 |
|
|
477539c527
|
VisitorText add more text handling methods placeholder
- remove stale debug constructor passing
- add PdfPage object
- change pdf page to pageData
- add some more attributes to TextObject
|
2019-07-24 23:15:45 +02:00 |
|
|
b75b5c73c6
|
Update README.md
|
2019-07-24 21:21:37 +02:00 |
|
|
2211abb068
|
Rename page pdf to pageData and add PdfPage object
|
2019-07-24 20:58:01 +02:00 |
|
|
38b46e0b94
|
Visitor refactoring, simplify code, add Geometry
- create visitor on each page
- pass dependencies and page in constructor
- add Geometry for text position measurements
|
2019-07-24 18:35:47 +02:00 |
|
|
20d839f447
|
Fix character spacing use -250 value for now (need to measure glyphs)
|
2019-07-23 19:45:33 +02:00 |
|
|
ac8e1c9d01
|
Update README.md move documentation link
|
2019-07-23 08:59:59 +02:00 |
|
|
4c969b79ae
|
Add example command to help
|
2019-07-23 08:58:34 +02:00 |
|
|
9b314bda4a
|
Update README.md
|
2019-07-23 08:56:13 +02:00 |
|
|
b58f364fa0
|
Update README.md
|
2019-07-23 08:55:39 +02:00 |
|
|
57e594402e
|
Update README.md with new todo / done
|
2019-07-23 06:21:52 +02:00 |
|
|
068f56db5c
|
Version 0.0.4
0.0.4
|
2019-07-23 06:18:00 +02:00 |
|
|
c2ebe7f526
|
Add todo to Extract
|
2019-07-23 06:12:55 +02:00 |
|
|
3810d2fcc0
|
Fix text extraction based on pdf.js samples
|
2019-07-23 06:04:55 +02:00 |
|
|
263b318029
|
Update README.md TODO list
|
2019-07-23 05:47:58 +02:00 |
|
|
30d673a455
|
Add missing OPS beginAnnotations, endAnnotations
|
2019-07-23 05:45:16 +02:00 |
|
|
9c2baab2a6
|
Fix Unimplmemented operator message
|
2019-07-23 05:44:02 +02:00 |
|
|
d0a2e44cdf
|
Update README.md with correct package url
|
2019-07-23 05:34:59 +02:00 |
|
|
10443b009b
|
Update package.json repository url
0.0.3
|
2019-07-23 05:28:39 +02:00 |
|
|
44e59dfe8f
|
Version 0.0.2
0.0.2
|
2019-07-23 05:16:41 +02:00 |
|
|
c2f597d899
|
Update README.md with documentation location and package.json keywords
|
2019-07-23 05:16:28 +02:00 |
|