Commit Graph

113 Commits

Author SHA1 Message Date
34e04ae212 Update README.md with latest changes about password parameter 2020-11-20 21:41:37 +01:00
edb7ddd318 Add ability to provide password as command line argument closes #16
- add catching errors and displaying formatted output ( optional stacktrace for debug)
- bump pdfjs-dist and minimist to latest version
- use 'pdfjs-dist/es5/build/pdf' for node v14.5.0
2020-11-20 21:39:09 +01:00
4f2e28b557 Fix missing pdf font object toUnicode 2020-11-20 21:36:06 +01:00
e0f7cedd76 Update README.md with link to difference between last release and master 2019-08-17 02:50:41 +02:00
f1e6fbfcf7 Fix Extract text bounding box position 2019-08-17 02:45:47 +02:00
3edb27c12a Version 0.0.8 0.0.8 2019-08-10 06:24:05 +02:00
b1ad478e85 Fix json formatting - ability to omit fonts 2019-08-10 06:21:45 +02:00
45c70c7d7d
Update README.md
- add TODO items for text recognition
2019-08-05 23:22:14 +02:00
0604c419e5 Add more information to FontObject
- opentype.js font
- loaded pdf font name
- prepare for ocr of font data
2019-08-05 23:11:07 +02:00
7df0d1543d Change formatter argument from 'text' to 'txt' 2019-08-05 23:09:33 +02:00
f3ab15d374 Disable console output for text formatter 2019-08-05 23:08:40 +02:00
1c3c5461d5
Reformat command line utility and add version information 2019-07-29 00:02:03 +02:00
89bc5a9657
Update package.json keywords to match github project 2019-07-28 22:12:54 +02:00
eeef0bd126
Version 0.0.7 0.0.7 2019-07-28 22:08:45 +02:00
2b1cf8c08f
Add html output to demo.sh 2019-07-28 22:08:22 +02:00
753a8afdb5 Add font file information to formatters xml / json and more about font
- optimise font extract
- set unknown style to normal
2019-07-28 22:05:54 +02:00
0395eb34e2 Remove some information about font 2019-07-28 21:49:22 +02:00
1a6b16b53c
Format font from font cache for xml / json formatter
closes #14
2019-07-28 21:43:53 +02:00
6788ac6093
Add opentype.js and extract more font information, closes #5 2019-07-28 20:57:29 +02:00
90e9e04153
Add title from metadata to FormatterHTML output 2019-07-28 19:41:49 +02:00
2bf5a5eb54 dummy 2019-07-28 19:31:15 +02:00
a85a55a6da
Update README.md 2019-07-28 19:26:29 +02:00
09b80bd792 Merge branch 'master' of github.com:vane/pdf-gold-digger 2019-07-28 19:25:49 +02:00
e4f44a5642 Add FormatterHTML for html output, closes #15 2019-07-28 19:25:37 +02:00
1a4751caf1
Update README.md 2019-07-28 17:45:42 +02:00
dedc3cb12e
Update LICENSE 2019-07-28 17:44:45 +02:00
ea6a658d9c
Update README.md 2019-07-28 17:44:10 +02:00
9b0574777f Update demo.sh with xml output and extract font 2019-07-28 17:34:16 +02:00
a115debd41 Update package.json keywords to match github project 2019-07-28 17:32:55 +02:00
fd9058c099
Update README.md 2019-07-28 17:28:08 +02:00
0519406a25 Add image data information to output xml, json formatters 2019-07-28 17:27:02 +02:00
e8685c78af Merge branch 'master' of github.com:vane/pdf-gold-digger 2019-07-28 16:53:26 +02:00
ca3071c0a0 dummy 2019-07-28 16:53:20 +02:00
af25b4ad9d
Update README.md 2019-07-28 16:52:28 +02:00
9d4672ccb8
Update README.md 2019-07-28 16:46:31 +02:00
2434562da0 Save font files as ttf option, closes #3 2019-07-28 16:44:00 +02:00
f60eebb02d
Update README.md 2019-07-28 15:38:50 +02:00
4fac64de45
Update README.md 2019-07-28 15:32:13 +02:00
2b0c5350eb
Update README.md 2019-07-28 15:20:15 +02:00
8ae1cee785 Version 0.0.6 0.0.6 2019-07-28 15:12:38 +02:00
a28f9d997f Add more text information, closes #8 2019-07-28 15:10:06 +02:00
c74c037d2a Add xml formatter for xml output closes #1 2019-07-28 14:57:11 +02:00
255de1f3c2 Text information changes
- remove width, x from textline
- sort text lines before output
- add text font x,y position to json formatter
2019-07-28 14:36:31 +02:00
618500a269
Merge pull request #13 from vane/feature/bbox-calculate
Feature bbox calculate
2019-07-28 09:43:08 +02:00
dea317eda8 Add eslint standard with small modifications
- semi always
- comma-dangle always-multiple
2019-07-28 09:40:39 +02:00
4fd9b6024c Fix json formatter / add width calculation 2019-07-27 10:09:37 +02:00
5c83ef37d6 Version 0.0.5 0.0.5 2019-07-26 00:41:01 +02:00
892a8c7bb1 Documentation update / move formatter, visitor to formatters, visitors 2019-07-26 00:38:51 +02:00
d1400175a9
Merge pull request #12 from vane/feature/bbox
VisitorText add more text handling methods placeholder
2019-07-26 00:06:04 +02:00
14819dd714 Fix JSON formatter 2019-07-26 00:04:52 +02:00