Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
Stars
654
Forks
52
Watchers
654
Open Issues
7
Overall repository health assessment
No package.json found
This might not be a Node.js project
100
commits
97
commits
68
commits
17
commits
1
commits
1
commits
1
commits