A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Stars
2.3k
Forks
370
Watchers
2.3k
Open Issues
5
Overall repository health assessment
No package.json found
This might not be a Node.js project
Merge pull request #23 from timgates42/bugfix_typo_specify
7b86a90View on GitHubupdated to be less strict about the expected number of columns
79da7e8View on GitHuballow to specify image source and target separately in common.set_page_image
66b26c1View on GitHubMerge branch 'master' of github.com:WZBSocialScienceCenter/pdftabextract
39cd5f8View on GitHubadded new function to clustering module: adjust_bad_positions
200946cView on GitHub