Ruby document parsing toolkit with zero runtime dependencies. Parse PDFs, DOCX, XLSX, and images (with OCR) using a single, lightweight gem. Statically links MuPDF and Tesseract at compile time for hassle-free installation - no system libraries or external tools required.
Stars
45
Forks
1
Watchers
45
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
Merge pull request #31 from scientist-labs/dependabot/cargo/ext/parsekit/tesseract-rs-0.2
7acf882View on GitHubMerge pull request #30 from scientist-labs/dependabot/github_actions/actions/checkout-6
e9c00e6View on GitHubUpdate tesseract-rs requirement from 0.1 to 0.2 in /ext/parsekit
b61172aView on GitHubMerge pull request #29 from scientist-labs/add-release-workflow
160376bView on GitHubMerge pull request #28 from scientist-labs/dependabot/cargo/ext/parsekit/calamine-0.34
5ba7259View on GitHubUpdate calamine requirement from 0.33 to 0.34 in /ext/parsekit
8ddfc04View on GitHubUpdate zip requirement from 5.0 to 8.0 in /ext/parsekit (#27)
e2445a5View on GitHubUpdate calamine requirement from 0.31 to 0.33 in /ext/parsekit (#26)
cf618b5View on GitHubUpdate mupdf requirement from 0.5 to 0.6 in /ext/parsekit (#23)
bc14ba7View on GitHubUpdate quick-xml requirement from 0.38 to 0.39 in /ext/parsekit (#21)
9019d06View on GitHub