Back to search
A utility that extracts text from images or PDFs using a local or remote OpenAI-compatible LLM API endpoint with vision-capable multimodal models. For PDFs, each page is rendered to an image and processed sequentially; outputs are concatenated into a single Markdown document.
Stars
31
Forks
3
Watchers
31
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
5
commits