Parse PDFs into markdown using Vision LLMs
Stars: 465
Forks: 66
Watchers: 465
Open Issues: 3
4f70377 Refactor whitespace in parser.py for improved readability; streamline image processing code and async calls.
3a5cee3 Update .gitignore to include .venv and uv.lock; remove uv.lock file as it is no longer needed.
af8a759 Add Pillow dependency for image processing; refactor whitespace in various files for consistency and readability.
d2cfbe9 Refactor LLM class to improve Gemini client integration; streamline template accessors and update test cases for markdown generation.
950c232 Add support for dynamic vision model discovery and refactor constants; introduce ModelDetector class for improved model capability detection.
81dc0c2 Update dependencies and documentation for Gemini model; remove DeepSeek references and examples.
c3580db Refactor LLM class to streamline Ollama client initialization and enhance concurrency support.
8b733f3 Replace PyMuPDF with pypdfium2 for PDF processing and update related code in vision_parse module.
b862daf Merge pull request #32 from iamarunbrahma/add_streamlit_app
6d93f52 Add Streamlit demo application for Vision Parse PDF extraction
512bee0 Merge pull request #31 from iamarunbrahma/updated_markdown_prompt
d3fa683 Refine markdown prompt template text extraction instructions