Found 347,425 repositories (showing 30)
microsoft
Python tool for converting files and office documents to Markdown.
storybookjs
Storybook is the industry standard workshop for building, documenting, and testing UI components in isolation
PaddlePaddle
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
redis
For developers building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.
opendatalab
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
docling-project
Get your documents ready for gen AI
zylon-ai
Interact with your documents using the power of GPT, 100% privately, no data leaks
run-llama
LlamaIndex is the leading document agent and OCR platform
paperless-ngx
A community-supported supercharged document management system: scan, index and archive all your documents
carbon-language
Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)
surrealdb
A scalable, distributed, collaborative, document-graph database, for the realtime web
alibaba
Produced by the Alibaba Cloud computing platform DataWorks (https://help.aliyun.com/document_detail/137663.html) team: a database connection pool built for monitoring.
Cinnamon
An open-source RAG-based tool for chatting with your documents.
VectifyAI
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
doczjs
✍ It has never been so easy to document your things!
PromtEngineer
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
openai
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
enescingoz
280+ free n8n automation templates — ready-to-use workflows for Gmail, Telegram, Slack, Discord, WhatsApp, Google Drive, Notion, OpenAI, and more. AI agents, RAG chatbots, email automation, social media, DevOps, and document processing. The largest open-source n8n template collection.
ether
Etherpad: A modern really-real-time collaborative document editor.
arc53
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
nfl
A document head manager for React
tc39
Status, process, and documents for ECMA-262
Unstructured-IO
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking and embedding.
sbilly
A collection of awesome software, libraries, documents, books, resources and cool stuff about security.
arangodb
🥑 ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.
QuestPDF
QuestPDF is a modern library for PDF document generation. Its fluent C# API lets you design complex layouts with clean, readable code. Create documents using a flexible, component-based approach.
Tencent
LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.
star7th
ShowDoc is a tool well suited to IT teams for sharing documents online, including API documentation and technical documentation.
getomni-ai
OCR & Document Extraction using vision models
h2oai
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/