Found 218 repositories(showing 30)
varunvasudeva1
End-to-end documentation to set up your own local & fully private LLM server on Debian. Equipped with chat, web search, RAG, model management, MCP servers, image generation, and TTS.
shinpr
Local-first RAG server for developers. Semantic + keyword search for code and technical docs. Works with MCP or CLI. Fully private, zero setup.
omar-haris
An extensible Model Context Protocol (MCP-Local-MRL-RAG-AST) server that provides intelligent semantic code search for AI assistants. Built with local AI models, inspired by Cursor's semantic search.
AnalyseDeCircuit
All-in-one terminal workspace โ local shells, SSH, SFTP, remote IDE, AI agent, and file manager in a single native binary. Built with Tauri 2 and pure Rust SSH (no OpenSSL). Smart reconnect, MCP, RAG, plugins, 30+ themes, 11 languages.
nkapila6
"primitive" RAG-like web search model context protocol (MCP) server that runs locally. โจ no APIs โจ
Yuchen20
๐ง ๐ด๐๐๐๐๐-๐ท๐๐๐ is a lightweight, local RAG memory store for MCP agents. Easily record, retrieve, update, delete, and visualize persistent "memories" across sessionsโperfect for developers working with multiple AI coders (like Windsurf, Cursor, or Copilot) or anyone who wants their AI to actually remember them.
lyonzin
Local RAG System for Claude Code โ Hybrid search + Cross-encoder Reranking + Markdown-aware Chunking + 12 MCP Tools. No external servers, pure ONNX in-process.
MobilaName
No description available
vmlinuzx
One stop shop - Local-first RAG stack with intelligent polyglot-code/docs, remote code execution, local llama enrichment, progressive disclosure tools, mcp server, sandboxed security.
doITmagic
Privacy-first semantic code navigation MCP server using RAG. Features deep AST multi-language support (Go, PHP/Laravel/WP, JS/TS/React, Python), 100% local LLMs (Ollama), and vector search (Qdrant) for AI IDEs like Cursor, Windsurf, Copilot, and Claude.
msjsc001
Local-first RAG desktop app & official MCP Server. Let any AI instantly search your private Markdown, PDF, and 1290+ document formats. (ๆฌๅฐไผๅ ็ RAG ๆก้ข็ซฏไธๅฎๆน MCP ๆๅกๅจใ่ฎฉไปปๆ AI ็ฌ้ดๆฃ็ดขไฝ ็็งๆ MarkdownใPDF ๅ 1290+ ็งๆๆกฃๆ ผๅผใ)
ksaritek
๐ฆ High-performance local RAG server in Rust that integrates with Claude Desktop via MCP. Search PDF documents privately using Ollama embeddings - no external API calls.
SaM-92
This repository demonstrates how to use AutoGen to integrate local and remote MCP (Model Context Protocol) servers. It showcases a local math tool (math_server.py) using Stdio and a remote Apify tool (RAG Web Browser Actor) via SSE for tasks like arithmetic and web browsing.
takeshy
All-in-one local AI hub for Obsidian โ LLM chat with vault tools, MCP servers, RAG, workflow automation, encryption, and edit history. Fully private, no cloud required.
renl
No description available
jardhel
Local Codebase RAG MCP Server for Claude Code - Proactive semantic indexing with AST-based chunking
jbulger82
An upgraded llama.cpp GUI (https://github.com/ggml-org) local-first cloud model llama.cpp GUI multi agent command center with RAG, MCP tools, browser automation, voice, and multi-provider orchestration. Demo Here https://llamahub.netlify.app/
juanqui
A PDF document RAG MCP that is easy to setup, supports completely local parsing and embedding, hybrid search, and semantic chunking. Also features an optional web interface.
swordfeng
Yorishiro: Extract character souls from films/novels, generate SOUL.md for AI roleplay. Dual-layer memory (SOUL.md + RAG), knowledge boundary filtering, split-persona support, MCP-ready. Works with cloud or local models.
david-franz
Intelligent context management for AI coding assistants. Indexes your codebase with tree-sitter, builds a semantic knowledge graph, and serves 12 tools over MCP. Hybrid RAG combines keyword search, vector embeddings, and graph traversal for precise retrieval. Works with Claude Desktop, Cursor, and any MCP client. Fully local with Ollama.
ca-srg
CLI tool for building production RAG systems from Markdown, CSV, and PDF documents using hybrid search (BM25 + vector) with OpenSearch. Features MCP server, Slack bot, Web UI, multi-source ingestion (local/S3/GitHub), and multi-provider embeddings (Bedrock/Gemini).
Soul-XuYang
This is a multi-agent processing system with long-term and short-term memory+mcp+local tools+Agentic Rag. It uses the LLM locally deployed by ollma and PostgreSQL to store user's data, and the back-end uses grpc,gorm and gin to coordinate the output of the agent.
davidvictoria
๐ญ AI agents at the edge: Build for offline, scale in cloud. Strands Agents SDK demos with IoT control, RAG, MCP database, and local/cloud model switching.
Razpines
Local Unity docs RAG + MCP server for coding agents, with versioned offline citations.
patakuti
A semantic search and retrieval system for local documents using vector embeddings. Powered by MCP (Model Context Protocol).
IceWhaleTech
๐ ToolFS: A FUSE virtual filesystem for AI Agents, integrating memory, RAG & local data access with flexible MCP/tool chaining and a scalable plugin system
tsunamayo7
All-in-one AI chat studio โ 7 providers (Ollama, Claude, OpenAI, vLLM, Claude Code, Codex, Gemini CLI), RAG knowledge base, MCP tool integration, Mem0 shared memory, and 3-step pipeline. 100% local-capable. MIT licensed.
sebastianhutter
A simple local RAG with sqlite and ollama. All data is kept local, MCP exposes it for further use.
ovitrac
๐งฌ RAGIX: Local-first development assistant making LLMs behave like disciplined engineers โ Unix-RAG retrieval, sandboxed execution, MCP-compatible, fully auditable
ConfidentialMind
FastMCP client agent with PostgreSQL and RAG MCP servers in dual mode: API/HTTP or local/stdio