Found 150 repositories (showing 30)
Maximilian-Winter
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It lets users chat with LLMs, execute structured function calls, and get structured output. It also works with models not fine-tuned for JSON output and function calls.
huggingface
HF CLI extension to run a local coding agent powered by llmfit and llama.cpp
EfficientContext
Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, OpenClaw, RAG, and Agentic AI.
llm-use
LLM orchestration toolkit for agent workflows: planner + workers + synthesis, optional router (LLM + learned fallback), supports OpenAI/Anthropic/Ollama/llama.cpp, real scraping with caching, MCP server integration, and a TUI chat UI.
bjoern-buettner
Repository of an AI agent to role-play with, currently based on llama.cpp or beam.cloud and a fine-tuned Mistral Instruct v0.3
CuaOS
A CUA (computer use agent) system for Ubuntu computers that uses the Qwen3-VL model in GGUF format to perform tasks on your behalf with the keyboard and mouse in a local sandbox environment, based on the commands you provide.
slb350
Rust SDK for building AI agents with local OpenAI-compatible servers (LMStudio, Ollama, llama.cpp, vLLM). Features streaming, tools, hooks, retry logic, and comprehensive examples.
koron
A client for the C3TR Agent for Japanese-English and English-Japanese translation running on llama.cpp
jbulger82
An upgraded llama.cpp GUI (https://github.com/ggml-org): a local-first, cloud-model, multi-agent command center with RAG, MCP tools, browser automation, voice, and multi-provider orchestration. Demo: https://llamahub.netlify.app/
woheller69
Simple chat interface for local AI using llama-cpp-python and llama-cpp-agent
Rafaelmdcarneiro
FreeGenius AI is an advanced AI assistant that can talk and take multi-step actions. It supports numerous open-source LLMs via Llama.cpp, Ollama, or the Groq Cloud API, with optional integration with AutoGen agents, the OpenAI API, Google Gemini Pro, and unlimited plugins.
crimson-knight
A CLI-powered agent that uses llama.cpp on your system to run GGUF models
caioross
The Trebuchet Framework is a robust infrastructure for creating, running, and orchestrating autonomous AI agents, designed specifically for local hardware. Unlike solutions that depend exclusively on cloud APIs, Trebuchet prioritizes privacy and local performance, using llama-cpp-python for LLM inference and chroma
DmyMi
Kotlin Multiplatform application showcase with llama.cpp on-device LLM inference and a Koog.ai agent
pabl-o-ce
Chat with DuckDuckGo Agent using llama.cpp
Maximilian-Winter
No description available
opensecurity
A 100% local, containerized AI coding agent powered by pi and llama.cpp. Run private LLMs on CPU or NVIDIA GPU without external API dependencies.
Life-Ambassadors-International
🌌 Fully Autonomous Offline AI iOS System • Local LLM Integration (llama.cpp + MLX) • SwiftUI Agentic Interface • GitHub Actions Auto-Build • Complete Sovereign AI for iPhone 14+
PrinceNanChan
Modular local AI Call Agent with voice-to-text, LLM, and TTS support. Built with FastAPI, Llama.cpp, Faster Whisper, Coqui TTS, and Twilio.
fabiomatricardi
Run AI agents with llama-cpp-agents locally
WayneCider
Your Own Personal Jean-Luc — a local AI coding agent powered by llama.cpp. No cloud, no API keys.
zabarich
The simplest agent shell for your own models. Backend-agnostic terminal coding agent for Ollama, llama.cpp, vLLM, or any OpenAI-compatible endpoint. (c) Danucore
EliasOenal
Agents that won't leak your company data. Multi-user runtime with firejail sandboxes, segmented networking, OS-level isolation. Real shell access — boundaries enforced by the sandbox, not the model. Per-user credentials. Web UI. Self-hosted with vLLM/llama.cpp. BSD License.
cschladetsch
CppDeepSeek is a C++20 agent runtime that defaults to local inference (via llama.cpp + GGUF), supports DeepSeek's hosted API as a fallback, and includes an explicit Logic Gate to enforce policy. It runs on Linux/WSL2 with CUDA and Mac with Metal acceleration.
UPtrimOfficial
UPtrim gives your local AI a real memory. It sits between your chat app and your AI, remembering who you are, what you've talked about, and what matters to you — across every conversation. Multi-user support, smart context management, file uploads, agent mode, and a full dashboard. Works with Open WebUI, SillyTavern, llama.cpp, Ollama, and more.
SolidRusT
Agentic Chat using llama-cpp-agent
dinubs
A coding agent designed to work with llama.cpp servers
Dhyanesh18
A starter template for building powerful, local, tool-calling LLM agents using LangGraph and llama-cpp-python
Ishabdullah
V3AM FOB - Termux/Android port. Multi-agent AI platform running from Termux with llama.cpp, browser-based dashboard, and full fleet management.
BittnerPierre
Multi‑agent research AI workflow with cloud API and llama.cpp support, OpenAI vector_search or ChromaDB retrieval, Docker stacks (local & NVIDIA DGX Spark), and model benchmarking.