Found 882 repositories (showing 30)
ngxson
Real-time webcam demo with SmolVLM and llama.cpp server
mostlygeek
Reliable model swapping for any local OpenAI/Anthropic-compatible server - llama.cpp, vLLM, etc.
abi
Fully private LLM chatbot that runs entirely in the browser with no server needed. Supports Mistral and Llama 3.
waybarrios
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.
alexziskind1
Interactive launcher and benchmarking harness for llama.cpp server throughput, with tests, sweeps, and round‑robin load tools.
ardanlabs
Your personal engine for running open-source models locally. Use Go for hardware-accelerated local inference with llama.cpp directly integrated into your Go applications via the yzma module. Kronk provides a high-level API that feels similar to using an OpenAI-compatible API. Kronk also provides a model server to run local workloads.
iaalm
An OpenAI-API-compatible REST server for LLaMA.
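Servers like this one put a local LLaMA model behind the standard OpenAI chat-completions route. A minimal client sketch, assuming the server listens on `http://localhost:8000` and serves `/v1/chat/completions` (the host, port, and model name here are illustrative assumptions, not details from this listing):

```python
import json
import urllib.request

def build_chat_request(base_url: str, model: str, prompt: str) -> urllib.request.Request:
    """Build an OpenAI-style chat-completions request (constructed, not sent)."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.7,
    }
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request("http://localhost:8000", "llama-2-7b-chat", "Say hello.")
# With a server running, urllib.request.urlopen(req) would return the completion.
```

Because the route mirrors the OpenAI API, the official `openai` client library can usually be pointed at such a server by overriding its base URL.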
Najmul190
A Discord chatbot / selfbot that lets users talk to AI powered by the Groq API (Meta Llama 3), or use your own ChatGPT API key. The AI runs on a genuine Discord account, not a bot account, so it can be added to any server without any permissions. Try it out at: https://discord.gg/yUWmzQBV4P
trzy
LLaVA server (llama.cpp).
GobinFan
MCP server for querying the technical documentation of mainstream agent frameworks (supports both stdio and SSE transport protocols). Supports langchain, llama-index, autogen, agno, openai-agents-sdk, mcp-doc, camel-ai, and crew-ai.
nuance1979
LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.
ortegaalfredo
Native GUI for several AI services plus local llama.cpp AIs.
thad0ctor
No description available
lordmathis
Unified management and routing for llama.cpp, MLX and vLLM models with web dashboard.
run-llama
An MCP server connecting to managed indexes on LlamaCloud.
avilum
A client/server for LLaMA (Large Language Model Meta AI) that can run ANYWHERE.
yazon
🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GPU support
nicknochnack
An end-to-end walkthrough of LLaMA CPP's server.
herrera-luis
Demo Python script to interact with a llama.cpp server using the Whisper API, microphone, and webcam devices.
willbnu
Configs, launchers, benchmarks, and tooling for running Qwen3.5 GGUF models locally with llama.cpp on a 16GB NVIDIA GPU
llm-use
LLM orchestration toolkit for agent workflows: planner + workers + synthesis, optional router (LLM + learned fallback), supports OpenAI/Anthropic/Ollama/llama.cpp, real scraping with caching, MCP server integration, and a TUI chat UI.
SamuelTallet
A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux.
vmlinuzx
One-stop shop: local-first RAG stack with intelligent polyglot code/docs, remote code execution, local llama enrichment, progressive-disclosure tools, MCP server, and sandboxed security.
m18coppola
No-messing-around sh client for llama.cpp's server
simonw
LLM plugin for interacting with llama-server models
kurnevsky
A client for the llama-cpp server.
jhud
An easily-trained baby GPT that can stand in for the real thing. Based on Andrej Karpathy's makemore, but set up to mimic a llama-cpp server. This is not production-ready; it's a toy implementation for educational purposes.
AI-powered voice-calling assistant using Twilio as the telephony server and Meta LLaMA as the agent model.
hwpoison
A lightweight chat terminal-interface for llama.cpp server written in C++ with many features and windows/linux support.
Use two different methods (DeepSpeed and the SageMaker model parallelism library) to fine-tune a LLaMA model on SageMaker, then deploy the fine-tuned model on SageMaker with server-side batching.