Found 5 repositories (showing 5)
BittnerPierre
Multi-agent AI research workflow with cloud API and llama.cpp support, retrieval via OpenAI vector_search or ChromaDB, Docker stacks (local & NVIDIA DGX Spark), and model benchmarking.
nerdpudding
Local LLM serving made manageable: llama.cpp in Docker with model profiles, interactive dashboard, benchmarking, and integration with Claude Code and AI tools.
shamily
Dockerized inference server and benchmarks for Gemma 4 26B on the NVIDIA DGX Spark (GB10). Features ARM64 CUDA 13 builds using llama.cpp.
A dockerized option to benchmark your llama.cpp server.
shuvanon
Run and benchmark Large Language Models (LLMs) locally with llama.cpp on GPU (Docker + WSL2). Includes helper scripts, quantisation benchmarks, and an OpenAI-compatible API server.
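Several of these repositories serve llama.cpp behind an OpenAI-compatible API (llama.cpp's `llama-server` exposes `/v1/chat/completions`). A minimal client sketch using only the Python standard library; the `http://localhost:8080` base URL and the `model` value are assumptions, since the actual port and model name depend on each repo's Docker configuration:

```python
import json
import urllib.request


def build_chat_request(base_url: str, model: str, prompt: str) -> urllib.request.Request:
    """Build a POST request for an OpenAI-compatible /v1/chat/completions endpoint.

    base_url and model are assumptions: llama-server listens on
    localhost:8080 by default, and typically serves whatever model
    it was launched with regardless of the `model` field.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Usage (requires a running server, so not executed here):
# req = build_chat_request("http://localhost:8080", "local", "Hello")
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```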
All 5 repositories loaded