Found 43 repositories(showing 30)
SamuelTallet
A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux.
turiPO
Latest llama.cpp in a containerized server + langchain support
allenporter
Docker images for easier running of llama-cpp-python server
ZbigniewTomanek
Docker image for Lllama2 server which works out of the box on the GPU
kth8
llama.cpp server + small language model in Docker container
bryan-pakulski
This project is build off of the lit-llama project found here: https://github.com/Lightning-AI/lit-llama The main functionality improvment is encapsulating the LLAMA into a docker container with a client / server message queue for interaction.
maragudk
llama-server in Docker.
gtrias
Docker setup for llama.cpp server with router mode, supporting multiple GGUF models with lazy loading
DehydratedWater
Project starter for building Airflow that integrates with local llama.cpp server
DehydratedWater
Docker template for connecting Airflow with dockerized llama.cpp servers
darshpanchal
Serving open source models of your choice in as a docker container using llama-cpp-python's OpenAI compatible server
UJJWALTHAKUR28
DeployIQ is a application that simplifies deploying open-source LLMs (like Llama 3, Mistral, and Falcon) on cloud platforms (AWS, GCP, Azure), bare-metal servers, or local Docker environments, with automated GPU provisioning, scaling, and secure API exposure, while integrating monitoring and logging for seamless inference management.
moelsaka01
Egyptian AI Assistant is a lightweight local chatbot that runs fully offline using llama-server and a GGUF model. It includes a modern Next.js chat UI, a FastAPI backend, and Docker support. The backend enforces Egyptian Arabic responses, The backend enforces Egyptian Arabic responses with optional voice endpoints that can be enabled when needed.
S71Ches
No description available
lucknutlealucky
No description available
boyqiu-001
No description available
h4sh-tqgs
No description available
llama.cpp CUDA Server Docker Image
a-tabaza
Docker Image for Llama.cpp Server
lalithchandran
CUDA-accelerated llama.cpp Docker inference server
A dockerized option to benchmark your Llama.cpp server.
DehydratedWater
No description available
furiousteabag
Simple FastAPI server wrapped in Nvidia docker for Llama inference
PolRibas
Docker Server for run meta-llama/Llama-3.3-70B-Instruct with a HUGGINGFACE_API_KEY
PolRibas
Docker Server for run meta-llama/Llama-3.1-8B-Instruct with a HUGGINGFACE_API_KEY
amxela435
Find, download model from HF and change model in llama.cpp server (Docker)
UnitVectorY-Labs
Built from upstream llama.cpp as a CUDA-enabled llama-server container for local LLM inference with a simple, reproducible Docker workflow.
brocolidata
A Dev Container + Docker Compose setup to easily run a crewai using llama.cpp server
sigilmakes
Run llama.cpp server in Docker with GPU support. Model-independent, swap models by changing one line.
Javacosta
A project to deploy a llama.cpp server with a web UI to AWS using Docker and Terraform.