Search Results

Found 43 repositories(showing 30)

alpine-llama-cpp-server

SamuelTallet

💛70

A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux.

MIT

Dockerfile

Updated 6 hours ago

llamacpp-docker-server

turiPO

❤️40

Latest llama.cpp in a containerized server + langchain support

MIT

Python

Updated 6 months ago

llama-cpp-server

allenporter

❤️25

Docker images for easier running of llama-cpp-python server

Apache-2.0

Roff

Updated 10 months ago

llama2-server-docker-gpu

ZbigniewTomanek

❤️40

Docker image for Lllama2 server which works out of the box on the GPU

Apache-2.0

Python

Updated 2 years ago

llama-server

kth8

🧡55

llama.cpp server + small language model in Docker container

Apache-2.0

Dockerfile

Updated 4 weeks ago

This project is build off of the lit-llama project found here: https://github.com/Lightning-AI/lit-llama The main functionality improvment is encapsulating the LLAMA into a docker container with a client / server message queue for interaction.

C++

Updated 2 years ago

llama-server

maragudk

❤️20

llama-server in Docker.

MIT

Shell

Updated 1 year ago

llama-server-docker

gtrias

🧡50

Docker setup for llama.cpp server with router mode, supporting multiple GGUF models with lazy loading

Shell

Updated 1 week ago

airflow-docker-with-llamacpp-server

DehydratedWater

❤️35

Project starter for building Airflow that integrates with local llama.cpp server

Apache-2.0

Python

Updated 2 years ago

multi_llm_airflow_docker_template

DehydratedWater

❤️35

Docker template for connecting Airflow with dockerized llama.cpp servers

Apache-2.0

Python

Updated 2 years ago

llm-server

darshpanchal

❤️35

Serving open source models of your choice in as a docker container using llama-cpp-python's OpenAI compatible server

Dockerfile

Updated 1 year ago

llama-cpp-pythonllmopenai+1

DeployIQ

UJJWALTHAKUR28

❤️20

DeployIQ is a application that simplifies deploying open-source LLMs (like Llama 3, Mistral, and Falcon) on cloud platforms (AWS, GCP, Azure), bare-metal servers, or local Docker environments, with automated GPU provisioning, scaling, and secure API exposure, while integrating monitoring and logging for seamless inference management.

TypeScript

Updated 1 year ago

Egyptian-AI-Assistant

moelsaka01

🧡60

Egyptian AI Assistant is a lightweight local chatbot that runs fully offline using llama-server and a GGUF model. It includes a modern Next.js chat UI, a FastAPI backend, and Docker support. The backend enforces Egyptian Arabic responses, The backend enforces Egyptian Arabic responses with optional voice endpoints that can be enabled when needed.

NOASSERTION

Updated 4 weeks ago

llama-server-docker

S71Ches

❤️30

No description available

MIT

Python

Updated 10 months ago

llama_server_docker

lucknutlealucky

❤️25

No description available

Updated 6 months ago

translategemma-llama-server-docker

boyqiu-001

❤️45

No description available

PowerShell

Updated 4 weeks ago

dgxspark-docker-llama_server-bonsai

h4sh-tqgs

🧡60

No description available

MIT

Dockerfile

Updated 12 hours ago

llama-server-cuda

kth8

❤️40

llama.cpp CUDA Server Docker Image

Apache-2.0

Python

Updated 10 months ago

llamacpp_server

a-tabaza

❤️35

Docker Image for Llama.cpp Server

Dockerfile

Updated 6 months ago

llama-cpp-docker

lalithchandran

❤️45

CUDA-accelerated llama.cpp Docker inference server

Python

Updated 1 month ago

llama-cpp-docker-llm-cuda-gguf-gpu-ai

llama-cpp-docker-benchmark

drohbo

❤️35

A dockerized option to benchmark your Llama.cpp server.

Dockerfile

Updated 7 months ago

llama_cpp_qwen-235B-A22B-docker-server

DehydratedWater

❤️30

No description available

Apache-2.0

Python

Updated 8 months ago

llama-inference

furiousteabag

❤️35

Simple FastAPI server wrapped in Nvidia docker for Llama inference

Python

Updated 2 years ago

llama_33_70B_intruct

PolRibas

❤️40

Docker Server for run meta-llama/Llama-3.3-70B-Instruct with a HUGGINGFACE_API_KEY

MIT

Python

Updated 1 year ago

llama_31_8B_instruct

PolRibas

❤️40

Docker Server for run meta-llama/Llama-3.1-8B-Instruct with a HUGGINGFACE_API_KEY

MIT

Python

Updated 1 year ago

Find_download_model_HF-

amxela435

❤️45

Find, download model from HF and change model in llama.cpp server (Docker)

Python

Updated 2 months ago

docker-llama.cpp-cuda

UnitVectorY-Labs

🧡65

Built from upstream llama.cpp as a CUDA-enabled llama-server container for local LLM inference with a simple, reproducible Docker workflow.

MIT

Dockerfile

Updated 2 days ago

llm_agents_lab

brocolidata

❤️30

A Dev Container + Docker Compose setup to easily run a crewai using llama.cpp server

Python

Updated 8 months ago

llama-docker

sigilmakes

🧡65

Run llama.cpp server in Docker with GPU support. Model-independent, swap models by changing one line.

Shell

Updated 5 days ago

llama.cpp-cloud-deployment

Javacosta

❤️35

A project to deploy a llama.cpp server with a web UI to AWS using Docker and Terraform.

HTML

Updated 5 months ago

GitHub Explorer

Search Results

alpine-llama-cpp-server

llamacpp-docker-server

llama-cpp-server

llama2-server-docker-gpu

llama-server

litty-llm

llama-server

llama-server-docker

airflow-docker-with-llamacpp-server

multi_llm_airflow_docker_template

llm-server

DeployIQ

Egyptian-AI-Assistant

llama-server-docker

llama_server_docker

translategemma-llama-server-docker

dgxspark-docker-llama_server-bonsai

llama-server-cuda

llamacpp_server

llama-cpp-docker

llama-cpp-docker-benchmark

llama_cpp_qwen-235B-A22B-docker-server

llama-inference

llama_33_70B_intruct

llama_31_8B_instruct

Find_download_model_HF-

docker-llama.cpp-cuda

llm_agents_lab

llama-docker

llama.cpp-cloud-deployment

alpine-llama-cpp-server

llamacpp-docker-server

llama-cpp-server

llama2-server-docker-gpu

llama-server

litty-llm

llama-server

llama-server-docker

airflow-docker-with-llamacpp-server

multi_llm_airflow_docker_template

llm-server

DeployIQ

Egyptian-AI-Assistant

llama-server-docker

llama_server_docker

translategemma-llama-server-docker

dgxspark-docker-llama_server-bonsai

llama-server-cuda

llamacpp_server

llama-cpp-docker

llama-cpp-docker-benchmark

llama_cpp_qwen-235B-A22B-docker-server

llama-inference

llama_33_70B_intruct

llama_31_8B_instruct

Find_download_model_HF-

docker-llama.cpp-cuda

llm_agents_lab

llama-docker

llama.cpp-cloud-deployment