Kubernetes operator for local LLM inference with llama.cpp, vLLM, and TGI - multi-GPU, autoscaling, air-gapped, production-ready
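A minimal sketch of what deploying a model through this operator might look like. The `InferenceService` kind, `GPUShardingSpec` layer splits, KV cache type, and `extraArgs` escape hatch are all named in the commit log below, but the apiVersion, field names, and values here are assumptions, not taken from the repo's actual CRD:

```yaml
# Hypothetical InferenceService manifest (all fields illustrative).
apiVersion: inference.example.com/v1alpha1   # assumed API group/version
kind: InferenceService                       # kind referenced in commit #260
metadata:
  name: chat-model
spec:
  runtime: llama.cpp            # vLLM and TGI backends added in #273
  model:
    # Pulling from a local registry fits the "air-gapped" claim above.
    uri: oci://registry.local/models/chat-model:latest
  gpuSharding:                  # custom layer splits, per #267
    layerSplit: [16, 16]
  kvCacheType: q8_0             # KV cache type configuration, per #256
  extraArgs: ["--ctx-size", "8192"]   # escape hatch, per #256
  resources:
    limits:
      nvidia.com/gpu: 2         # multi-GPU serving
```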
Stars: 44
Forks: 7
Watchers: 44
Open Issues: 19
Overall repository health assessment
No package.json found: this is not a Node.js project (the dependency bump in #266 below indicates a Go codebase).
Commits: 155 · 30 · 25 · 3
Recent commits:
441c7c7  feat: add vLLM and TGI runtime backends with per-runtime HPA metrics (#273)
2b1c948  feat: add first-class PersonaPlex (Moshi) runtime backend (#272)
bb1576c  feat: add pluggable runtime backends for non-llama.cpp inference engines (#271)
be376c6  feat: add Grafana inference metrics dashboard (#269)
5c059a4  feat: separate image registry from repository in Helm chart (#268)
a37701c  feat: support custom layer splits from GPUShardingSpec (#267)
cc9a95e  feat!: update default CUDA image to server-cuda13 for Qwen3.5 and Blackwell support (#262)
c7c97b2  chore(deps): bump go.opentelemetry.io/otel/exporters/otlp/otlptrace/otlptracegrpc from 1.42.0 to 1.43.0 (#266)
2d16502  feat: add HPA autoscaling for InferenceService (#260)
6148b89  feat: add Ollama as runtime backend for Metal agent (#258)
eaf9045  feat: add oMLX as alternative runtime backend for Metal agent (#257)
7a4b855  feat: add KV cache type configuration and extraArgs escape hatch (#256)
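Commits #260 and #273 above add HPA autoscaling for InferenceService with per-runtime metrics. A sketch of how such scaling is typically wired in Kubernetes, using the standard autoscaling/v2 HorizontalPodAutoscaler against a custom per-pod metric; the Deployment name and metric name are assumptions, not the operator's actual names:

```yaml
# Standard autoscaling/v2 HPA scaling an operator-managed Deployment on a
# custom Pods metric. Names are illustrative; the operator may generate an
# equivalent object itself from the InferenceService spec.
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: chat-model-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: chat-model            # assumed operator-managed Deployment
  minReplicas: 1
  maxReplicas: 4
  metrics:
    - type: Pods
      pods:
        metric:
          name: inference_queue_depth   # hypothetical per-runtime metric
        target:
          type: AverageValue
          averageValue: "8"
```

Serving such a custom metric requires a metrics adapter (e.g. prometheus-adapter) exposing it through the custom.metrics.k8s.io API; the Grafana dashboard from #269 suggests the runtimes already export Prometheus metrics that could back it.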