First open-source TurboQuant KV cache compression for LLM inference. Drop-in for HuggingFace. pip install turboquant.
Stars: 20 · Forks: 5 · Watchers: 20 · Open Issues: 5
20 commits (most recent shown):

acef33b  feat: v0.3.0 — asymmetric K/V bits, layer-adaptive precision, deprecate QJL
fe86073  feat: 4-bit nibble packing — halves index storage, 1 GB saved at 4K context
efef4e2  marketing: add strategy, blog post, reddit posts, portfolio blog plan
47d1238  docs: fix last stale refs — PyPI 0.1.0→0.2.0, roadmap status update
78d8f34  feat: compressed index storage — real KV cache compression (v0.2.0)
5aab56b  docs: accuracy sweep — fix stale numbers, update references, add research index
94cbb3e  bench: add cross-architecture + long-context data (42 total data points)
c444342  docs: full doc system — architecture, reference, workflow, codebase map