TurboQuant: an ultra-low-bit KV-cache compression layer built on top of llama.cpp for LLM inference. Reduces KV-cache VRAM overhead by ~75-80% using custom CUDA kernels.
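The repository's own kernels are CUDA and are not reproduced here, but the storage math behind the ~75-80% figure can be sketched with a toy per-block 2-bit quantizer. Everything below (function names, block size of 16, fp16 scale/zero-point metadata) is an illustrative assumption, not TurboQuant's actual scheme.

```python
import numpy as np

def quantize_kv_block(x, bits=2):
    # Hypothetical sketch: asymmetric per-block quantization of a
    # KV-cache block. Not the repo's real kernel, just the idea.
    levels = (1 << bits) - 1                      # 3 for 2-bit
    lo, hi = float(x.min()), float(x.max())
    scale = (hi - lo) / levels if hi > lo else 1.0
    q = np.clip(np.round((x - lo) / scale), 0, levels).astype(np.uint8)
    # Metadata stored in fp16, as low-bit schemes commonly do.
    return q, np.float16(scale), np.float16(lo)

def dequantize_kv_block(q, scale, lo):
    return q.astype(np.float32) * np.float32(scale) + np.float32(lo)

# One 16-element block of keys, originally fp16.
rng = np.random.default_rng(0)
block = rng.standard_normal(16).astype(np.float16).astype(np.float32)
q, scale, lo = quantize_kv_block(block)

# Storage: 16 values * 2 bits = 4 bytes, plus two fp16 metadata
# values (scale, zero-point) = 4 bytes, versus 32 bytes in fp16.
packed_bytes = 16 * 2 // 8 + 2 * 2
fp16_bytes = 16 * 2
print(f"compression: {1 - packed_bytes / fp16_bytes:.1%}")  # prints compression: 75.0%
```

With larger blocks the metadata amortizes further and the ratio rises toward the upper end of the quoted range; real kernels also trade block size against reconstruction error.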
Stars: 1 | Forks: 0 | Watchers: 1 | Open Issues: 0
6 commits

0121e3d  Initial commit: Add TurboQuant with integrated llama.cpp and README
fbfef59  docs: Update README with Quick Start and Fused CUDA performance metrics