1bit llama.cpp gguf weights paired with turboquant 4 bit kv cache
Stars
22
Forks
6
Watchers
22
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
1.6k
commits
389
commits
362
commits
349
commits
261
commits
244
commits
225
commits
101
commits
100
commits
91
commits
Merge pull request #2 from RodriMora/codex/fix-q1-0-g128-cpu-kernel
be3adebView on GitHubFix security audit findings: thread safety, NULL checks, sizeof
7ededd2View on GitHubAdd TBQ4_0 KV cache with CUDA dequantize (fixes silent CPU fallback)
bb1e236View on GitHub