GitHub Explorer

by Alexey Ratnikov

GitHub Explorer

GitHub Explorer|TRENDING COMPARE|FEEDBACK

Back to search

scrya-com/rotorquant - GitHub Explorer | GitHub Explorer | Trending | Compare

Back to search

rotorquant

scrya-com•PUBLIC

View on GitHub

KV cache compression via block-diagonal rotation. Beats TurboQuant: better PPL (6.91 vs 7.07), 28% faster decode, 5.3x faster prefill, 44x fewer params. Drop-in llama.cpp integration.

Created on Mar 26, 2026

Updated on Apr 6, 2026

Stars

247

Forks

Watchers

247

Open Issues

Repository Health Score

🧡

65/100

Fair

Overall repository health assessment

Score Breakdown

Activity

Active development - updated this week

30/30

100%

Issues Analytics

Total Issues

All time

Open

60% of total

Closed

Recent Commits

Add RaBitQ module + comprehensive 1-bit/2-bit benchmark suite

John D. Pope•3 days ago

616cb93View on GitHub

Update CLAUDE.md with llama.cpp status, PPL results, and TODOs

John D. Pope•5 days ago

1ba8989View on GitHub

Credit @ParaMind2025 for PlanarQuant and IsoQuant

John D. Pope•5 days ago

d3eae68View on GitHub

Update README: speed benchmarks, architecture evolution, commit history

John D. Pope•5 days ago

7511721View on GitHub

Update README: symmetric 3-bit PPL results beat TurboQuant

John D. Pope•5 days ago

61154aeView on GitHub

Add Llama 3.1 8B benchmarks: 239 tok/s decode, PPL 8.44, 4% faster than FP16

John D. Pope•6 days ago

6ce8c03View on GitHub

Update README with RTX 5090 llama.cpp CUDA benchmarks

John D. Pope•6 days ago

6637e30View on GitHub

Rewrite README: concise, results-first, reproducible

John D. Pope•6 days ago

7dd21fdView on GitHub

Update README: delete Mac Metal numbers, add authoritative CUDA PPL

John D. Pope•6 days ago

fc29d06View on GitHub

Add post-prefill PPL benchmarks: IsoQuant 4-bit 9.03, PlanarQuant 3-bit 10.12

John D. Pope•6 days ago

ec98f4bView on GitHub

Restore RotorQuant trivector centroids, add CUDA PPL to README

John D. Pope•6 days ago

0c98c28View on GitHub

Add IsoQuant + PlanarQuant backends to PPL benchmark

John D. Pope•6 days ago

b9d3f1aView on GitHub

Add wikitext download step to PPL benchmark instructions

John D. Pope•6 days ago

bfa6022View on GitHub

Update README with PPL benchmarks: iso3 is 2.6-43x better than turbo3

John D. Pope•6 days ago

a195f9bView on GitHub

Add llama.cpp Metal benchmark results to README

John D. Pope•6 days ago

1c26657View on GitHub

View all commits

GitHub Explorer

rotorquant

Score Breakdown

Issues Activity: Last 6 months

Top Labels

Hottest Issues