GitHub Explorer

by Alexey Ratnikov

Search Results

Found 1 repositories(showing 1)

codepawl

🧡55

PyTorch implementation of TurboQuant. Near-optimal vector quantization for KV cache compression and vector search. 3-bit with zero accuracy loss.

MIT

Python

Updated 3 days ago

compressioninferencekv-cache+3

All 1 repositories loaded