GitHub Explorer

by Alexey Ratnikov

Search Results

Found 1 repositories(showing 1)

AI-Engineering-at

💛70

Practical guide: TurboQuant KV-cache quantization for llama.cpp. Run 122B models on consumer GPUs.

NOASSERTION

Shell

Updated 14 hours ago

All 1 repositories loaded