A repository of Python & PyTorch scripts which (currently) converts .safetensors models into scaled FP8 variants, utilizing gradient descent for optimal rounding.
Stars
27
Forks
1
Watchers
27
Open Issues
1
Overall repository health assessment
No package.json found
This might not be a Node.js project
15
commits
Update fast TPEC-Quant for proper top_k calculation & lower niter on pca_lowrank
8c0a00cView on GitHubUpdate TPEC-Quant with faster principal component calcs, add early skips.
4b30715View on GitHub