Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
Stars
559
Forks
75
Watchers
559
Open Issues
43
Overall repository health assessment
No package.json found
This might not be a Node.js project
26
commits
25
commits
25
commits
22
commits
15
commits
15
commits
13
commits
7
commits
6
commits
3
commits
feat(token_compressor): introduce universal pruning framework and ali… (#238)
0f1e8a6View on GitHubfix: AttributeError: 'Qwen3VLForConditionalGeneration' object has no attribute 'language_model' (#279)
d90d4d9View on GitHubfeat(speculative): support extract hidden states with vllm backend (#273)
de4f5ecView on GitHubsupport image size control by max_pixels/min_pixels for eagle3 (#272)
bf884aeView on GitHubfeat(spinquant): refactor SpinQuant with CPU offload, parallel fuse, meta device support, and vLLM export (#269)
99efb57View on GitHubsupport eagle3 offline training with per_device_train_batch_size>1 (#264)
676ae7dView on GitHub