Universal LLM Deployment Engine with ML Compilation
Stars
22.4k
Forks
2.0k
Watchers
22.4k
Open Issues
300
Overall repository health assessment
No package.json found
This might not be a Node.js project
488
commits
150
commits
135
commits
87
commits
71
commits
56
commits
46
commits
32
commits
32
commits
30
commits
[Fix] Replace str(target.kind) with target.kind.name to align with new format (#3470)
e656f00View on GitHub[FIX] Fix black formatting in per_tensor_quantization (#3466)
fcce2ccView on GitHubfix: apply post_layernorm in CLIPVisionTransformer forward (#3456)
1c79631View on GitHubFix model_task pydantic warning and skip flashinfer on non-linux (#3460)
76cd196View on GitHubAdd Qwen3.5 GatedDeltaNet hybrid model + kHybrid KVStateKind (#3449)
05f79e1View on GitHub[FIX] Rename T.alloc_buffer to T.sblock_alloc_buffer (#3457)
20d7fb3View on GitHub[Bug] Replace alloc_buffer with sblock_alloc_buffer and temporarily bypass CSE (#3454)
2eb1f12View on GitHub