LLM inference in C/C++
Stars: 101.0k
Forks: 16.3k
Watchers: 101.0k
Open Issues: 1.3k
43a4ee4 HIP: build each ci build test for a different architecture (#21337)
f1ac841 ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315)
39b27f0 (revert) kv-cache : do not quantize SWA KV cache (#21332)
7992aa7 tests : add unit test coverage for llama_tensor_get_type (#20112)
5803c8d tests: allow exporting graph ops from HF file without downloading weights (#21182)
63f8fe0 model, mtmd: fix gguf conversion for audio/vision mmproj (#21309)
2233737 common : add commentary rules for gpt-oss-20b (#21286)
1.7k commits
409 commits
362 commits
356 commits
265 commits
251 commits
248 commits
103 commits
101 commits
100 commits