GitHub Explorer

by Alexey Ratnikov

GitHub Explorer

GitHub Explorer|TRENDING COMPARE|FEEDBACK

Back to search

alibaba/MNN - GitHub Explorer | GitHub Explorer | Trending | Compare

Back to search

MNN

alibaba•PUBLIC

View on GitHub

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

armconvolutiondeep-learningembedded-devicesllmmachine-learning

Apache License 2.0

Created on Apr 15, 2019

Updated on Apr 6, 2026

Stars

14.8k

Forks

2.3k

Watchers

14.8k

Open Issues

Repository Health Score

💚

100/100

Excellent

Overall repository health assessment

Score Breakdown

Activity

Active development - updated this week

30/30

100%

Issues Analytics

Total Issues

All time

Open

49% of total

Closed

Recent Commits

[OpecnCL:Feature] Opt linearAttention

MNNSyncBot•3 days ago

b56a81bView on GitHub

[Metal: Feature] Opt metal linear attention

MNNSyncBot•3 days ago

2d4a17bView on GitHub

[MNNChat:Bugfix] fix fastlane structure (#4337)

ᡠᠵᡠᡳ ᡠᠵᡠ ᠮᠠᠨᡩ᠋ᠠᠨ•3 days ago

08bb57eView on GitHub

[LLM:Feature] Add text-level prompt cache for multi-turn chat (#4330)

Jan Slominski•3 days ago

66fd356View on GitHub

[CPU:Feature] Add RISC-V Vector extension (RVV) support and fix tokenizer header (#4331)

Sherlockzhangjinge•3 days ago

ade3d6cView on GitHub

[MNN:Feature] Add Moore Threads MUSA Backend Support (#4182)

dongyang-mt•3 days ago

c857fa2View on GitHub

[Infra:Feature] Ignore .oneday/ in <.gitignore>

MNNSyncBot•4 days ago

06a0101View on GitHub

[LLM:Feature] Support Qwen3.5 smooth and omni export (#4336)

YinHanke•4 days ago

a351235View on GitHub

[Bugfix: Typo] delete print log

MNNSyncBot•6 days ago

136cec0View on GitHub

[LLM:Feature] add mmap_size in config.

MNNSyncBot•5 days ago

2e0ac31View on GitHub

[LinearAttention: Feature] support linear attention status load/store in disk

MNNSyncBot•6 days ago

10d2ae9View on GitHub

[MNNChat:Bugfix] reuse loaded runtime session for API start in 0.8.2.2 (#4319)

Jinde.Song•6 days ago

ccd2dbfView on GitHub

[CPU:Feature] Add TurboQuant TQ3/TQ4 KV cache quantization

MNNSyncBot•1 week ago

244f5d1View on GitHub

[Vulkan:Feature] Support llm inference

MNNSyncBot•1 week ago

242af36View on GitHub

[LLM:Bugfix] Fix prefix disk cache not loaded after first response (#4316)

luckysmg•1 week ago

622b3fbView on GitHub

View all commits

GitHub Explorer

MNN

Score Breakdown

Issues Activity: Last 6 months

Top Labels

Hottest Issues