A high-throughput and memory-efficient inference and serving engine for LLMs
Stars: 75.2k
Forks: 15.1k
Watchers: 75.2k
Open Issues: 4.1k
Overall repository health assessment: no package.json found, which is expected — vLLM is a Python project, not a Node.js one.
Top 10 contributors by commit count: 885, 773, 498, 469, 464, 391, 345, 272, 257, 204 commits
Recent commits:
062f1a2  [Bug] Fix compile error for `swap_blocks_batch` in CUDA 13 (#38915)
81994e1  [Bugfix][LoRA] Fix missing in_proj_z in Qwen3_5ForConditionalGenerati… (#38927)
121ea5a  Removed GPU state confirmation and cleanup steps. (#38238)
5f1de2b  [Model Runner V2] Add config validation for not-yet-supported features (#38758)
a5a623d  [Bugfix] Re-enable Renormalize routing for TRT-LLM MoE experts (#38859)
f8c3af2  [vLLM IR] add `import_ir_kernels()` to support OOT platforms (#38807)
50cd567  Fix invalid logprobs with MTP enabled and sync scheduling (#38711)
7b1a742  [Frontend] new online quantization frontend (#38138)
97f92c6  [KVConnector] Skip `register_kv_caches` on profiling (#38558)
46f02e0  [Bugfix] Fix AWQ models batch invariance issues (#38670)
6b48722  [XPU] bump up xpu-kernel v0.1.5, transpose moe weights (#38342)