[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized attention that achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models.
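For context, the kernel is designed as a drop-in replacement for standard scaled dot-product attention. Below is a minimal usage sketch, assuming the `sageattention` package installs successfully on a CUDA GPU and exposes the `sageattn` entry point with the documented `tensor_layout`/`is_causal` arguments; exact names may vary between releases.

```python
# Minimal sketch: SageAttention as a drop-in quantized attention kernel.
# Assumes `pip install sageattention` succeeded and a CUDA GPU is available;
# the sageattn signature follows the repo's documented usage and may differ
# in other versions.
import torch
from sageattention import sageattn

batch, heads, seq_len, head_dim = 2, 16, 1024, 128

# Q, K, V in (batch, heads, seq_len, head_dim) layout, i.e. "HND".
q = torch.randn(batch, heads, seq_len, head_dim, dtype=torch.float16, device="cuda")
k = torch.randn(batch, heads, seq_len, head_dim, dtype=torch.float16, device="cuda")
v = torch.randn(batch, heads, seq_len, head_dim, dtype=torch.float16, device="cuda")

# Used in place of torch.nn.functional.scaled_dot_product_attention.
out = sageattn(q, k, v, tensor_layout="HND", is_causal=False)
print(out.shape)  # torch.Size([2, 16, 1024, 128])
```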
Stars: 3.3k
Forks: 388
Watchers: 3.3k
Open Issues: 175
Commit counts: 102, 34, 15, 9, 6, 5, 4, 3, 2, 1
Revert "Merge pull request #218 from guilhermeleobas/guilhermeleobas/torch-compile"
e5bf6eeView on GitHubRevert "Merge pull request #279 from guilhermeleobas/guilhermeleobas/fix-device"
35747c3View on GitHubRevert "pip install sageattention==2.2.0 --no-build-isolation"
c20aed1View on GitHub