🚀🚀 Efficient implementations of Native Sparse Attention
Stars
941
Forks
15
Watchers
941
Open Issues
6
Overall repository health assessment
No package.json found
This might not be a Node.js project
Merge pull request #10 from Relaxed-System-Lab/Youhe-Jiang-patch-1
7ff144fView on GitHubMerge pull request #9 from yiakwy-xpu-ml-framework-team/add_cudnn_into_baseline_benchmark
1da34f3View on GitHubadd cudnn to baseline attention benchmark
0c9cf75View on GitHubMerge pull request #7 from yiakwy-xpu-ml-framework-team/add_fa3_benchmark
26891beView on GitHubupdate gqa flops calculator and bump FA2 to 2.8.3
d84470fView on GitHubupdate flops GQA formula to Tri Dao's version (using q heads)
2cb4dceView on GitHubMerge pull request #5 from Relaxed-System-Lab/sel-attn-unit-test
32ae403View on GitHub