Fused Qwen3 MoE layer for faster training, compatible with Transformers, LoRA, bitsandbytes 4-bit quantization, and Unsloth. Training LoRA adapters over GGUF base weights is also possible.
Stars: 247 · Forks: 14 · Watchers: 247 · Open Issues: 3
Recent commits:
- 74c5063: Use torch.nn.functional.grouped_mm for the naive implementation
- a051ed1: In get_expert_counts_and_idx, use parallel impl when N is small
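The commits above reference `get_expert_counts_and_idx`, a routine that prepares MoE routing results for a grouped matmul. A minimal pure-Python sketch of what such a helper plausibly computes (the function body here is a hypothetical stand-in, not the repository's implementation): given each token's routed expert id, produce per-expert token counts and token indices reordered so each expert's tokens are contiguous, which is the layout a grouped GEMM consumes.

```python
def expert_counts_and_idx(expert_ids, num_experts):
    """Hypothetical sketch: per-expert token counts plus a stable
    reordering of token indices grouped by expert id."""
    counts = [0] * num_experts
    for e in expert_ids:
        counts[e] += 1
    # Stable sort of token indices by expert id; tokens routed to the
    # same expert become contiguous, in original order within a group.
    idx = sorted(range(len(expert_ids)), key=lambda i: expert_ids[i])
    return counts, idx

counts, idx = expert_counts_and_idx([2, 0, 1, 0, 2], num_experts=3)
# counts -> [2, 1, 2]; idx -> [1, 3, 2, 0, 4]
```

In the real layer this would run on GPU tensors (e.g. via `torch.bincount` and a stable `torch.argsort`); the second commit suggests the repo switches between a parallel and a sequential variant depending on the token count N.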