A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
Stars: 8.9k
Forks: 1.5k
Watchers: 8.9k
Open Issues: 757
Commit counts: 286, 173, 111, 43, 38, 34, 33, 31, 28, 24
Fix lerp overload ambiguity with std::lerp under C++20 (#1985) (f199212)
Fix divide-by-zero in GroupNorm two-pass kernel for large batch sizes (#1984) (dbe421e)
Deprecate apex.contrib.fmha and apex.contrib.multihead_attn (#1932) (212061e)
Add VRAM estimation and memory-aware skipping to GroupNorm large tensor tests (#1976) (9c5bd72)