YaFSDP: Yet another Fully Sharded Data Parallel
Stars
984
Forks
47
Watchers
984
Open Issues
6
Overall repository health assessment
No package.json found
This might not be a Node.js project
45
commits
6
commits
4
commits
3
commits
2
commits
2
commits
1
commits
Allow multiple forward passes with partial gradient computation in yafsdp
b2d506eView on GitHubCustom factor to reduce_scatter in YaFSDP, Remove ScaleGradient in GLU [YaFSDP]
71b470cView on GitHubSet no reshard after forward for last module across modules with the same data buffer
ee7cb4bView on GitHub