Best practices and guides for writing distributed PyTorch training code
Stars: 600
Forks: 68
Watchers: 600
Open Issues: 8
c404fcb Merge branch 'main' of https://github.com/LambdaLabsML/distributed-training-guide
cb9eead Adding bucket_cap_mb setting to ch2 and moving torch.compile before DDP
9ff32d3 Merge branch 'main' of https://github.com/LambdaLabsML/distributed-training-guide