Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
Stars
1.0k
Forks
146
Watchers
1.0k
Open Issues
194
Overall repository health assessment
No package.json found
This might not be a Node.js project
386
commits
281
commits
50
commits
37
commits
32
commits
30
commits
20
commits
20
commits
19
commits
15
commits
Add explicit errors for unsupported Megatron profiles (#887)
4dd7770View on GitHubRevert "[BUGFIX] [P2PRDMA] Add rollout post-processing after P2PRDMA weight updates" (#882)
e0fc889View on GitHubSplit session lock: release during proxy to unblock DELETE (#870)
0977e6eView on GitHub[session] Rename SingleUserTurnTrajectory to LinearTrajectory (#859)
0baba44View on GitHubAdd debug logging across rollout and session pipeline (#869)
38f4245View on GitHubRemove src_rank and use default behavior in gather_object (#833)
a3993daView on GitHub