Democratizing Reinforcement Learning for LLMs
Stars: 5.4k
Forks: 539
Watchers: 5.4k
Open Issues: 162
Commit counts, in descending order (contributor names not preserved in the source): 477, 344, 157, 140, 97, 72, 68, 38, 34, 23
ac1852d style: auto-format 21 files to fix ruff-format pre-commit failures (#487)
7fb0450 fix: resolve CI failures — E501 lint, tinker test deps, disable Claude actions (#486)
e5b81c1 Merge pull request #485 from yifannnwu/feat/sft-hf-template
19618b2 Merge pull request #469 from MarkJoson/fix-sdk-rollout-engine-crash
bc54009 fix: handle signal.signal ValueError in non-main threads (#484)
d6101dc feat: add hf_template tokenize_and_mask method + verl SFTTrainer compat
ec8bd7a fix: migrate VerlBackend to new EngineWorker path (verl 0.7.1) (#483)
4f49efc Merge pull request #476 from taivu1998/tdv/issue-321-multi-mcp
4520ad7 Merge pull request #480 from Lidang-Jiang/fix/verl-import-path
61a5145 Merge pull request #471 from JiwaniZakir/fix/447-norm-adv-by-std-in-grpo-should-be-from-r
3bbe160 fix: update verl import paths for verl 0.7.1+ compatibility