A framework for Reinforcement Learning from Human Feedback based on CleanRL
Stars: 6
Forks: 0
Watchers: 6
Open Issues: 4
182 commits
109 commits
73 commits
71 commits
23 commits
Merge remote-tracking branch 'origin/add-hf-bash' into add-hf-bash (5460415)
Merge branch '132-enhance-readme' of github.com:jualat/CleanRLHF into 132-enhance-readme (75d64eb)
Merge branch '132-enhance-readme' of github.com:jualat/CleanRLHF into 132-enhance-readme (49f2795)