Found 90 repositories(showing 30)
OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
OpenRL-Lab
Unified Reinforcement Learning Framework
openrlbenchmark
No description available
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.
No description available
rosieyzh
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
sjelassi
Code for "Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models".
Freder-chen
No description available
galdar496
OpenRL-based realtime path tracer
victorShawFan
添加了simpo方法的OpenRLHF,个人修改,原仓库链接:https://github.com/OpenLLMAI/OpenRLHF
LLM4AIOps
No description available
OpenRLHF
No description available
FreeAndFair
Free & Fair's open source RLA support software.
OpenRL-Lab
OpenRL document
computermouth
A Free and Libre Community Asset Pack for RogueLegacy1
yyht
No description available
andrewsiah
Replicating Deepseek R1 and OpenAI O series training
swtheing
No description available
eksno
RLBot Functions and Classes made for ease of use and with detailed descriptions.
millioniron
No description available
ardywibowo
Open Library for Quick Iteration on RL Development
ggsharma
No description available
EthanChen39
No description available
mailshanx
No description available
shizhengLi
深入解析OpenRLHF框架
garrett4wade
No description available
KorolevSoftware
No description available
Qeneb
No description available
honglu2875
Uses upstream OpenRLHF to experiment a few things.
nightfury12901
No description available