Found 2 repositories(showing 2)
DeepGym
RL training environments with verifiable rewards for coding agents. Works with TRL, Unsloth, verl, OpenRLHF.
marketagents-ai
An RLVR environment for post-training dataset generation with parallel rollouts
All 2 repositories loaded