GitHub Explorer

by Alexey Ratnikov

Search Results

Found 2 repositories(showing 2)

DeepGym

🧡65

RL training environments with verifiable rewards for coding agents. Works with TRL, Unsloth, verl, OpenRLHF.

Python

Updated 5 days ago

ai-agentscode-executioncoding-agents+15

marketagents-ai

❤️45

An RLVR environment for post-training dataset generation with parallel rollouts

Python

Updated 1 month ago

All 2 repositories loaded