Reproduce R1 Zero on Logic Puzzle
Stars
2.4k
Forks
164
Watchers
Open Issues
13
Overall repository health assessment
No package.json found
This might not be a Node.js project
User
43
commits
11
4
3
ADORA
9d2c457
Short-RL
67b863c
update batch size
882de6a
add aime amc eval scripts
7030f4f
36bc250
minimize kl
520aeba
add eval
69aed64
add curriculum learning
4fe7fb4
Update README.md
b8f221d
6283ca0
46cf370
readme
1f1dfb3
f663d93
update teaser
170435c
Update citation
9771a71