Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning
Stars
15
Forks
1
Watchers
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
User
5
commits
update
ac9f99e
b109679
iclr 2026
86f258c
update: code and dataset
1085db5
init
b604782