Back to search
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
Stars
264
Forks
11
Watchers
264
Open Issues
7
Overall repository health assessment
No package.json found
This might not be a Node.js project
10
commits
1
commits
add support for different formats when using gpt models as human simulators
16f23baView on GitHubMerge branch 'main' of https://github.com/facebookresearch/sweet_rl
fc80a09View on GitHub