Found 4 repositories(showing 4)
facebookresearch
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
adrienhadjsalah
The sweetest Reinforcement Learning framework
SweetRlk
No description available
niintendoo64
haii !! + ° . ๑・° ⊹ . + ° . ๑・° ⊹ . + ° . ISTP 9w1 introvert my replies may seem cold and dry(sometims..), but im rlly sweet to talk idc abt pronouns, but i prefer she/her + ° . ๑・° ⊹ . + ° . ๑・° ⊹ . + ° . mostly afk... c+h always allowed i dont bite. i rlly luv matching cosplayss n cuddling with em!!!
All 4 repositories loaded