Found 3 repositories(showing 3)
OpenLMLab
Secrets of RLHF in Large Language Models Part I: PPO
Miraclemarvel55
用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]
DeepAlgoExpert
No description available
All 3 repositories loaded