Search Results

Found 3 repositories (showing 3)

OpenLMLab

🧡62

Secrets of RLHF in Large Language Models Part I: PPO

1.4k

105

Apache-2.0

Python

Updated 1 week ago

ai-safety, alignment, rlhf

Miraclemarvel55

❤️30

Training LLaMA or MOSS with RLHF (LoRA optional)

GPL-3.0

Python

Updated 1 year ago

chinese, llama, lora, +6

DeepAlgoExpert

❤️30

No description available

Apache-2.0

Python

Updated 2 years ago
