Found 11 repositories(showing 11)
huggingface
Robust recipes to align language models with human and AI preferences
argilla-io
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
yashiqxlabai
No description available
Red-Fairy
No description available
spsanps
No description available
Ally-Ha
No description available
Ekanshqx
This is the final code for SFT full finetuning on multiple GPU
No description available
No description available
No description available
No description available
All 11 repositories loaded