Performed supervised fine-tuning (SFT) on Llama 3.1 8B using HH-RLHF, and ranked 10K responses with Llama 3.1 70B to build a safety-optimized dataset.
Stars: 2
Forks: 0
Watchers: 2
Open Issues: 0
Commits: 8