Learning to set up RL environments and post-training flows.
Stars: 0
127190d clean: added required metadata
e6480ca Merge branch 'main' of https://github.com/arjunk820/rlhf-learning
6f7f098 chore: ran training and uploaded results
af2f4bd Created using Colab (for training)
a17f5f2 docs: added goals for sft proj
0a2e850 feat: created a notebook for sft training
5d796af fix: wandb experiment tracking accuracy
1dfcc12 feat: setup SFT training
c852c5a initial commit
User: 9 commits