Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
Stars: 1.8k
Forks: 153
Watchers: 1.8k
Open Issues: 0
c72f5ce: Update README.md to point to HuggingFace for updated data (#5)
0398e36: Add harmless preference data and red team dialogue data (#1)