Self-Alignment with Principle-Following Reward Models
Stars: 170
Forks: 13
Watchers
Open Issues: 5
User
11
commits
1
7392a56 Merge pull request #1 from eltociear/patch-1
dc9a39e Update README.md
b2e049b Fix typo in README.md
0777f67
cf9e37d
08ad926
1cc4fd7 modify dromedary-2 figure
64fd9cd add dromedary-2 figure
4a66d51 add training code
db7f872 clean the code of step 2 & 3
d512401 clean the code of step 1
c391b90 add demo code
195f098 first commit