Search Results

Found 15 repositories (showing 15)

RRHF

GanjinZero

❤️46

[NIPS2023] RRHF & Wombat

808

Python

Updated 1 month ago

EasyRLHF

DaehanKim

🧡65

EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets

Python

Updated 14 hours ago

dpo, instruction-tuning, ipo, +4

RRHF-V

chengq1001

❤️35

[COLING'25] RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human Feedback

Python

Updated 6 months ago

rrhfoem04-lib

ajxv

❤️40

Python Library for interfacing with the RRHFOEM04/RRHFOEM07-USB RFID Reader

MIT

Python

Updated 7 months ago

hid, iso14443a, iso15693, +16

llm_rrhf

ssbuild

❤️30

No description available

Apache-2.0

Python

Updated 1 year ago

rrhfoem04libc

sillylilfox

🧡55

C library for RRHFOEM04 Card readers, renewed

Updated 2 weeks ago

RRHF

annakijas1

❤️40

Rock and Roll Hall of Fame Inductee Data (1986-2018)

CC0-1.0

Updated 6 years ago

rrhfhfghfg

nameswer

❤️40

fghfghf

Unlicense

Updated 3 years ago

rrhfemsxkdlrj.github.io

rrhfemsxkdlrj

❤️25

No description available

HTML

Updated 3 years ago

rrhFrontend

lizet96

❤️25

No description available

JavaScript

Updated 1 year ago

RRHFIT-SYS32

AngelL327

❤️25

No description available

Dart

Updated 4 months ago

5rrhfrtd4geyrttuitsczwerxytruftgyiuuj

marwan-fahad

❤️25

No description available

JavaScript

Updated 2 years ago

rRhfpdyvQ5Y

LRCHub

🧡55

Hikaru Utada, "First Love" (HIKARU UTADA SCIENCE FICTION TOUR 2024)

Updated 3 weeks ago

-meta-name-google-site-verification-content-pqzeV_0-d6pRAEli9H9P6aTTOtMyRRhfwwx14JySr70-

EditFlow0

❤️35

No description available

Updated 2 months ago

RLHF-on-CartPole

Surveyed Reinforcement Learning from Human Feedback techniques in order to find out how AI systems can better align with human values. Reviewed and implemented key techniques such as AIHF, Christiano's method, and RRHF using the CartPoleV1 task, analyzing the strengths of each technique regarding scalability, efficiency, and robustness.

Python

Updated 1 year ago


RRHF

EasyRLHF

RRHF-V

rrhfoem04-lib

llm_rrhf

rrhfoem04libc

RRHF

rrhfhfghfg

rrhfemsxkdlrj.github.io

rrhFrontend

RRHFIT-SYS32

5rrhfrtd4geyrttuitsczwerxytruftgyiuuj

rRhfpdyvQ5Y

-meta-name-google-site-verification-content-pqzeV_0-d6pRAEli9H9P6aTTOtMyRRhfwwx14JySr70-

RLHF-on-CartPole
