Search Results

Found 6 repositories (showing 6)

TextRL

voidful

🧡61

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in Hugging Face's transformers (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

566

MIT

Python

Updated 1 week ago

chatgpt controlled-nlg gpt-2 +7

context-relevant-pruning-textrl

IBM

❤️30

Repository for Reinforcement Learning for Text-based games with Context Relevant State Truncation to improve generalization to unseen games. Code accompanying the accepted EMNLP 2020 paper.

Apache-2.0

Python

Updated 1 year ago

textRL

won1k

❤️25

No description available

Updated 9 years ago

KB-TextRL

DUT-B910

❤️35

Knowledge representation learning software that integrates knowledge bases and text

Python

Updated 4 years ago

TextRL-text-generation-using-RL

robertwilkins0115

❤️30

No description available

MIT

Python

Updated 5 months ago

rlhf-nlp

jddunn

❤️35

Proof-of-concept library built on TextRL for easy training and usage of fine-tuned models using RLHF, a reward model, and PPO

Python

Updated 1 year ago

ppo reward-model rlhf +1

