Found 6 repositories(showing 6)
voidful
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
Repository for Reinforcement Learning for Text-based games with Context Relevant State Truncation to improve generalization to unseen games. Code accompanying accepted EMNLP2020 paper.
won1k
No description available
DUT-B910
融合知识库和文本的知识表示学习软件
robertwilkins0115
No description available
jddunn
POC library built on TextRL for easy training and usage of fine-tuned models using RLHF, a rewards model, and PPO
All 6 repositories loaded