Found 53 repositories (showing 30)
haohaoXhang
A reinforcement learning from human feedback (RLHF) learning codebase built from scratch, implementing PPO, GRPO, GSPO, and related policy-optimization algorithms, with a clear, reproducible training pipeline. Since the documentation was converted from LaTeX files, if the Markdown renders abnormally, please open it with VS Code's Markdown plugin.
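For readers unfamiliar with the algorithms this repository names, the shared core of the PPO family is the clipped surrogate objective. A minimal per-sample sketch (function and variable names are illustrative, not taken from the repo):

```python
import math

def ppo_clip_term(logp_new, logp_old, advantage, clip_eps=0.2):
    """Per-sample clipped PPO surrogate (Schulman et al., 2017).

    logp_new / logp_old: log-prob of the action under the current
    and behavior policies; advantage: estimated advantage A(s, a).
    """
    ratio = math.exp(logp_new - logp_old)  # pi_new(a|s) / pi_old(a|s)
    clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Pessimistic (min) of the clipped and unclipped surrogates:
    # large policy steps get no extra credit beyond the clip band.
    return min(ratio * advantage, clipped * advantage)
```

In practice this term is averaged over a minibatch and its negative is minimized; GRPO and GSPO keep the same clipped ratio but change how the advantage is estimated (group-relative baselines instead of a learned value function).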
This repository contains lecture notes, practical materials, and implementations for the course "Reinforcement Learning: from Bandits to RLHF". The course is designed to provide a deep and systematic understanding of RL, combining solid mathematical foundations, intuitive explanations, practical implementations, and modern research insights.
Jiaxin-Wen
Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF"
Dylsimple60
🤖 Enhance reinforcement learning stability and efficiency with advanced algorithms like TRPO, PPO, DPO, GRPO, DAPO, and GSPO for optimized policy training.
This repository contains the implementation of a Reinforcement Learning with Human Feedback (RLHF) system using custom datasets. The project utilizes the trlX library for training a preference model that integrates human feedback directly into the optimization of language models.
ThinamXx
You will learn about RLHF from this repository 🤖.
beyhanmeyrali
🚀 Master AI fine-tuning on consumer hardware! Learn LoRA, DoRA, QDoRA, PiSSA & RLHF through engaging stories & hands-on tutorials. Optimized for AMD GPUs (ROCm). From 15-min demos to production deployment. Features LLaMA-Factory zero-code interface & constitutional AI. 🇺🇸🇹🇷 Available in English & Turkish.
The RLHF (Reinforcement Learning from Human Feedback) repository provides an implementation of reinforcement learning (RL) enhanced by human feedback. It enables agents to improve their learning and performance when traditional reward signals are insufficient or hard to define.
Tharindu1527
No description available
haroonraja01
Learning RLHF
mlnjsh
🎮 Interactive browser-based labs for "Complete Reinforcement Learning Journey: From Basics to RLHF" — visualize algorithms, tweak parameters, and watch agents learn in real-time.
scottpitcher
Developing a Reinforcement Learning model that learns to play Pokemon Platinum, integrated into the code through a desktop emulator such as DeSmuME. RLHF (Reinforcement Learning with Human Feedback) will then be used for model training.
Rahulkumar010
microDPO: A minimalist, pure PyTorch implementation of Direct Preference Optimization. Inspired by nanoGPT, it strips away massive RLHF libraries to reveal the elegant math behind AI alignment. Demystify how LLMs learn human preferences with a single, highly readable file. Train a tiny aligned model on your laptop in minutes.
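The "elegant math" that minimalist DPO implementations like the one above expose is a single loss over paired preferences. A pure-Python sketch of the per-pair DPO loss, assuming summed log-probabilities of each response under the policy and a frozen reference model (names are illustrative, not from microDPO):

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-pair Direct Preference Optimization loss (Rafailov et al., 2023).

    Inputs are summed token log-probs of the chosen/rejected response
    under the trained policy and the frozen reference model.
    """
    # Implicit reward margin: beta * difference of policy/reference log-ratios
    margin = beta * ((policy_chosen_logp - ref_chosen_logp)
                     - (policy_rejected_logp - ref_rejected_logp))
    # -log sigmoid(margin), written stably via log1p
    return math.log1p(math.exp(-margin))
```

Minimizing this pushes the policy to assign relatively more probability to the chosen response than the reference does, with `beta` controlling how far the policy may drift from the reference; no reward model or RL loop is needed, which is why a whole trainer fits in one readable file.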
just-ketan
OmniForge is a platform where a brand's knowledge base (PDFs, brand guidelines) is ingested and an AI agent then uses RAG to understand the brand voice. The system generates marketing copy and matching social media assets (diffusion). For "perfection," the system uses LoRA for consistent style and RLHF to learn from human marketing experts' feedback.
Taslim-M
Additional experiments to the paper "Language Models Learn to Mislead Humans via RLHF"
dx-dtran
No description available
mengbingrock
No description available
gokul77898
No description available
arjunk820
Learning setting up RL environments and post-training flows.
jkcte
https://www.youtube.com/watch?v=JgvyzIkgxF0&ab_channel=ArxivInsights
NairongZheng
An RLHF learning framework that can be used for simple debugging and inspecting intermediate results
guojialin-csu
RLHF learning notes
SonuDixit
No description available
DDDDorwin
A learning material list of RLHF (Updating)
LauraGomezjurado
No description available
YethroSaas
Created with CodeSandbox
chiz-ai
Using Reinforcement Learning with Human Feedback (RLHF) for optimizing cybersecurity educational learning paths.
rohandeb24
No description available
dragonfly90
No description available
tejaschandr
Reinforcement Learning from Human Feedback Exploration using Nvidia's HelpSteer3 Database