Found 127,456 repositories (showing 30)
microsoft
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing them in production. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate the R&D process.
LlamaChinese
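Llama Chinese community: a real-time roundup of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama large models. Fully open source and available for commercial use.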
AI4Finance-Foundation
FinRL®: Financial Reinforcement Learning. 🔥
datawhalechina
Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online at https://datawhalechina.github.io/easy-rl/
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
thu-ml
An elegant PyTorch deep reinforcement learning library.
aikorea
Reinforcement learning resources curated
OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
OpenPipe
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!
lucidrains
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
yandexdataschool
A course in reinforcement learning in the wild
keras-rl
Deep Reinforcement Learning for Keras.
rllm-org
Democratizing Reinforcement Learning for LLMs
THUDM
slime is an LLM post-training framework for RL Scaling.
unslothai
250+ Fine-tuning & RL Notebooks for text, vision, audio, embedding, TTS models.
inclusionAI
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
huggingface
This repo contains the Hugging Face Deep Reinforcement Learning Course.
hiyouga
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Gen-Verse
OpenClaw-RL: Train any agent simply by talking
boyu-ai
https://hrl.boyuai.com/
PeterGriffinJin
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
opendilab
A curated list of reinforcement learning with human feedback resources (continually updated)
GT-RIPL
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
AI4Finance-Foundation
Massively Parallel Deep Reinforcement Learning. 🔥
changyeyu
🌟 100+ original LLM/RL principle diagrams 📚, presented by the author of 《大模型算法》 (Large Model Algorithms)! 💥 (100+ LLM/RL Algorithm Maps)
PrimeIntellect-ai
Our library for RL environments + evals
OpenManus
A live-streamed development of RL tuning for LLM agents
hkust-nlp
Simple RL training for reasoning
wangshub
📈 How to trade stocks automatically with deep reinforcement learning
junxiaosong
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)