Search Results

Found 127,456 repositories(showing 30)

qlib

microsoft

💚100

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.

40.6k

6.4k

MIT

Python

Updated 6 minutes ago

algorithmic-tradingauto-quantdeep-learning+15

Llama-Chinese

LlamaChinese

💚93

Llama中文社区，实时汇总最新Llama学习资料，构建最好的中文Llama大模型开源生态，完全开源可商用

14.7k

1.3k

Python

Updated 1 day ago

agentllamallama4+3

FinRL

AI4Finance-Foundation

💚100

FinRL®: Financial Reinforcement Learning. 🔥

14.7k

3.3k

MIT

Jupyter Notebook

Updated 1 hour ago

algorithmic-tradingdeep-reinforcement-learningdrl-algorithms+11

easy-rl

datawhalechina

💚99

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

14.0k

2.2k

NOASSERTION

Jupyter Notebook

Updated 4 hours ago

a3cddpgdeep-reinforcement-learning+11

dopamine

google

💚95

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

10.9k

1.4k

Apache-2.0

Jupyter Notebook

Updated 1 day ago

aigoogleml+2

tianshou

thu-ml

💚93

An elegant PyTorch deep reinforcement learning library.

10.5k

1.3k

MIT

Python

Updated 1 hour ago

a2cataribcq+16

awesome-rl

aikorea

💚90

Reinforcement learning resources curated

9.7k

1.9k

Updated 3 hours ago

OpenRLHF

💛88

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

9.3k

915

Apache-2.0

Python

Updated 7 minutes ago

large-language-modelsproximal-policy-optimizationraylib+5

ART

OpenPipe

💛87

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

9.2k

790

Apache-2.0

Python

Updated 8 hours ago

agentagentic-aigrpo+6

PaLM-rlhf-pytorch

lucidrains

💛85

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

7.9k

680

MIT

Python

Updated 17 hours ago

artificial-intelligenceattention-mechanismsdeep-learning+3

Practical_RL

yandexdataschool

💚91

A course in reinforcement learning in the wild

6.5k

1.8k

Unlicense

Jupyter Notebook

Updated 12 hours ago

course-materialsdeep-learningdeep-reinforcement-learning+8

keras-rl

💛89

Deep Reinforcement Learning for Keras.

5.6k

1.3k

MIT

Python

Updated 3 days ago

kerasmachine-learningneural-networks+3

rllm

rllm-org

💛76

Democratizing Reinforcement Learning for LLMs

5.4k

539

Apache-2.0

Python

Updated 9 hours ago

agent-frameworkagentic-workflowcoding-agent+11

slime

THUDM

💛72

slime is an LLM post-training framework for RL Scaling.

5.2k

712

Apache-2.0

Python

Updated 35 minutes ago

notebooks

unslothai

💛79

250+ Fine-tuning & RL Notebooks for text, vision, audio, embedding, TTS models.

5.2k

842

LGPL-3.0

Jupyter Notebook

Updated 45 minutes ago

unsloth

AReaL

inclusionAI

💛75

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

5.0k

460

Apache-2.0

Python

Updated 1 hour ago

agentllmllm-agent+5

deep-rl-class

huggingface

💛83

This repo contains the Hugging Face Deep Reinforcement Learning Course.

4.8k

782

Apache-2.0

MDX

Updated 2 hours ago

deep-learningdeep-reinforcement-learningreinforcement-learning+1

EasyR1

hiyouga

💛74

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

4.8k

367

Apache-2.0

Python

Updated 18 hours ago

aideepseekgpt+5

OpenClaw-RL

Gen-Verse

💛80

OpenClaw-RL: Train any agent simply by talking

4.8k

505

Apache-2.0

Python

Updated 48 minutes ago

asynccodinggrpo+10

Hands-on-RL

boyu-ai

💛83

https://hrl.boyuai.com/

4.7k

812

Apache-2.0

Jupyter Notebook

Updated 1 hour ago

Search-R1

PeterGriffinJin

💛78

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

4.4k

386

Apache-2.0

Python

Updated 47 minutes ago

awesome-RLHF

opendilab

💛77

A curated list of reinforcement learning with human feedback resources (continually updated)

4.3k

252

Apache-2.0

Updated 6 hours ago

deep-learningdeep-reinforcement-learninghuman-feedback+3

Awesome-LLM-Robotics

GT-RIPL

💛78

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

4.3k

326

BSD-3-Clause

Updated 20 hours ago

ElegantRL

AI4Finance-Foundation

💛84

Massively Parallel Deep Reinforcement Learning. 🔥

4.3k

970

NOASSERTION

Python

Updated 7 hours ago

a2cbipedalwalkerhardcoreddpg+14

LLM-RL-Visualized

changyeyu

💛78

🌟100+ 原创 LLM / RL 原理图📚，《大模型算法》作者巨献！💥（100+ LLM/RL Algorithm Maps ）

4.0k

380

NOASSERTION

Python

Updated 2 hours ago

aialgorithmdeep-learning+7

verifiers

PrimeIntellect-ai

💛74

Our library for RL environments + evals

4.0k

530

MIT

Python

Updated 1 hour ago

OpenManus-RL

OpenManus

💛79

A live stream development of RL tunning for LLM agents

4.0k

539

Apache-2.0

Python

Updated 17 hours ago

simpleRL-reason

hkust-nlp

💛72

Simple RL training for reasoning

3.8k

289

MIT

Python

Updated 2 days ago

RL-Stock

wangshub

💛82

📈 如何用深度强化学习自动炒股

3.6k

793

MIT

Jupyter Notebook

Updated 22 hours ago

AlphaZero_Gomoku

junxiaosong

💛84

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

3.6k

1.0k

MIT

Python

Updated 1 day ago

alphagoalphago-zeroalphazero+10

GitHub Explorer

Search Results

qlib

Llama-Chinese

FinRL

easy-rl

dopamine

tianshou

awesome-rl

OpenRLHF

ART

PaLM-rlhf-pytorch

Practical_RL

keras-rl

rllm

slime

notebooks

AReaL

deep-rl-class

EasyR1

OpenClaw-RL

Hands-on-RL

Search-R1

awesome-RLHF

Awesome-LLM-Robotics

ElegantRL

LLM-RL-Visualized

verifiers

OpenManus-RL

simpleRL-reason

RL-Stock

AlphaZero_Gomoku

qlib

Llama-Chinese

FinRL

easy-rl

dopamine

tianshou

awesome-rl

OpenRLHF

ART

PaLM-rlhf-pytorch

Practical_RL

keras-rl

rllm

slime

notebooks

AReaL

deep-rl-class

EasyR1

OpenClaw-RL

Hands-on-RL

Search-R1

awesome-RLHF

Awesome-LLM-Robotics

ElegantRL

LLM-RL-Visualized

verifiers

OpenManus-RL

simpleRL-reason

RL-Stock

AlphaZero_Gomoku