Search Results

Found 14,148 repositories (showing 30)

annotated_deep_learning_paper_implementations

labmlai

💚100

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

66.3k

6.7k

MIT

Python

Updated 3 hours ago

attention deep-learning deep-learning-tutorial +10

easy-rl

datawhalechina

💚99

Reinforcement learning tutorial in Chinese (the "Mushroom Book" 🍄). Read online at https://datawhalechina.github.io/easy-rl/

14.0k

2.2k

NOASSERTION

Jupyter Notebook

Updated 12 hours ago

a3c ddpg deep-reinforcement-learning +11

tianshou

thu-ml

💚93

An elegant PyTorch deep reinforcement learning library.

10.5k

1.3k

MIT

Python

Updated 3 hours ago

a2c atari bcq +16

cleanrl

vwxyzjn

💚90

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

9.5k

1.0k

NOASSERTION

Python

Updated 58 minutes ago

a2c actor-critic advantage-actor-critic +13

Reinforcement-learning-with-tensorflow

MorvanZhou

💚94

Simple reinforcement learning tutorials, from 莫烦Python (Chinese-language AI teaching)

9.4k

5.0k

MIT

Python

Updated 14 hours ago

a3c actor-critic asynchronous-advantage-actor-critic +17

OpenRLHF

💛88

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

9.3k

915

Apache-2.0

Python

Updated 4 hours ago

large-language-models proximal-policy-optimization raylib +5

deep-reinforcement-learning

udacity

💚90

Repo for the Deep Reinforcement Learning Nanodegree program

5.2k

2.4k

MIT

Jupyter Notebook

Updated 2 days ago

cross-entropy ddpg deep-reinforcement-learning +13

Reinforcement-Learning

andri27-ts

💛81

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

4.7k

670

MIT

Jupyter Notebook

Updated 8 minutes ago

a2c artificial-intelligence deep-learning +9

Deep-reinforcement-learning-with-pytorch

sweetice

💛84

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

4.6k

898

MIT

Python

Updated 1 day ago

a2c a3c actor-critic +15

ElegantRL

AI4Finance-Foundation

💛84

Massively Parallel Deep Reinforcement Learning. 🔥

4.3k

970

NOASSERTION

Python

Updated 4 hours ago

a2c bipedalwalkerhardcore ddpg +14

Deep_reinforcement_learning_Course

simoninithomas

💛81

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

3.9k

1.2k

Jupyter Notebook

Updated 1 day ago

a2c actor-critic deep-learning +9

pytorch-a2c-ppo-acktr-gail

ikostrikov

💛82

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

3.9k

843

MIT

Python

Updated 21 hours ago

a2c acktr actor-critic +17

DeepRL

ShangtongZhang

💛80

Modularized Implementation of Deep RL Algorithms in PyTorch

3.4k

697

MIT

Python

Updated 1 day ago

a2c categorical-dqn ddpg +13

DRL-Pytorch

XinJingHao

💛72

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

3.3k

389

Python

Updated 1 day ago

asl c51 categorical-dqn +13

minimalRL

seungeunrho

💛78

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

3.2k

494

MIT

Python

Updated 3 days ago

a2c a3c acer +12

FinRL-Trading

AI4Finance-Foundation

💛82

FinRL-X: An AI-Native Modular Infrastructure for Quantitative Trading

2.9k

900

Apache-2.0

Python

Updated 2 hours ago

a2c-algorithm automated-stock-trading ddpg +11

PPOxFamily

opendilab

💛75

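PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons that help you sort out the algorithm theory, untangle the code logic, and get hands-on with decision AI applications)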

2.6k

212

Apache-2.0

Python

Updated 2 days ago

course decision-intelligence deep-reinforcement-learning +2

PPO-PyTorch

nikhilbarhate99

💛77

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

2.3k

425

MIT

Python

Updated 1 day ago

deep-learning deep-reinforcement-learning policy-gradient +8

on-policy

marlbenchmark

💛76

This is the official implementation of Multi-Agent PPO (MAPPO).

2.0k

373

MIT

Python

Updated 17 hours ago

algorithms hanabi mappo +5

DRL-code-pytorch

Lizhi-sjtu

💛74

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

1.5k

207

MIT

Python

Updated 2 days ago

ddpg-pytorch dqn-pytorch ppo-gru +7

MOSS-RLHF

OpenLMLab

🧡62

Secrets of RLHF in Large Language Models Part I: PPO

1.4k

105

Apache-2.0

Python

Updated 1 week ago

ai-safety alignment rlhf

SLM-Lab

kengz

💛74

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

1.3k

288

MIT

Python

Updated 4 days ago

a2c a3c benchmark +7

Popular-RL-Algorithms

quantumiracle

💛73

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

1.3k

148

Apache-2.0

Jupyter Notebook

Updated 14 hours ago

reinforcement-learning soft-actor-critic state-of-the-art

Super-mario-bros-PPO-pytorch

vietnh1009

💛74

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

1.3k

236

MIT

Python

Updated 1 day ago

ai deep-learning gym +11

PyTorch-RL

Khrylx

💛73

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

1.3k

191

MIT

Python

Updated 6 days ago

a2c deep-reinforcement-learning fisher-vectors +8

PPO-for-Beginners

ericyangyu

💛73

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

1.2k

159

MIT

Python

Updated 21 hours ago

machine-learning ppo pytorch +2

LearningHumanoidWalking

rohanpsingh

💛72

Training a humanoid robot for locomotion using Reinforcement Learning

1.1k

130

BSD-2-Clause

Python

Updated 41 minutes ago

bipedal-robots cassie humanoids +4

DeepRL-Tutorials

qfettes

🧡59

Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch

1.1k

328

Jupyter Notebook

Updated 3 weeks ago

a2c actor-critic advantage-actor-critic +17

xuance

agi-brain

💛73

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

1.1k

155

MIT

Python

Updated 3 days ago

a2c atari ddpg +16

purejaxrl

luchris429

💛72

Really Fast End-to-End Jax RL Implementations

1.0k

Apache-2.0

Python

Updated 2 days ago

deep-reinforcement-learning jax ppo +2
