Search Results

Found 495 repositories (showing 30)

Deep-reinforcement-learning-with-pytorch

sweetice

💛84

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

4.6k

898

MIT

Python

Updated 12 hours ago

a2c, a3c, actor-critic, +15

pytorch-a2c-ppo-acktr-gail

ikostrikov

💛82

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

3.9k

843

MIT

Python

Updated 4 days ago

a2c, acktr, actor-critic, +17

DRL-Pytorch

XinJingHao

💛72

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

3.3k

385

Python

Updated 6 hours ago

asl, c51, categorical-dqn, +13

PPO-PyTorch

nikhilbarhate99

💛77

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

2.3k

424

MIT

Python

Updated 5 hours ago

deep-learning, deep-reinforcement-learning, policy-gradient, +8

DRL-code-pytorch

Lizhi-sjtu

💛74

Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, and SAC.

1.5k

207

MIT

Python

Updated 3 hours ago

ddpg-pytorch, dqn-pytorch, ppo-gru, +7

Popular-RL-Algorithms

quantumiracle

💛73

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

1.3k

147

Apache-2.0

Jupyter Notebook

Updated 4 days ago

reinforcement-learning, soft-actor-critic, state-of-the-art

Super-mario-bros-PPO-pytorch

vietnh1009

💛74

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

1.3k

236

MIT

Python

Updated 15 hours ago

ai, deep-learning, gym, +11

PyTorch-RL

Khrylx

💛73

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

1.3k

191

MIT

Python

Updated 1 day ago

a2c, deep-reinforcement-learning, fisher-vectors, +8

PPO-for-Beginners

ericyangyu

💛73

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

1.2k

159

MIT

Python

Updated 1 day ago

machine-learning, ppo, pytorch, +2

reinforcement-learning-algorithms

TianhongDai

🧡57

This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)

693

110

Python

Updated 3 weeks ago

a2c, actor-critic, algorithm, +14

machin

iffiX

🧡61

Reinforcement learning library (framework) designed for PyTorch; implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

418

MIT

Python

Updated 2 weeks ago

a3c-pytorch, ddpg, deep-learning, +11

DeepRL_Algorithms

RITCHIEHuang

🧡56

Easy-to-read, easy-to-understand DeepRL algorithm implementations in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

353

Python

Updated 1 week ago

deep-reinforcement-learning, dqn, mujoco, +11

DRL-robot-navigation-IR-SIM

reiniscimurs

🧡61

Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

304

Python

Updated 1 day ago

ddpg, ddpg-pytorch, drl, +11

Control-Algorithm

zhaohaojie1998

🧡66

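Control algorithms: state-feedback and output-feedback control. ADRC (active disturbance rejection control), anti-windup PID control, incremental PID control, fuzzy PID control, linear quadratic regulator (LQR) control, linear quadratic integrator (LQI) control, iterative LQR (iLQR) control, model predictive control (MPC), AI-based intelligent control, heuristic-algorithm control, reinforcement learning (SAC, PPO) control, and trajectory-tracking control for UAVs, robots, and small vehicles.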

271

Python

Updated 5 hours ago

adrc, feedback-control, fuzzy-pid, +13

Multi-Agent-Reinforcement-Learning

yangchen1997

💛71

PyTorch implementations of multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, and Grid Wise Control+DDPG.

246

MIT

Python

Updated 1 day ago

centralized-ppo, grid-wise-control, independent-ppo, +4

ppo-rocket-landing

taherfattahi

🧡61

Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment

244

MIT

Python

Updated 2 weeks ago

ai, machine-learning, ppo, +3

gail-airl-ppo.pytorch

toshikwa

🧡61

PyTorch implementation of GAIL and AIRL based on PPO.

241

MIT

Python

Updated 2 weeks ago

torchrl

RchalYang

🧡55

PyTorch implementation of reinforcement learning algorithms (Soft Actor-Critic (SAC) / DDPG / TD3 / DQN / A2C / PPO / TRPO)

231

Python

Updated 1 week ago

algorithm, ddpg, dqn, +10

torch-ac

lcswillems

🧡61

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

207

MIT

Python

Updated 3 weeks ago

a2c, a3c, actor-critic, +11

PPO-Continuous-Pytorch

XinJingHao

💛70

A clean and robust PyTorch implementation of PPO for continuous action spaces.

173

MIT

Python

Updated 5 days ago

gymRL

Starlight0798

🧡65

Gym-based deep reinforcement learning (DRL) in PyTorch (PPO, PPG, DQN, SAC, DDPG, TD3, and other algorithms)

150

MIT

Python

Updated 1 day ago

dqn, gym, ppo, +2

imitation_learning

CherryPieSexy

❤️35

PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.

149

Python

Updated 4 months ago

a2c, advantage-actor-critic, deep-learning, +12

ppo-pytorch

adik993

❤️45

Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)

147

Python

Updated 2 months ago

cartpole-v1, deep-learning, generalized-advantage-estimation, +8

Contra-PPO-pytorch

vietnh1009

❤️45

Proximal Policy Optimization (PPO) algorithm for Contra

143

Python

Updated 1 month ago

ai, contra, contra-nes, +7

ROS_pytorch_RL

fangxiaoshen

🧡60

Uses the DQN, DDPG, PPO, and SAC algorithms with PyTorch on TurtleBot3, with simulation in Gazebo.

130

Apache-2.0

Python

Updated 1 week ago

ppo

lucidrains

💛70

An implementation of PPO in Pytorch

108

MIT

Python

Updated 19 hours ago

artificial-intelligence, proximal-policy-optimization, reinforcement-learning

simple-A2C-PPO

rgilman33

❤️35

Actor-critic trained with PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.

100

Jupyter Notebook

Updated 3 months ago

Pensieve-PPO

godka

💛71

The simplest implementation of Pensieve (SIGCOMM '17) via state-of-the-art RL algorithms, including PPO, DQN, and SAC, with support for both TensorFlow and PyTorch.

BSD-2-Clause

DIGITAL Command Language

Updated 11 hours ago

a2c, deep-learning, dqn, +5

ReinforceUI-Studio

dvalenciar

❤️45

ReinforceUI-Studio. A Python-based application designed to simplify the configuration and monitoring of RL training processes. Supporting MuJoCo, OpenAI Gymnasium, and DeepMind Control Suite. Algorithms included: CTD4, DDPG, DQN, PPO, SAC, TD3, TQC

MIT

Python

Updated 1 month ago

deep-learning, dm-control, gymnasium, +9

ProtoRL

philtabor

❤️45

A Torch Based RL Framework for Rapid Prototyping of Research Papers

MIT

Python

Updated 1 month ago

actor-critic, ddpg, ddpg-pytorch, +16
