Found 495 repositories(showing 30)
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
ikostrikov
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
XinJingHao
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
nikhilbarhate99
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Lizhi-sjtu
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
quantumiracle
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
vietnh1009
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Khrylx
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
ericyangyu
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
TianhongDai
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are still in progress)
iffiX
Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...
RITCHIEHuang
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
reiniscimurs
Deep Reinforcement Learning for mobile robot navigation in IR-SIM simulation. Using DRL (SAC, TD3, PPO, DDPG) neural networks, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.
zhaohaojie1998
控制算法,状态、输出反馈控制。ADRC自抗扰控制,抗积分饱和PID控制,增量式PID控制,模糊FuzzyPID控制,线性二次型调节器LQR控制,线性二次型积分器LQI控制,迭代iLQR控制,模型预测MPC控制,AI智能控制,启发算法控制,强化学习SAC、PPO控制,无人机、机器人、小车轨迹跟踪控制
yangchen1997
PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG.
taherfattahi
Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment
toshikwa
PyTorch implementation of GAIL and AIRL based on PPO.
RchalYang
Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
lcswillems
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
XinJingHao
A clean and robust Pytorch implementation of PPO on continuous action space.
Starlight0798
基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)
CherryPieSexy
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
adik993
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
vietnh1009
Proximal Policy Optimization (PPO) algorithm for Contra
fangxiaoshen
在turtlebot3,pytorch上使用DQN,DDPG,PPO,SAC算法,在gazebo上实现仿真。Use DQN, DDPG, PPO, SAC algorithm on turtlebot3, pytorch on turtlebot3, pytorch, and realize simulation on gazebo. Use DQN, DDPG, PPO, SAC algorithm to realize simulation on gazebo.
lucidrains
An implementation of PPO in Pytorch
rgilman33
Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.
godka
The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art RL algorithms, including PPO, DQN, SAC, and support for both TensorFlow and PyTorch.
dvalenciar
ReinforceUI-Studio. A Python-based application designed to simplify the configuration and monitoring of RL training processes. Supporting MuJoCo, OpenAI Gymnasium, and DeepMind Control Suite. Algorithms included: CTD4, DDPG, DQN, PPO, SAC, TD3, TQC
philtabor
A Torch Based RL Framework for Rapid Prototyping of Research Papers