Found 14,148 repositories(showing 30)
๐งโ๐ซ 60+ Implementations/tutorials of deep learning papers with side-by-side notes ๐; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ๐ฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐ง
datawhalechina
ๅผบๅๅญฆไน ไธญๆๆ็จ๏ผ่่ไนฆ๐๏ผ๏ผๅจ็บฟ้ ่ฏปๅฐๅ๏ผhttps://datawhalechina.github.io/easy-rl/
thu-ml
An elegant PyTorch deep reinforcement learning library.
vwxyzjn
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
MorvanZhou
Simple Reinforcement learning tutorials, ่ซ็ฆPython ไธญๆAIๆๅญฆ
OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
udacity
Repo for the Deep Reinforcement Learning Nanodegree program
andri27-ts
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
AI4Finance-Foundation
Massively Parallel Deep Reinforcement Learning. ๐ฅ
simoninithomas
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
ikostrikov
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
ShangtongZhang
Modularized Implementation of Deep RL Algorithms in PyTorch
XinJingHao
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
seungeunrho
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
AI4Finance-Foundation
FinRL-X: An AI-Native Modular Infrastructure for Quantitative Trading
opendilab
PPO x Family DRL Tutorial Course๏ผๅณ็ญๆบ่ฝๅ ฅ้จ็บงๅ ฌๅผ่ฏพ๏ผ8่่ฏพๅธฎไฝ ็ๆธ ็ฎๆณ็่ฎบ๏ผ็้กบไปฃ็ ้ป่พ๏ผ็ฉ่ฝฌๅณ็ญAIๅบ็จๅฎ่ทต ๏ผ
nikhilbarhate99
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
marlbenchmark
This is the official implementation of Multi-Agent PPO (MAPPO).
Lizhi-sjtu
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
OpenLMLab
Secrets of RLHF in Large Language Models Part I: PPO
kengz
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
quantumiracle
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
vietnh1009
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Khrylx
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
ericyangyu
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
rohanpsingh
Training a humanoid robot for locomotion using Reinforcement Learning
qfettes
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
agi-brain
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
luchris429
Really Fast End-to-End Jax RL Implementations