Found 85 repositories(showing 30)
vwxyzjn
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
meta-pytorch
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
cpwan
Reinforcement learning for operation research problems with OpenAI Gym and CleanRL
vwxyzjn
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
firechecking
Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
sash-a
Simple single file implementations of Reinforcement Learning algorithms in Julia
RyanNavillus
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
FhG-IISB
Clean and easy to understand implementations of many Quantum Reinforcement Learning agents as well as their classical analouges. Greately inspired by the orgininal CleanRL
vwxyzjn
No description available
AlignmentResearch
Experimenting with CleanRL for learned-planners
acezsq
cleanrl应用于自定义gym环境
jinPrelude
Implementation of the paper "Towards mental time travel: a hierarchical memory for reinforcement learning agent" using CleanRL
jualat
A framework for Reinforcement Learning from Human Feedback based on CleanRL
superboySB
MindSpore version of CleanRL, for supporting online reinforcement learning algorithms
LucMc
This repository aims to provide the minimalism of cleanRL with the performance of SBX
sarisabban
De novo cyclic protein polypeptide design using reinforcement learning.
FelipeMartins96
No description available
amulil
the rl algos implementation inspired by cleanrl
vwxyzjn
No description available
AbdullahVanlioglu
"A modular and clean research framework for Generative AI, based on popular open-source repositories like Transformers, TRL, Diffusions, and CleanRL."
binomiya
copy cleanrl at 20221230
kaushik-dev09
No description available
SemyonSkovpin
My implementation of dqn in jupyter lab. Hyperparameters by https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl
Be-Sharps
CleanRL PPO implementation for JAXAtari VideoPinball
Picus303
Stable and Mathematically Accurate Reinforcement Learning
roger-creus
No description available
wzhhasadream
A deep reinforcement learning library implemented with Flax/NNX, inspired by CleanRL design philosophy and integrated with the latest SimBa architecture
dibbla
Several minimal implemetations of RL/Imitation algorithms, following CleanRL's philosophy
psdecabooter
Trained a custom hexapod robot to crawl using reinforcement learning
0xhypevida
A Gymnasium-compatible deep RL framework for Hytale. Trains autonomous agents to gather, craft, build, and survive using PPO and curriculum learning. Includes a Java server plugin (TCP/MessagePack bridge), 7 environments, 29 block types, Hytale-accurate creature AI, and full SB3/CleanRL integration.