Search Results

Found 85 repositories(showing 30)

cleanrl

vwxyzjn

💚90

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

9.5k

1.0k

NOASSERTION

Python

Updated 2 hours ago

a2cactor-criticadvantage-actor-critic+13

LeanRL

meta-pytorch

🧡56

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

681

NOASSERTION

Python

Updated 1 week ago

RLOR

cpwan

💛70

Reinforcement learning for operation research problems with OpenAI Gym and CleanRL

128

NOASSERTION

Python

Updated 12 hours ago

attentioncvrpoperation-research+4

cleanba

vwxyzjn

💛70

CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL

124

NOASSERTION

Python

Updated 5 days ago

CleanRL

firechecking

🧡50

Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.

MIT

Python

Updated 1 month ago

CleanRL.jl

sash-a

❤️40

Simple single file implementations of Reinforcement Learning algorithms in Julia

MIT

Julia

Updated 4 months ago

PPO-v3

RyanNavillus

❤️35

Adding Dreamer-v3's implementation tricks to CleanRL's PPO

Apache-2.0

Python

Updated 3 months ago

cleanqrl

FhG-IISB

🧡55

Clean and easy to understand implementations of many Quantum Reinforcement Learning agents as well as their classical analouges. Greately inspired by the orgininal CleanRL

NOASSERTION

Python

Updated 3 weeks ago

ddpgdqnjumanji+7

envpool-cleanrl

vwxyzjn

❤️30

No description available

MIT

Python

Updated 5 months ago

train-learned-planner

AlignmentResearch

❤️35

Experimenting with CleanRL for learned-planners

NOASSERTION

Python

Updated 1 month ago

rlCode

acezsq

❤️35

cleanrl应用于自定义gym环境

Python

Updated 11 months ago

hcam-torch

jinPrelude

🧡50

Implementation of the paper "Towards mental time travel: a hierarchical memory for reinforcement learning agent" using CleanRL

MIT

Python

Updated 2 months ago

CleanRLHF

jualat

❤️20

A framework for Reinforcement Learning from Human Feedback based on CleanRL

Python

Updated 4 months ago

mindspore-cleanrl

superboySB

❤️40

MindSpore version of CleanRL, for supporting online reinforcement learning algorithms

MIT

Python

Updated 1 year ago

PPO-JAX

LucMc

❤️45

This repository aims to provide the minimalism of cleanRL with the performance of SBX

Python

Updated 1 month ago

MiraMar

sarisabban

❤️40

De novo cyclic protein polypeptide design using reinforcement learning.

GPL-2.0

Python

Updated 1 year ago

biomolecular-simulationcleanrlcomputational-biology+17

rsoccer-isaac-cleanrl

FelipeMartins96

❤️25

No description available

Python

Updated 2 years ago

Tao

amulil

❤️40

the rl algos implementation inspired by cleanrl

MIT

Python

Updated 1 year ago

envpool-xla-cleanrl

vwxyzjn

❤️15

No description available

MIT

Python

Updated 1 year ago

Modular-GenAI

AbdullahVanlioglu

❤️35

"A modular and clean research framework for Generative AI, based on popular open-source repositories like Transformers, TRL, Diffusions, and CleanRL."

Python

Updated 1 year ago

cleanrl

binomiya

❤️40

copy cleanrl at 20221230

NOASSERTION

Jupyter Notebook

Updated 3 years ago

Cleanrl

kaushik-dev09

🧡60

No description available

NOASSERTION

Python

Updated 2 days ago

dqn

SemyonSkovpin

❤️45

My implementation of dqn in jupyter lab. Hyperparameters by https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl

Python

Updated 1 month ago

JAXAtariVideoPinballAgents

Be-Sharps

❤️40

CleanRL PPO implementation for JAXAtari VideoPinball

MIT

Python

Updated 3 months ago

CleanRL

Picus303

🧡55

Stable and Mathematically Accurate Reinforcement Learning

Jupyter Notebook

Updated 3 weeks ago

auto-cleanrl

roger-creus

🧡55

No description available

NOASSERTION

Python

Updated 2 days ago

CleanRL-JAX

wzhhasadream

❤️35

A deep reinforcement learning library implemented with Flax/NNX, inspired by CleanRL design philosophy and integrated with the latest SimBa architecture

Python

Updated 4 months ago

clean-isaac-gym

dibbla

❤️40

Several minimal implemetations of RL/Imitation algorithms, following CleanRL's philosophy

MIT

Python

Updated 2 years ago

Hexapod-RL---AdvRLFinal

psdecabooter

🧡50

Trained a custom hexapod robot to crawl using reinforcement learning

NOASSERTION

Python

Updated 2 months ago

cleanrlpybullet-environmentsreinforcement-learning+1

A Gymnasium-compatible deep RL framework for Hytale. Trains autonomous agents to gather, craft, build, and survive using PPO and curriculum learning. Includes a Java server plugin (TCP/MessagePack bridge), 7 environments, 29 block types, Hytale-accurate creature AI, and full SB3/CleanRL integration.

Java

Updated 2 weeks ago

GitHub Explorer

Search Results

cleanrl

LeanRL

RLOR

cleanba

CleanRL

CleanRL.jl

PPO-v3

cleanqrl

envpool-cleanrl

train-learned-planner

rlCode

hcam-torch

CleanRLHF

mindspore-cleanrl

PPO-JAX

MiraMar

rsoccer-isaac-cleanrl

Tao

envpool-xla-cleanrl

Modular-GenAI

cleanrl

Cleanrl

dqn

JAXAtariVideoPinballAgents

CleanRL

auto-cleanrl

CleanRL-JAX

clean-isaac-gym

Hexapod-RL---AdvRLFinal

HytaleRL

cleanrl

LeanRL

RLOR

cleanba

CleanRL

CleanRL.jl

PPO-v3

cleanqrl

envpool-cleanrl

train-learned-planner

rlCode

hcam-torch

CleanRLHF

mindspore-cleanrl

PPO-JAX

MiraMar

rsoccer-isaac-cleanrl

Tao

envpool-xla-cleanrl

Modular-GenAI

cleanrl

Cleanrl

dqn

JAXAtariVideoPinballAgents

CleanRL

auto-cleanrl

CleanRL-JAX

clean-isaac-gym

Hexapod-RL---AdvRLFinal

HytaleRL