Found 38 repositories(showing 30)
Reytuag
No description available
bmazoure
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights on all environments.
subho406
Implementation of Proximal Policy Optimization in Jax+Flax
PratikKunapuli
RL environments and PPO training code to develop trajectory tracking controllers for various robot systems, written in Jax.
CognitiveAISystems
KAGE-Bench: pure JAX 2D platformer RL benchmark for visual OOD generalization. Massively-parallel (vmap/JIT) RGB env with YAML-configurable visuals/physics, plus PPO-CNN (Flax) training scripts.
zombie-einstein
JAX Implementation of Proximal Policy Optimisation Algorithm
BeeGass
Implementing and testing HiPPO and S4
quangr
jax version of ppo algorithm in mujoco enviroment, achieve SOTA(tianshou)
karush17
No description available
LucMc
This repository aims to provide the minimalism of cleanRL with the performance of SBX
azzeddineCH
a jax implementation of highway environment PPO agent with discrete and continuous spaces
knyazer
Near-SOTA PPO implementation with Jax+Brax+Equinox
BDEvan5
A repo of Proximal Policy Optimisation (PPO) algorithms written in JAX
emiwar
Implementation of PPO for recurrent/stateful networks using JAX and flax.nnx
alessioarcara
A JAX/Flax implementation of Independent PPO for cooperative multi-agent reinforcement learning on the Overcooked-AI benchmark
emmanuelgendy
A custom, high-performance Proximal Policy Optimization (PPO) agent implemented in pure JAX/Equinox, designed to control building energy simulations (PhyLFlex project) at massive scale.
ConnorWatts
Deep Reinforcement Learning based Rubik’s Cube solver written in JAX, Haiku and RLax. Developed a custom environment supporting cubes of variable size. Currently developing support for DQN, QR-DQN and PPO.
yunusdanabas
JAX-Brax reinforcement learning pipeline for training a passive bipedal walker. Implements a curriculum from FSM expert demonstrations to BC-seeded PPO for robust locomotion. Features GPU-accelerated physics, parallel training, and scalable network architectures for efficient learning.
LeonEricsson
ppo in jax
sazzad2024
No description available
shivenaggarwal
ppo in pure jax
james-ralph8555
Cartpole PPO with JAX
loicmagne
Minimal implementation of Proximal Policy Optimization (PPO) using Jax
PedroLucas63
No description available
Kalyan0821
No description available
azimi99
Easy-to-follow implementation of Proximal Policy Optimization in JAX (Gym Compatible)
njomowachira
JAX implementation of Proximal Policy Optimization (PPO) applied to CartPole environment. Demonstrates policy gradient reinforcement learning with modular, efficient code. Includes neural network policy, value function, and adaptive training techniques.
tims457
No description available
Leikoe
a fully jitted & multi accelerator JAX implementation of PPO for discrete and continuous envs
tflahaul
minimal PPO training script (JAX)