Search Results

Found 240 repositories (showing 30)

Super-mario-bros-PPO-pytorch

vietnh1009

💛74

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

1.3k

236

MIT

Python

Updated 2 days ago

ai, deep-learning, gym, +11

ppo-rocket-landing

taherfattahi

🧡61

Proximal Policy Optimization (PPO) algorithm using PyTorch to train an agent for a rocket landing task in a custom environment

244

MIT

Python

Updated 3 weeks ago

ai, machine-learning, ppo, +3

Contra-PPO-pytorch

vietnh1009

🧡65

Proximal Policy Optimization (PPO) algorithm for Contra

144

Python

Updated 2 days ago

ai, contra, contra-nes, +7

big2_PPOalgorithm

henrycharlesworth

❤️45

Application of proximal policy optimization algorithm to the card game Big 2 using Tensorflow

Python

Updated 1 month ago

car-racing-ppo

elsheikh21

❤️45

Implementation of a deep reinforcement learning algorithm, Proximal Policy Optimization (SOTA), on a continuous-action-space OpenAI Gym environment (Box2D/Car Racing v0)

Python

Updated 1 month ago

deep-reinforcement-learning, openai-gym, policy-optimization, +2

Deep-Reinforcement-Learning-for-Automated-Stock-Trading-Strategy

HFTHaidra

🧡65

Stock trading strategies play a critical role in investment. However, it is challenging to design a profitable strategy in a complex and dynamic stock market. In this paper, we propose a deep ensemble reinforcement learning scheme that automatically learns a stock trading strategy by maximizing investment return. We train a deep reinforcement learning agent and obtain an ensemble trading strategy using the three actor-critic based algorithms: Proximal Policy Optimization (PPO), Advantage Actor Critic (A2C), and Deep Deterministic Policy Gradient (DDPG). The ensemble strategy inherits and integrates the best features of the three algorithms, thereby robustly adjusting to different market conditions. In order to avoid the large memory consumption in training networks with continuous action space, we employ a load-on-demand approach for processing very large data. We test our algorithms on the 30 Dow Jones stocks which have adequate liquidity. The performance of the trading agent with different reinforcement learning algorithms is evaluated and compared with both the Dow Jones Industrial Average index and the traditional min-variance portfolio allocation strategy. The proposed deep ensemble scheme is shown to outperform the three individual algorithms and the two baselines in terms of the risk-adjusted return measured by the Sharpe ratio.

Jupyter Notebook

Updated 6 hours ago

Constrained-SAC-PPO

ZhihanLee

🧡55

A pytorch implementation of Constrained Reinforcement Learning Algorithm, including Constrained Soft Actor Critic (Soft Actor Critic Lagrangian) and Proximal Policy Optimization Lagrangian

Python

Updated 1 week ago

Matlab_PPO

AIResearcherHZ

🧡50

This is a MATLAB-based reinforcement learning framework that includes the Proximal Policy Optimization (PPO) algorithm and its multi-agent extension (MAPPO). It supports GPU acceleration and parallel computing, making it suitable for research and engineering applications in control systems.

MIT

MATLAB

Updated 1 month ago

mbppol

akjayant

🧡65

This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm" accepted at NeurIPS 2022.

MIT

Python

Updated 2 days ago

model-based-reinforcement-learning, model-based-rl, neurips-2022, +2

Sonic-PPO-pytorch

vietnh1009

❤️35

Proximal Policy Optimization (PPO) algorithm for Sonic the Hedgehog

Python

Updated 11 months ago

ai, deep-learning, gym, +8

CriptoWhisper

Solrikk

🧡50

TradeWhisperer is a sophisticated cryptocurrency trading bot that leverages advanced Reinforcement Learning techniques, specifically the Proximal Policy Optimization (PPO) algorithm, to navigate the complex world of crypto markets. Built with a focus on adaptability and risk management, this bot combines technical analysis with machine learning.

Apache-2.0

Python

Updated 1 month ago

aitradebybitbybit-api+11

PPO-Pytorch

dragen1860

❤️35

Pytorch Implementation of Proximal Policy Optimization Algorithm

Python

Updated 1 year ago

PPO

adi3e08

🧡65

A clean and minimal implementation of PPO (Proximal Policy Optimization) algorithm in Pytorch, for continuous action spaces.

Python

Updated 5 days ago

ppo-self-play

ImmanuelXIV

❤️40

Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment

MIT

Python

Updated 2 months ago

simple-ppo

Chris-hughes10

🧡55

A clean, modular implementation of the Proximal Policy Optimization (PPO) algorithm in PyTorch, written with a strong focus on readability and educational value, as well as performance.

MIT

Python

Updated 3 weeks ago

PPO

maitchison

❤️25

Example implementation of the Proximal Policy Optimization algorithm

Python

Updated 6 months ago

PPO-algo-with-custom-Unity-environment

dhyeythumar

❤️30

Implementation of Proximal Policy Optimization algorithm on a custom Unity environment.

MIT

ASP.NET

Updated 1 year ago

deep-reinforcement-learning, keras-models, ppo-keras, +4

Proximal-Policy-Optimization-Pytorch

Jiankai-Sun

❤️35

Proximal Policy Optimization (PPO) algorithm and its distributed implementation in PyTorch

GPL-3.0

Python

Updated 3 months ago

Landing-a-SpaceX-Falcon-heavy-using-Proximal-Policy-Optimization-

Barath19

💛70

Landing a SpaceX Falcon Heavy rocket in simulation using reinforcement learning. Reinforcement learning is a technique that lets an agent learn how best to act in an environment using rewards as its signal. OpenAI released a library called Gym that makes it easy to train AI agents. We'll also use the Stable Baselines and gym libraries to build an RL agent capable of landing a rocket. The specific algorithm we will be using is proximal policy optimization, an improved version of the actor-critic algorithm.

Apache-2.0

Python

Updated 6 hours ago

PPO

shareeff

❤️25

Tensorflow implementation of proximal policy optimization (PPO) algorithm

Python

Updated 9 months ago

ProximalPolicyOptimizationKeras

nric

❤️20

This is a deterministic TensorFlow 2.0 (Keras) implementation of OpenAI's proximal policy optimization (PPO) actor-critic algorithm.

Python

Updated 11 months ago

HumanoidRobotWalk

reinai

❤️45

Implementation of Trust Region Policy Optimization and Proximal Policy Optimization algorithms on the objective of Robot Walk.

Python

Updated 1 month ago

deep-reinforcement-learning, gym-environment, gym-environments, +16

donkeycar-rl

ian0

❤️45

Teaching the Donkey car to drive a track in the simulator using State Representation Learning and different Reinforcement Learning Algorithms including Deep Q-Network, Soft Actor-Critic and Proximal Policy Optimization Algorithms.

Jupyter Notebook

Updated 1 month ago

deep-q-network, denoising-autoencoder, policy-optimization-algorithms, +3

Autonomous-Drone-Navigation-TD3-vs-SAC

sidharthmohannair

🧡60

Comparative Study of Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC) for Autonomous Drone Navigation using ROS and Gazebo. This project explores the performance of these algorithms in complex environments, focusing on navigation efficiency, obstacle avoidance, and learning efficiency.

MIT

Python

Updated 4 weeks ago

nanoPPO

jamesliu

❤️40

An efficient implementation of the Proximal Policy Optimization (PPO) algorithm with linear and attention policy for reinforcement learning.

Apache-2.0

Python

Updated 10 months ago

reinforcement-learning

tf2-a2c-ppo

LiubovSobolevskaya

❤️40

Implementation of the Advantage Actor Critic (A2C) and Proximal Policy Optimization (PPO) algorithms that uses the advantages of TensorFlow 2.x.

MIT

Python

Updated 2 years ago

HumanoidClimb-RL

s1ddh-rth

🧡55

This project explores the application of reinforcement learning (RL) to train humanoid robots for dynamic rock climbing movements, focusing on achieving the challenging "dyno" maneuver. Using the Proximal Policy Optimization (PPO) algorithm, the simulation integrates physics-based environments to model realistic climbing scenarios.

Python

Updated 3 weeks ago

machine-learning, ppo, reinforcement-learning, +1

spoof.io

iuliagroza

🧡65

A Proximal Policy Optimization Approach to Detect Spoofing in Algorithmic Trading

Python

Updated 3 days ago

algorithmic-trading, anomaly-detection, deep-reinforcement-learning, +12

PPO_PONG_DISCRETE

rossettisimone

❤️35

Proximal Policy Optimization Algorithm applied to PONG in discrete environment

Jupyter Notebook

Updated 4 months ago

Agent-contra

mycode2021

❤️35

Deep reinforcement learning project based on OpenAI's Retro environment, using the proximal policy optimization and random network distillation algorithms to play Contra.

Python

Updated 1 year ago
