Search Results

Found 769 repositories (showing 30)

PaLM-rlhf-pytorch

lucidrains

💛85

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

7.9k

680

MIT

Python

Updated 2 hours ago

artificial-intelligence, attention-mechanisms, deep-learning, +3

PyTorch-RL

Khrylx

💛73

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

1.3k

191

MIT

Python

Updated 4 days ago

a2c, deep-reinforcement-learning, fisher-vectors, +8

pytorch-maml-rl

tristandeleu

💛73

Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch

878

167

MIT

Python

Updated 20 hours ago

pytorch-rl

jingweiz

🧡62

Deep Reinforcement Learning with pytorch & visdom

805

144

MIT

Python

Updated 1 week ago

a3c, acer, actor-critic, +8

neural-combinatorial-rl-pytorch

pemami4911

🧡62

PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940

607

145

MIT

Python

Updated 1 week ago

neural-combinatorial-optimization, pytorch, reinforcement-learning, +1

rl_a3c_pytorch

dgriff777

💛72

A3C LSTM Atari with Pytorch plus A3G design

568

115

Apache-2.0

Python

Updated 6 days ago

a3c, a3c-gpu, a3g, +9

pytorch-cpp-rl

Omegastick

🧡61

PyTorch C++ Reinforcement Learning

532

MIT

C++

Updated 1 week ago

a2c, actor-critic, advantage-actor-critic, +11

LaMDA-rlhf-pytorch

conceptofmind

💛71

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

469

MIT

Python

Updated 17 hours ago

artificial-intelligence, attention-mechanism, deep-learning, +4

pytorch-rl

navneet-nmk

❤️46

This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch

452

Python

Updated 1 month ago

ddpg, deep-learning, deep-reinforcement-learning, +17

PyTorch-ActorCriticRL

vy007vikas

🧡66

PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.

422

Python

Updated 6 days ago

pytorch-rl

bentrevett

🧡61

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

296

MIT

Jupyter Notebook

Updated 2 weeks ago

a2c, actor-critic, advantage-actor-critic, +11

Deep_RL_with_pytorch

sungyubkim

❤️46

A PyTorch tutorial for DRL (Deep Reinforcement Learning)

225

Jupyter Notebook

Updated 1 month ago

a2c, c51, counterfactual-regret-minimization, +13

Vicuna-LoRA-RLHF-PyTorch

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna

221

MIT

Python

Updated 3 months ago

chatgpt, finetune, gpt, +10

DeepRL_PyTorch

Kchu

🧡61

Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.

217

Apache-2.0

Python

Updated 1 week ago

algorithm, algorithms, c51, +7

RLBotPythonExample

RLBot

🧡51

Example of a python bot using the RLBot framework

193

MIT

Python

Updated 1 month ago

rocket-league

berkeley-deep-RL-pytorch-solutions

mdeib

🧡65

Pytorch solutions for UC Berkeley's cs285 assignments

157

Python

Updated 5 days ago

cs285, pytorch-rl, reinforcement-learning

ChatGLM-LoRA-RLHF-PyTorch

jackaduma

🧡60

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

139

MIT

Python

Updated 1 week ago

chatglm, chatglm-6b, chatgpt, +11

ROS_pytorch_RL

fangxiaoshen

💛70

Uses the DQN, DDPG, PPO, and SAC algorithms with PyTorch on TurtleBot3, with simulation in Gazebo.

131

Apache-2.0

Python

Updated 1 day ago

python-rl

amarack

❤️46

Some Reinforcement Learning in Python

115

GPL-3.0

Python

Updated 1 month ago

Pytorch-RL-CPP

navneet-nmk

❤️35

A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)

101

C++

Updated 4 months ago

atari, cpp, deep-learning, +17

pytorch-noreward-rl

kimhc6028

❤️40

PyTorch implementation of Curiosity-driven Exploration by Self-supervised Prediction

MIT

Python

Updated 9 months ago

a3c, deep-reinforcement-learning, pytorch

berkeley-deep-RL-pytorch-starter

mdeib

🧡65

Pytorch starter code for UC Berkeley's cs285 assignments

Python

Updated 6 days ago

cs285, pytorch-rl, reinforcement-learning

pytorch_rl2

lucaslingle

🧡65

Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'

Python

Updated 6 days ago

deep-learning, meta-learning, meta-reinforcement-learning, +2

InverseRL-Pytorch

seolhokim

🧡50

Pytorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation

MIT

Python

Updated 2 months ago

airl, eairl, gail, +7

pytorch.rl.learning

moskomule

❤️35

for learning reinforcement learning using PyTorch.

Python

Updated 1 year ago

pytorch, reinforcement-learning

RL-Python-Pytorch

quqixun

🧡55

PyTorch implementation of the book "Reinforcement Learning: Principles and Python Implementation" (《强化学习-原理与Python实现》).

Jupyter Notebook

Updated 4 weeks ago

pytorch, reinforcement-learning, rl-book

pytorch-rl-il

syuntoku14

🧡50

A library for building reinforcement learning and imitation learning agents in Pytorch

MIT

Python

Updated 2 months ago

Alpaca-LoRA-RLHF-PyTorch

jackaduma

🧡50

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca

MIT

Python

Updated 1 month ago

alpaca, chatgpt, deepspeed, +10

pytorch-rl

ikostrikov

❤️25

No description available

Updated 4 months ago

pytorch, reinforcement-learning, reinforcement-learning-algorithms

pytorch_ppo_rl

chagmgang

❤️35

Pytorch implementation of intrinsic curiosity module with proximal policy optimization

Python

Updated 5 months ago

breakout, curiosity, deep-learning, +6
