Found 769 repositories(showing 30)
lucidrains
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Khrylx
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
tristandeleu
Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch
jingweiz
Deep Reinforcement Learning with pytorch & visdom
pemami4911
PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940
dgriff777
A3C LSTM Atari with Pytorch plus A3G design
Omegastick
PyTorch C++ Reinforcement Learning
conceptofmind
Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.
navneet-nmk
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
vy007vikas
PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.
bentrevett
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
sungyubkim
A pytorch tutorial for DRL(Deep Reinforcement Learning)
jackaduma
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna
Kchu
Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.
RLBot
Example of a python bot using the RLBot framework
Pytorch solutions for UC Berkeley's cs285 assignments
jackaduma
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
fangxiaoshen
在turtlebot3,pytorch上使用DQN,DDPG,PPO,SAC算法,在gazebo上实现仿真。Use DQN, DDPG, PPO, SAC algorithm on turtlebot3, pytorch on turtlebot3, pytorch, and realize simulation on gazebo. Use DQN, DDPG, PPO, SAC algorithm to realize simulation on gazebo.
amarack
Some Reinforcement Learning in Python
navneet-nmk
A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)
kimhc6028
pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
Pytorch starter code for UC Berkeley's cs285 assignments
lucaslingle
Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'
seolhokim
Pytorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation
moskomule
for learning reinforcement learning using PyTorch.
quqixun
《强化学习-原理与Python实现》的Pytorch实现。
syuntoku14
A library for building reinforcement learning and imitation learning agents in Pytorch
jackaduma
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca
ikostrikov
No description available
chagmgang
Pytorch implementation of intrinsic curiosity module with proximal policy optimization