High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Stars
9.5k
Forks
1.0k
Watchers
9.5k
Open Issues
95
Overall repository health assessment
No package.json found
This might not be a Node.js project
703
commits
51
commits
10
commits
7
commits
7
commits
6
commits
6
commits
3
commits
3
commits
3
commits
Use the training `end_e` as the `evaluation(..., epsilon=end_e)` (#430)
a00e71bView on GitHubFix: PQN needs to mask the return computation if next state is done (#494)
1ed8062View on GitHubremoving the unused noise_clip from sac_continuous_actions.py (probably a relic from TD3) (#456)
65789baView on GitHub