Back to search
From-scratch implementation of Proximal Policy Optimization (PPO) for continuous control using the Pendulum-v1 environment. The project focuses on policy/value network design, numerical stability, hyperparameter sensitivity, and performance analysis. Implemented in PyTorch without high-level reinforcement learning frameworks.
Stars
0
Forks
0
Watchers
0
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
2
commits
Add README.md for PPO Pendulum Continuous Control
88c4632View on GitHub