Found 2 repositories (showing 2)
BoopathiKumar6485
From-scratch implementation of Proximal Policy Optimization (PPO) for continuous control using the Pendulum-v1 environment. The project focuses on policy/value network design, numerical stability, hyperparameter sensitivity, and performance analysis. Implemented in PyTorch without high-level reinforcement learning frameworks.
shivam-0109
A reinforcement learning project using NVIDIA Isaac Lab to train PPO agents for swing-up and balance control of single and double inverted pendulum systems under continuous force actuation.