Back to search
ARI3212 : Advanced Reinforcement Learning Project | Solving LunarLander-v3 (discrete) with DQN vs Double DQN (robustness under wind/turbulence), plus LunarLanderContinuous-v3 (continuous) with PPO vs SAC to compare on-policy vs off-policy learning, including hyperparameter tuning and learning-curve evaluation.
Stars
0
Forks
0
Watchers
0
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
6
commits