Week 11: Days 68-74
Topics to Cover
- RL fundamentals (MDP, rewards)
- Gym/Gymnasium environments
- Policy gradient methods
- PPO algorithm
- SAC algorithm
- Training in simulation
- Reward shaping
- Curriculum learning
Resources
Stable Baselines3
OpenAI Gym
RL for Robotics