Additional Readings
These are optional readings if you want to go deeper.
Monte Carlo and TD Learning
To dive deeper into Monte Carlo and Temporal Difference Learning:
- Why do temporal difference (TD) methods have lower variance than Monte Carlo methods?
- When are Monte Carlo methods preferred over temporal difference ones?
Q-Learning
- Reinforcement Learning: An Introduction, Richard Sutton and Andrew G. Barto Chapter 5, 6 and 7
- Foundations of Deep RL Series, L2 Deep Q-Learning by Pieter Abbeel