Reinforcement Learning | nightxade

Notes
Reinforcement Learning

Sutton & Barto

1. Chapter 2: Multi-armed Bandits
2. Chapter 3: Finite Markov Decision Processes
3. Chapter 4: Dynamic Programming
4. Chapter 5: Monte Carlo Methods
5. Chapter 6: Temporal-Difference Learning

© 2026 All rights reserved.

Template made with ♥ by enscribe and adapted by nightxade!