Notes
Reinforcement Learning
Sutton & Barto
Chapter 2: Multi-armed Bandits
Chapter 3: Finite Markov Decision Processes
Chapter 4: Dynamic Programming
Chapter 5: Monte Carlo Methods
Chapter 6: Temporal-Difference Learning