- Machine Learning: A Probabilistic Perspective, Exercise 11.1
- Machine Learning: A Probabilistic Perspective, Exercise 7.9
- Reinforcement Learning: An Introduction – Exercise 12.5
- Reinforcement Learning: An Introduction – Exercise 6.1
- On Optimal Value Functions
- Reinforcement Learning: Eligibility Traces and TD(lambda)
- Reinforcement Learning: Policy Evaluation through Temporal Difference
- Reinforcement Learning: Evaluating Behavior
- A Taste of Reinforcement Learning
- Starting it up!