Step-by-step tutorial on the TD(0) algorithm. Learn how to update value function estimates after every transition using the TD error, without waiting for the episode to end.
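
As a rough illustration of the update described here, the following minimal sketch applies one TD(0) step per observed transition. It assumes a tabular value function stored in a plain dict and uses the standard form of the rule: TD error = reward + gamma * V(next_state) - V(state), then V(state) is moved by alpha times that error. The function name `td0_update`, the state labels, and the values of `alpha` and `gamma` are illustrative assumptions, not taken from the tutorial itself.

```python
def td0_update(V, state, reward, next_state, done, alpha=0.1, gamma=1.0):
    """Apply one TD(0) update to V in place and return the TD error.

    V is a dict mapping states to value estimates (hypothetical layout).
    """
    # A terminal next state contributes no bootstrap term (its value is 0).
    bootstrap = 0.0 if done else gamma * V[next_state]
    # TD error: one-step target minus the current estimate.
    td_error = reward + bootstrap - V[state]
    # Move the estimate a fraction alpha toward the target.
    V[state] += alpha * td_error
    return td_error


# Two made-up transitions, each processed as soon as it is observed.
V = {"A": 0.0, "B": 0.0}
delta_1 = td0_update(V, "A", reward=1.0, next_state="B", done=False,
                     alpha=0.5, gamma=0.9)
delta_2 = td0_update(V, "B", reward=0.0, next_state="A", done=False,
                     alpha=0.5, gamma=0.9)
print(V, delta_1, delta_2)
```

Each call uses only the current transition, so the estimates change during the episode rather than after it. The second update already bootstraps from the value of "A" that the first update produced.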