Understand the Deadly Triad in reinforcement learning. Learn why combining function approximation, bootstrapping, and off-policy learning causes divergence.