Understand how Monte Carlo control optimizes policies. Learn the crucial differences between on-policy and off-policy learning, and how off-policy methods use importance sampling.