Master the Markov Decision Process (MDP). Learn the mathematical framework underlying reinforcement learning, including transition probabilities and value functions.