Discover Thompson Sampling for reinforcement learning. Learn how to use Bayesian probability updates to sample actions from a posterior distribution.