The Policy Gradient Theorem