Policy Parameterization: Softmax and Gaussian Policies
Learn how to parameterize policies for reinforcement learning (RL) agents. Discover how to use a softmax distribution for discrete action spaces and a Gaussian distribution for continuous actions.
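As a rough illustration of the two parameterizations, the sketch below uses only the Python standard library. All function names (`softmax_policy`, `sample_discrete`, `sample_gaussian`, `gaussian_log_prob`) are hypothetical, and the preferences, mean, and log standard deviation are fixed numbers here; in a real agent they would presumably be outputs of a learned function of the state, such as a neural network.

```python
import math
import random


def softmax_policy(preferences):
    """Map per-action preferences (logits) to action probabilities."""
    m = max(preferences)  # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in preferences]
    total = sum(exps)
    return [e / total for e in exps]


def sample_discrete(preferences, rng):
    """Sample a discrete action index from the softmax policy."""
    probs = softmax_policy(preferences)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def sample_gaussian(mean, log_std, rng):
    """Sample a one-dimensional continuous action from N(mean, std^2)."""
    std = math.exp(log_std)  # parameterizing log(std) keeps std positive
    return rng.gauss(mean, std)


def gaussian_log_prob(action, mean, log_std):
    """Log-density of `action` under N(mean, exp(log_std)^2)."""
    z = (action - mean) / math.exp(log_std)
    return -0.5 * z * z - log_std - 0.5 * math.log(2.0 * math.pi)


rng = random.Random(0)  # seeded only so the sketch is reproducible

# Discrete case: three actions with assumed preferences.
probs = softmax_policy([1.0, 2.0, 3.0])
discrete_action = sample_discrete([1.0, 2.0, 3.0], rng)

# Continuous case: assumed mean 0.0 and log standard deviation -1.0.
continuous_action = sample_gaussian(0.0, -1.0, rng)
log_prob = gaussian_log_prob(continuous_action, 0.0, -1.0)
```

The log-probability function is included because policy gradient methods typically differentiate the log-probability of the sampled action with respect to the policy parameters; the sketch itself does no learning.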