Policy Parameterization: Softmax and Gaussian Policies