Policy Gradient Methods With Deep Neural Networks A2c A3c Ppo Trpo
Policy Gradient Methods With Deep Neural Networks A2C A3C Ppo Trpo
Explore state-of-the-art Deep Policy Gradient algorithms. Understand the architecture behind A2C, A3C, TRPO, and the highly popular Proximal Policy Optimization (PPO).