Actor Critic Methods Combining Policy Gradient And Value Function Learning
Actor Critic Methods Combining Policy Gradient And Value Function Learning
Master Actor-Critic architectures. Learn how to combine the variance reduction of value functions (Critic) with the direct optimization of policies (Actor).