INFO:
This video introduces the algorithms that reside within the agent. We’ll cover why we use neural networks to represent functions and why a powerful family of methods called actor-critic may require you to set up two neural networks.
Policies and Learning Algorithms | Reinforcement Learning, Part 3 - MATLAB