Reinforcement Learning | AI Summer

Trust Region and Proximal policy optimization (TRPO and PPO)

Reinforcement Learning

Trust Region and Proximal policy optimization (TRPO and PPO)

Trust Region policy optimization vs Proximal policy optimization

The idea behind Actor-Critics and how A2C and A3C improve them

Reinforcement Learning

The idea behind Actor-Critics and how A2C and A3C improve them

Actor critics, A2C, A3C

Unravel Policy Gradients and REINFORCE

Reinforcement Learning

Unravel Policy Gradients and REINFORCE

Explore Policy-based methods and dive into policy gradients

Q-targets, Double DQN and Dueling DQN

Reinforcement Learning

Q-targets, Double DQN and Dueling DQN

Fixed Q-targets, Double DQN, Dueling DQN, Prioritized Replay

Deep Q Learning and Deep Q Networks

Reinforcement Learning

Deep Q Learning and Deep Q Networks

Learn what Q Learning is and build a Deep Q Network to play games

The secrets behind Reinforcement Learning

Reinforcement Learning

The secrets behind Reinforcement Learning

The central idea behind reinforcement learning and an overview of its algorithms