Deep Q-Networks
Deep reinforcement learning with Q-networks: DQN, Double DQN, Dueling DQN, Prioritized Experience Replay, and Rainbow DQN.
Learning Objectives
1. From Tabular to Function Approximation
1.1 Why Deep Networks?
1.2 The Deadly Triad
2. DQN (Deep Q-Network)
2.1 Experience Replay
2.2 Target Network
2.3 Full Algorithm
2.4 PyTorch Implementation: CartPole
3. Double DQN
3.1 Overestimation Bias
3.2 Solution
4. Dueling DQN
4.1 Architecture: V(s) + A(s,a)
5. Prioritized Experience Replay
5.1 TD Error as Priority
6. Rainbow DQN
7. DQN for Robotics: Discrete Control
Exercises
References