Implement Target Network | DQN PyTorch Beginners Tutorial #5
Ғылым және технология
In this Deep Q-Learning, aka Deep Q-Network (DQN), tutorial series, we'll code up the algorithm with PyTorch and train FlappyBird. In this video, we'll implement the Target Network. The Target Network is used to estimate expected Q values, which is used to train the Policy Network.
Github code: github.com/johnnycode8/dqn_py...
Support me here: www.buymeacoffee.com/johnnycode
Пікірлер: 2
Fantastic
nice