Optimize Target Network PyTorch Code | DQN PyTorch Beginners Tutorial #7
Ғылым және технология
In this Deep Q-Learning, aka Deep Q-Network (DQN), tutorial series, we'll code up the algorithm with PyTorch and train FlappyBird. In this video, we'll revisit the code in the Optimize function where we process the mini-batch of experiences one-by-one. We'll optimize the PyTorch code so that the whole mini-batch is processed at once.
Github code: github.com/johnnycode8/dqn_py...
Support me here: www.buymeacoffee.com/johnnycode
Пікірлер: 1
this tutorial series is awesome! looking forward to actor critic series!