Implement Epsilon-Greedy & Debug the Training Loop | DQN PyTorch Beginners Tutorial #4
Ғылым және технология
In this Deep Q-Learning, aka Deep Q-Network (DQN), tutorial series, we'll code up the algorithm with PyTorch and train FlappyBird. In this video, we'll implement the Epsilon-Greedy algorithm for the bird to explore the environment. We'll run the training code that we have up to this point in debug mode and understand what is happening internally.
Github code: github.com/johnnycode8/dqn_py...
Support me here: www.buymeacoffee.com/johnnycode
Пікірлер: 3
Waiting for the next video
@johnnycode
Ай бұрын
Working on it :D
could you do balancing double inverse pendulum example?