Johnny Code
Күн бұрын
314
1

Implement Epsilon-Greedy & Debug the Training Loop | DQN PyTorch Beginners Tutorial #4

Ғылым және технология

In this Deep Q-Learning, aka Deep Q-Network (DQN), tutorial series, we'll code up the algorithm with PyTorch and train FlappyBird. In this video, we'll implement the Epsilon-Greedy algorithm for the bird to explore the environment. We'll run the training code that we have up to this point in debug mode and understand what is happening internally.
Github code: github.com/johnnycode8/dqn_py...
Support me here: www.buymeacoffee.com/johnnycode

Пікірлер: 3

@ANKUSHKUMAR-jr1pfАй бұрын
Waiting for the next video
@johnnycode
Ай бұрын
Working on it :D
@hakankosebas2085Ай бұрын
could you do balancing double inverse pendulum example?