David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86
Ғылым және технология
David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.
Support this podcast by signing up with these sponsors:
- MasterClass: masterclass.com/lex
- Cash App - use code "LexPodcast" and download:
- Cash App (App Store): apple.co/2sPrUHe
- Cash App (Google Play): bit.ly/2MlvP5w
EPISODE LINKS:
Reinforcement learning (book): amzn.to/2Jwp5zG
PODCAST INFO:
Podcast website:
lexfridman.com/podcast
Apple Podcasts:
apple.co/2lwqZIr
Spotify:
spoti.fi/2nEwCF8
RSS:
lexfridman.com/feed/podcast/
Full episodes playlist:
• Lex Fridman Podcast
Clips playlist:
• Lex Fridman Podcast Clips
OUTLINE:
0:00 - Introduction
4:09 - First program
11:11 - AlphaGo
21:42 - Rule of the game of Go
25:37 - Reinforcement learning: personal journey
30:15 - What is reinforcement learning?
43:51 - AlphaGo (continued)
53:40 - Supervised learning and self play in AlphaGo
1:06:12 - Lee Sedol retirement from Go play
1:08:57 - Garry Kasparov
1:14:10 - Alpha Zero and self play
1:31:29 - Creativity in AlphaZero
1:35:21 - AlphaZero applications
1:37:59 - Reward functions
1:40:51 - Meaning of life
CONNECT:
- Subscribe to this KZread channel
- Twitter: / lexfridman
- LinkedIn: / lexfridman
- Facebook: / lexfridmanpage
- Instagram: / lexfridman
- Medium: / lexfridman
- Support on Patreon: / lexfridman
Пікірлер: 455
I really enjoyed this conversation with David. Here's the outline: 0:00 - Introduction 4:09 - First program 11:11 - AlphaGo 21:42 - Rule of the game of Go 25:37 - Reinforcement learning: personal journey 30:15 - What is reinforcement learning? 43:51 - AlphaGo (continued) 53:40 - Supervised learning and self play in AlphaGo 1:06:12 - Lee Sedol retirement from Go play 1:08:57 - Garry Kasparov 1:14:10 - Alpha Zero and self play 1:31:29 - Creativity in AlphaZero 1:35:21 - AlphaZero applications 1:37:59 - Reward functions 1:40:51 - Meaning of life
@abogaziah
4 жыл бұрын
OMG THANK YOU
@riccardomereu1813
4 жыл бұрын
Thank you very much Lex 🙏
@pyshine_official
4 жыл бұрын
Thanks
@franj4139
4 жыл бұрын
Please invite Humberto Maturana: He had develop theories on human intelligence, consciousness and understanding. He is in his 90s, we could lose his takes on artificial intelligence
@ivannogolica364
4 жыл бұрын
Bring David Deutsch please! :)
"He'll be remembered as the last person to beat AlphaGo" man!!
@joelkavanagh1464
2 жыл бұрын
,,, kudos n respect on that comment! ... greetINX from s.lem jr ... .. . ...............
Seeing this after the AlphaGo doc!
@rdcalderon
4 жыл бұрын
Watching the documentary before watching this interview definitely adds value. kzread.info/dash/bejne/iYyprZiglc67Ybw.html
@ecavero1
4 жыл бұрын
As have I! I was searching of an Alpha Zero doc. This is where I got so far. Not disappointed at all!
@maplegoose6364
4 жыл бұрын
Yes came here directly after the Doc as well. Had never heard of GO! prior to 3hrs a go. Indelibly registered and imprinted now :D
@khall187
4 жыл бұрын
Same
@schwajj
4 жыл бұрын
maap no need to capitalize and exclaim, any more than you’d write CHESS!
His answers are so articulate!
THIS IS THE ONE I'VE BEEN WAITING FOR!
@oncedidactic
4 жыл бұрын
@@mikhailfranco dude, thanks 🙌
Amazing, this conversations are so meaningful to the future of humanity that they should be broadcasted on national television. That way children would more easily find meaningful role models and access to the type of insightful ideas that give birth to passions and eventually discoveries.
I am very happy to see that 3.22M people are watching this channel.
This is a banger of an interview. AlphaZero is a harbinger of the future
Again, Mr. Fridman, THANK YOU for keeping this going, especially now. When I need to get my mind off the current world situation I come here. Your talks always take me to a better place. Thank you. Be safe. Stay healthy.
3 years later I am here... Latest AI developments makes me ask for a second round with David Silver. Thanks for sharing 👍🏼
I watched Alpha Go vs. Lee sedol tournament documentary Deepmind recently uploaded, and I cried. It was so inspiring, touching and beautiful. Thanks very much Lex for this podcast.
Man, David Silver is so incredibly humble...
I can't describe or express how valuable this interview is for understanding what's going to happen in the future
Thanks for making this podcast. David Silver chooses his words very well, his stories are very clear and inspiring! I could have listened much longer ;-)
Oh man! That meaning of life interpretation! I think I'm gonna click this 1:41:20 every night before sleep from now on. Thank you Lex for making this possible! ❤️
@sabelch
4 жыл бұрын
I initially cringed a little when Lex decided to "go there" with the meaning of life question but pshew! Silver gave a great answer.
@Jannikheu
4 жыл бұрын
sabelch yes that answer was very impressive and I think demonstrated his capacity of deep thinking
@iwanjones7334
4 жыл бұрын
I was laughing to myself and thinking: "All he needs to do now is ask him the meaning of life question". And then he did!
@decidrophob
4 жыл бұрын
Indeed, probably David's comment regarding the meaning of life was by far the most philosophically meaningful I have ever come across.
@Mikey-lj2kq
3 жыл бұрын
there's a book called 'the fabrics of reality'
Many academics are terrible at explaining their domain of expertise. David is a quality academic and has remained grounded enough to explain himself to normal folk like me. Well done.
you just gotta love David Silver and his ideas, thoughts and accent
I can ignore everyone else but David Silver talking about AI. His lectures and courses taught me RL.
Mind teased, tantalized, and finally thrown into a tizzy. Love every one of your interviews Lex. All I want to do is watch them to get inspired to think in new ways. THANKS MAN!
This interview is LEGENDARY!... watching it for the second time. Definitely in the top 3 on youtube!
Discovery is a joy. Discovering the existence of David Silver and his amazing way of thinking is pure gold. Thank you Lex.
Awesome conversation, David is incredibly interesting and humble also amazing questions from Lex. Thanks to both of you for making it.
Wow! This was an incredibly insightful and inspiring conversation. Thank you Lex, David, and your teams for this.
Incredible podcast, probably my favourite! It would be incredible to have a second part!
This is a really great interview and very enlightening. Thanks for all of your hard work bringing this stuff to us. Keep up the good work.
Thank you both! It was, again, an awesome conversation.
1:40:51 : One of the best answers for the purpose and meaning of life I have heard so far. Incredible!
I love the content you put out man! It's always interesting, always paradigm challenging, calm, informed, you! Thanks!
Thank you for Lex and David! Very interesting and inspiring conversation about first principles of Artificial Intelligence.
This was the AI interview I've been waiting for - it did deliver. It could have been a bit longer and included the protein folding work, though. Perhaps that's ongoing and still a competitive area. There is a certain clarity of articulation from the guests I enjoy most - reminds me of Jeff Hawkins. Also a sense of practical application.
@palakrishna9921
3 жыл бұрын
Pala
@Jacob-sb3su
3 жыл бұрын
They figured it out
@andrewtoebbe3885
3 жыл бұрын
@@Jacob-sb3su they?
I love how the wall and window are decorated to resemble a go board
David and demis, hope you get nobel prize someday soon.
Lex, It is very clear that you love what you do. It totally shows. You are always super prepared and well engaged with your guests. Yours has become my absolutely favorite podcast. Listening to a 2 hr podcast of yours is as intellectually fulfilling as reading a 400 page incredible book.
David is an amazing being.
I love your guests and the way you carry the conversation brother! Great job, love your channel.
Thank you for another enlightening, exploratory, and meaningful conversation that pushes us towards self-questioning and, one hopes, self-understanding.
Awesome interview. I start jumping around with excitement. Get so eager to learn more!
My Saturday blockbuster, thanks Lex. David is a cool dude, have to get Demis in now :)
its beautiful to see a man that lives his passion. a man that is what he is creating.
Get Demis on here please!
@fatayas9463
4 жыл бұрын
Amen
@Brad_Jacob
4 жыл бұрын
Yes!
@amandamoore9183
2 жыл бұрын
Yes please Lex Demi’s would be awesome 😎
Thank you so much LF! Great job.
Thank you!! Been looking forward to this.
This interview was so good it brought a tear to my eye!
Many thanks for sharing this amazing interview!
David Silver is a real legend
Thanks for putting the ads in the beginning !! It's way better than getting your concentration broke mid interview
1:06:48 That part implies that Lee Se-dol retired because of AlphaGo, while in reality he retired because of his dissatisfaction with the Korea Baduk Association, from which he quit in 2016. He mentioned AlphaGo but it is not the reason he quit.
Fantastic one!! So many cool ideas in there!! Thanks Lex 🤘🏽
thank you again lex, another phenomenal interview, i cannot get enough of this wonderful channel!
Thanks for Boss content empowering people, many young people enjoying this content and in my opinion, such a treasure it is, the exponential tune to your tone.
Thank you, one of the most interesting talks in a long time!
I learnt about New dimension of thinking and understanding things.
Excellent podcast, thank you
Brilliant interview. Articulate and like yourself, I believe AlphaGo was a tipping point for the progress of humanity.
Alpha Zero - "Give the system the ability to correct its own errors"
Love David Silver's lectures on RL
6 months ago I didn’t even know who Lex was, now I can’t get enough of his podcasts. The powers of the internet. I hope he does become a billionaire.
Very proud of my old university - University of Alberta. Dr. Silver got his PhD there under Richard Sutton. Great interview. Was looking forward to this one.
David is adorable, I have watched his RL Course 3-4o times. Brilliant guy and funny too
Trying to reproduce the MCTS results on some other tasks. After several weeks of struggling, I learned that David Silver is really great in a sense that he foresee the future of deep learning research -- computational power really matters.
This is an instant like from me :)! Many thanks Lex!
Such an inspiring conversation, as a phd candidate who works on deep RL, I am quite motivated to try even harder! Thanks for your efforts Lex!
@smegmaprince314
4 жыл бұрын
such an annoying comment, as someone who hates humble bragger, I am quite motivated to downvote your comment! Thanks mr poo on road!
@DaDankStrafe
11 ай бұрын
@@smegmaprince314??? He just said he's inspired because he's working toward entering the same field as the podcast guest. Don't be dumb and weird.
Crazy Lex.. I just went down the alpha learning machine rabbit hole this week. I watched the documentary on alphago, which was fascinating. I also watched the matches between the pro starcraft players and alphastar, which was even more fascinating (partially because I'm familiar with the game). I wonder in this sphere, how far a deep learning machine like this can go. This podcast was the icing on the cake at the bottom of the rabbithole, thanks brother!
Thank you Lex, Great convo.
Thank you for this amazing discussion!
The great conversation! Now I finally understand how alphaGo and alpha Zero were created.
I am struck by how small the audience is for this astonishing talk. It is so important that it should number in the millions, even billions.
Wow, very insightful, nice to get our minds off of the pandemic and look to a bright future. Incredible potential behind DRL!
Hey man, awesome interviews! You seems to be a really good person. Thank you for what you are doing.
Thanks Lex! Even bigger greatness is coming your way!! Cheers! Stay safe!
his course on youtube is amazing
This is the best of all episodes and I know I am biased. Thanks Lex.
Good to hear the logic based programming language PROLOG mentioned.
Hey lex, really interesting episode. A guest I think you should have on your podcast is Leo Gura. His work is more particularly focused on the nature of consciousness and he is for me one of the most insightful people I have ever listened to.
Well done. Its great how you went into the deep background at the end there/
Great stuff, guys! Keep up the hustle
Mate thank you for your videos. your channel is great.
You, Sir, are a gentleman and a scholar.
I must say, one of the best podcasts. Thanks, Lex and David
YES DEEPMIND!!! (I had decided to write in all caps when I saw the thumbnail)
What a fantastic conversation!!!
It's funny I got chance to watch it today again. Now this interview.
Thank you lex David you seem like a real gamer very competitive. Great podcast
Favorite parts: 1:21:16 - Self-play is optimal because the NN learns most robustly by making mistakes. Conclusion: there is no “pill” for intelligence, you evolve intelligence by correcting errors. AI introduced to the physical world would need systems tolerant to making countless errors. 1:24:47 - One model will beat another 100-0. We can construct a tower of models this way, each better than the previous. What's unclear is if this tower is totally ordered or partially ordered. Can a lower node beat a node higher in the tower? When does this occur? Where is this saturation point? How much higher is it than human intelligence in Go? There may exist an equilibrium of Go intelligences, not a greatest Go intelligence. This is the result of minimax optimization vs global optimization. 1:41:20 - It concludes with a fun interpretation of the meaning of it all :)
@SpaceCadet4Jesus
4 жыл бұрын
Agree with your comments. Regarding 1:24:47, I feel his statement merely reflects his desires and not the future reality of all programmed systems. Yes, something far intelligent can beat something far less ordered in a limited gaming setting but possibly not all the time. There is a limit to success in a totally ordered system where the outcome of two perfect playing systems end in stalemate most of the time. I would of liked to have heard the results of AlphaGo or AlphaZero playing against itself with recursive/feedback learning turned off.
This interview is eye opening👍👍
I've been taking his rl lectures currently.Thanks
Man, David Silver is such a genius! I've enjoyed the interview so much. I wouldn't say Lex interview policy can be considerd as optimal yet, but the story you create through your questions, the way you try to go to the essence when you close your eyes and just the way you are make it be really close. If you read this, thank you
I found it interesting that there was some bafflement at the power of randomness. Randomness (mutations) coupled with an objective function (maximise fitness) produced us and all the wonders of life. What could be more powerful than that?!
@chrisofnottingham
4 жыл бұрын
That's exactly what I thought. From being microbes in the sea our algorithm was basically small random variations and then pass/fail reproduction.
@5th_Interaction
4 жыл бұрын
Quite standard game theory concept. Correctly randomising close EV decisions will result in the most optimal/un-exploitative solution.
@hoolerboris
4 жыл бұрын
Celtlen i'm sure your intuition is correct. How about you run a simulation for a few billion years and get back to us with your results?
@SpaceCadet4Jesus
4 жыл бұрын
Who/What inserted the objective function phase? Evolution is not a power, or a force, or an entity that can be identified, cataloged or bottled. It's a process inherent in the variation of pre-coded genetic material. Time itself codes nothing, it's possibility already has to be there within the code, same as Alpha*anything. The programmers programmed it to learn from reiteration or feedback repetition, after first a combination of repetition and a database of the best played games of Go, which is a breakthrough in our traditional thinking of how programming should or could work. I, for one, welcome our new AI Overlords.
@SpaceCadet4Jesus
4 жыл бұрын
@@chrisofnottingham failure in genetics means eventual, if not immediate, death of the system, no procreation, no passing Go, no collecting $200. Your simple pass/fail random reproduction system as the *progression* of a biological organism doesn't exist in any course of biology. Random mutations, otherwise known as re-coding errors, are mutations that no biological system known is built upon for speciation. Interspecies copulation usually results in sterility, if successful.
Those who don’t have sophisticated backgrounds in Programming can really appreciate the way you relate what the computers are doing and capable of doing to the romantic human narratives
This talk is so inspiring.
changing the world, by bringing us the people who are changing them :) Thanks Lex! you rule :)
What a profound way to discuss about the meaning of life! First there are several layers for the meaning of life. First layer would be "Does the universe have a meaning?". Well, it looks like it operates on some very fine-tuned laws and constants. At first glance, there is no meaning. For the next layer, let's look at 2nd law of thermodynamics. It's purpose to increase entropy. What if the evolution is just a mechanism (a sub goal) in order to increase entropy further? Evolution's goal is how to reproduce efficiently. In other words how to spread energy efficiently. Because of it, entropy will increase as efficiently as possible. This line of thought is truly mind-blowing. Probably I will not able to sleep for 2-3 days, because of thinking about this concept... Lex and David, Thank you for the conversation!
Haven't watched yet, just settling in for it but I really wanted to say something. Yay!
@jonaspiva41
4 жыл бұрын
Greatly enjoyed it, and I have a feeling there are more interviews with Deepmind team and I am sooooo stoked. Be safe & have fun.
What a cool view of the meaning of life, it was enlightning!
Really enjoyed this one
Very enlightening thanks.
Absolutely amazing.
Man i can’t thank you enough ❤️
I love you Lex Fridman
amazing episode