David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86

Поділитися
Вставка
  • Опубліковано 4 тра 2024
  • David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.
    Support this podcast by signing up with these sponsors:
    - MasterClass: masterclass.com/lex
    - Cash App - use code "LexPodcast" and download:
    - Cash App (App Store): apple.co/2sPrUHe
    - Cash App (Google Play): bit.ly/2MlvP5w
    EPISODE LINKS:
    Reinforcement learning (book): amzn.to/2Jwp5zG
    PODCAST INFO:
    Podcast website:
    lexfridman.com/podcast
    Apple Podcasts:
    apple.co/2lwqZIr
    Spotify:
    spoti.fi/2nEwCF8
    RSS:
    lexfridman.com/feed/podcast/
    Full episodes playlist:
    • Lex Fridman Podcast
    Clips playlist:
    • Lex Fridman Podcast Clips
    OUTLINE:
    0:00 - Introduction
    4:09 - First program
    11:11 - AlphaGo
    21:42 - Rule of the game of Go
    25:37 - Reinforcement learning: personal journey
    30:15 - What is reinforcement learning?
    43:51 - AlphaGo (continued)
    53:40 - Supervised learning and self play in AlphaGo
    1:06:12 - Lee Sedol retirement from Go play
    1:08:57 - Garry Kasparov
    1:14:10 - Alpha Zero and self play
    1:31:29 - Creativity in AlphaZero
    1:35:21 - AlphaZero applications
    1:37:59 - Reward functions
    1:40:51 - Meaning of life
    CONNECT:
    - Subscribe to this UA-cam channel
    - Twitter: / lexfridman
    - LinkedIn: / lexfridman
    - Facebook: / lexfridmanpage
    - Instagram: / lexfridman
    - Medium: / lexfridman
    - Support on Patreon: / lexfridman
  • Наука та технологія

КОМЕНТАРІ • 457

  • @lexfridman
    @lexfridman  4 роки тому +224

    I really enjoyed this conversation with David. Here's the outline:
    0:00 - Introduction
    4:09 - First program
    11:11 - AlphaGo
    21:42 - Rule of the game of Go
    25:37 - Reinforcement learning: personal journey
    30:15 - What is reinforcement learning?
    43:51 - AlphaGo (continued)
    53:40 - Supervised learning and self play in AlphaGo
    1:06:12 - Lee Sedol retirement from Go play
    1:08:57 - Garry Kasparov
    1:14:10 - Alpha Zero and self play
    1:31:29 - Creativity in AlphaZero
    1:35:21 - AlphaZero applications
    1:37:59 - Reward functions
    1:40:51 - Meaning of life

    • @abogaziah
      @abogaziah 4 роки тому +1

      OMG THANK YOU

    • @riccardomereu1813
      @riccardomereu1813 4 роки тому +3

      Thank you very much Lex 🙏

    • @pyshine_official
      @pyshine_official 4 роки тому +1

      Thanks

    • @franj4139
      @franj4139 4 роки тому +7

      Please invite Humberto Maturana: He had develop theories on human intelligence, consciousness and understanding. He is in his 90s, we could lose his takes on artificial intelligence

    • @ivannogolica364
      @ivannogolica364 4 роки тому +1

      Bring David Deutsch please! :)

  • @vedgupta1686
    @vedgupta1686 4 роки тому +281

    "He'll be remembered as the last person to beat AlphaGo"
    man!!

    • @joelkavanagh1464
      @joelkavanagh1464 2 роки тому +1

      ,,, kudos n respect on that comment! ... greetINX from s.lem jr ... .. . ...............

  • @chanleystow
    @chanleystow 4 роки тому +371

    Seeing this after the AlphaGo doc!

    • @rdcalderon
      @rdcalderon 4 роки тому +12

      Watching the documentary before watching this interview definitely adds value. ua-cam.com/video/WXuK6gekU1Y/v-deo.html

    • @ecavero1
      @ecavero1 4 роки тому +2

      As have I! I was searching of an Alpha Zero doc. This is where I got so far. Not disappointed at all!

    • @maplegoose6364
      @maplegoose6364 4 роки тому +6

      Yes came here directly after the Doc as well. Had never heard of GO! prior to 3hrs a go. Indelibly registered and imprinted now :D

    • @khall187
      @khall187 4 роки тому +2

      Same

    • @schwajj
      @schwajj 4 роки тому +1

      maap no need to capitalize and exclaim, any more than you’d write CHESS!

  • @AakarshNair
    @AakarshNair 2 місяці тому +3

    His answers are so articulate!

  • @oncedidactic
    @oncedidactic 4 роки тому +207

    THIS IS THE ONE I'VE BEEN WAITING FOR!

    • @oncedidactic
      @oncedidactic 4 роки тому +1

      @@mikhailfranco dude, thanks 🙌

  • @hamentaschen
    @hamentaschen 4 роки тому +73

    Again, Mr. Fridman, THANK YOU for keeping this going, especially now. When I need to get my mind off the current world situation I come here. Your talks always take me to a better place. Thank you. Be safe. Stay healthy.

  • @joaodesouza4649
    @joaodesouza4649 4 роки тому +17

    I can't describe or express how valuable this interview is for understanding what's going to happen in the future

  • @burkebaby
    @burkebaby Рік тому +2

    Amazing, this conversations are so meaningful to the future of humanity that they should be broadcasted on national television. That way children would more easily find meaningful role models and access to the type of insightful ideas that give birth to passions and eventually discoveries.

  • @litvinenkoalexander5331
    @litvinenkoalexander5331 7 місяців тому +2

    I am very happy to see that 3.22M people are watching this channel.

  • @TrappedinaBrain
    @TrappedinaBrain 4 роки тому +89

    This is a banger of an interview. AlphaZero is a harbinger of the future

  • @TheTessatje123
    @TheTessatje123 3 роки тому +28

    Thanks for making this podcast. David Silver chooses his words very well, his stories are very clear and inspiring! I could have listened much longer ;-)

  • @bruceturner4858
    @bruceturner4858 4 роки тому +2

    Discovery is a joy. Discovering the existence of David Silver and his amazing way of thinking is pure gold. Thank you Lex.

  • @TheRealStructurer
    @TheRealStructurer Рік тому +7

    3 years later I am here... Latest AI developments makes me ask for a second round with David Silver. Thanks for sharing 👍🏼

  • @r1s1112
    @r1s1112 4 роки тому +2

    Awesome conversation, David is incredibly interesting and humble also amazing questions from Lex. Thanks to both of you for making it.

  • @camillorohe6996
    @camillorohe6996 4 роки тому +15

    you just gotta love David Silver and his ideas, thoughts and accent

  • @asdf_600
    @asdf_600 3 роки тому +3

    Incredible podcast, probably my favourite! It would be incredible to have a second part!

  • @L.someone
    @L.someone 4 роки тому +2

    Wow! This was an incredibly insightful and inspiring conversation. Thank you Lex, David, and your teams for this.

  • @geraldsierveldphotographyi1406
    @geraldsierveldphotographyi1406 4 роки тому +4

    Outlining the episode is the MOST awesome and thoughtful thing foru2have done...

  • @UnpluggedPerformance
    @UnpluggedPerformance 2 роки тому +1

    This interview is LEGENDARY!... watching it for the second time. Definitely in the top 3 on youtube!

  • @supersnowva6717
    @supersnowva6717 4 роки тому +13

    I watched Alpha Go vs. Lee sedol tournament documentary Deepmind recently uploaded, and I cried. It was so inspiring, touching and beautiful. Thanks very much Lex for this podcast.

  • @vladimirgetselevich4704
    @vladimirgetselevich4704 4 роки тому +3

    Thank you for Lex and David! Very interesting and inspiring conversation about first principles of Artificial Intelligence.

  • @samuelec
    @samuelec 4 роки тому +2

    Thank you both!
    It was, again, an awesome conversation.

  • @user-jx8gv1rd8e
    @user-jx8gv1rd8e Рік тому +1

    Lex,
    It is very clear that you love what you do. It totally shows.
    You are always super prepared and well engaged with your guests.
    Yours has become my absolutely favorite podcast. Listening to a 2 hr podcast of yours is as intellectually fulfilling as reading a 400 page incredible book.

  • @minerwilly
    @minerwilly 4 роки тому +3

    This is a really great interview and very enlightening. Thanks for all of your hard work bringing this stuff to us. Keep up the good work.

  • @ufozencom
    @ufozencom 4 роки тому +4

    Mind teased, tantalized, and finally thrown into a tizzy. Love every one of your interviews Lex. All I want to do is watch them to get inspired to think in new ways. THANKS MAN!

  • @gallerksee
    @gallerksee 3 роки тому +2

    I love the content you put out man! It's always interesting, always paradigm challenging, calm, informed, you! Thanks!

  • @hariomt348
    @hariomt348 3 роки тому +1

    1:40:51 : One of the best answers for the purpose and meaning of life I have heard so far. Incredible!

  • @ottolehto
    @ottolehto 3 роки тому

    Thank you for another enlightening, exploratory, and meaningful conversation that pushes us towards self-questioning and, one hopes, self-understanding.

  • @andrewg2355
    @andrewg2355 3 роки тому +1

    I love your guests and the way you carry the conversation brother! Great job, love your channel.

  • @jung8935
    @jung8935 4 роки тому +2

    Man, David Silver is so incredibly humble...

  • @isakrathestre6748
    @isakrathestre6748 4 роки тому

    Awesome interview. I start jumping around with excitement. Get so eager to learn more!

  • @saulocerqueiradealmeida9700
    @saulocerqueiradealmeida9700 4 роки тому

    Thank you so much LF! Great job.

  • @perfumedsea
    @perfumedsea 4 роки тому +15

    I can ignore everyone else but David Silver talking about AI. His lectures and courses taught me RL.

  • @Lagruell
    @Lagruell 4 роки тому +1

    Many thanks for sharing this amazing interview!

  • @shawnchen6338
    @shawnchen6338 3 роки тому

    Trying to reproduce the MCTS results on some other tasks. After several weeks of struggling, I learned that David Silver is really great in a sense that he foresee the future of deep learning research -- computational power really matters.

  • @einemailadressenbesitzerei8816
    @einemailadressenbesitzerei8816 4 роки тому

    its beautiful to see a man that lives his passion. a man that is what he is creating.

  • @duderadley2383
    @duderadley2383 4 роки тому

    Thanks for Boss content empowering people, many young people enjoying this content and in my opinion, such a treasure it is, the exponential tune to your tone.

  • @JohnHAdams-vo2pk
    @JohnHAdams-vo2pk 4 роки тому +2

    Very proud of my old university - University of Alberta. Dr. Silver got his PhD there under Richard Sutton. Great interview. Was looking forward to this one.

  • @darylallen2485
    @darylallen2485 4 роки тому +5

    Many academics are terrible at explaining their domain of expertise. David is a quality academic and has remained grounded enough to explain himself to normal folk like me. Well done.

  • @sathvikudupa1668
    @sathvikudupa1668 4 роки тому +2

    Thank you!! Been looking forward to this.

  • @alexcherfan7762
    @alexcherfan7762 4 роки тому +1

    Crazy Lex.. I just went down the alpha learning machine rabbit hole this week. I watched the documentary on alphago, which was fascinating. I also watched the matches between the pro starcraft players and alphastar, which was even more fascinating (partially because I'm familiar with the game). I wonder in this sphere, how far a deep learning machine like this can go. This podcast was the icing on the cake at the bottom of the rabbithole, thanks brother!

  • @kennethcrandall8131
    @kennethcrandall8131 Рік тому +1

    This interview was so good it brought a tear to my eye!

  • @JousefM
    @JousefM 4 роки тому +5

    My Saturday blockbuster, thanks Lex. David is a cool dude, have to get Demis in now :)

  • @garyswift9347
    @garyswift9347 2 роки тому +1

    I love how the wall and window are decorated to resemble a go board

  • @chiefrabbi6735
    @chiefrabbi6735 2 роки тому

    Love David Silver's lectures on RL

  • @michaeltheunissen609
    @michaeltheunissen609 4 роки тому +4

    Brilliant interview. Articulate and like yourself, I believe AlphaGo was a tipping point for the progress of humanity.

  • @adeep_jain
    @adeep_jain 4 роки тому +1

    Fantastic one!! So many cool ideas in there!! Thanks Lex 🤘🏽

  • @shuhu1234
    @shuhu1234 4 роки тому

    Thank you for this amazing discussion!

  • @englishiguana4304
    @englishiguana4304 Рік тому

    thank you again lex, another phenomenal interview, i cannot get enough of this wonderful channel!

  • @peacock8730
    @peacock8730 4 роки тому +1

    The great conversation! Now I finally understand how alphaGo and alpha Zero were created.

  • @bernardvantonder7291
    @bernardvantonder7291 2 роки тому +2

    David is an amazing being.

  • @karlisstigis
    @karlisstigis 3 роки тому +1

    Thank you, one of the most interesting talks in a long time!

  • @jingtao1181
    @jingtao1181 3 роки тому

    Thank you Lex, Great convo.

  • @roseleelauper1193
    @roseleelauper1193 4 роки тому +3

    Excellent podcast, thank you

  • @Kyle-oe2vs
    @Kyle-oe2vs 4 роки тому +7

    Wow, very insightful, nice to get our minds off of the pandemic and look to a bright future. Incredible potential behind DRL!

  • @oudarjyasensarma4199
    @oudarjyasensarma4199 4 роки тому +1

    Thanks Lex! Even bigger greatness is coming your way!! Cheers! Stay safe!

  • @JackSPk
    @JackSPk 4 роки тому +45

    Oh man! That meaning of life interpretation! I think I'm gonna click this 1:41:20 every night before sleep from now on.
    Thank you Lex for making this possible! ❤️

    • @sabelch
      @sabelch 4 роки тому +3

      I initially cringed a little when Lex decided to "go there" with the meaning of life question but pshew! Silver gave a great answer.

    • @Jannikheu
      @Jannikheu 4 роки тому +2

      sabelch yes that answer was very impressive and I think demonstrated his capacity of deep thinking

    • @iwanjones7334
      @iwanjones7334 4 роки тому

      I was laughing to myself and thinking: "All he needs to do now is ask him the meaning of life question". And then he did!

    • @decidrophob
      @decidrophob 3 роки тому +1

      Indeed, probably David's comment regarding the meaning of life was by far the most philosophically meaningful I have ever come across.

    • @Mikey-lj2kq
      @Mikey-lj2kq 3 роки тому

      there's a book called 'the fabrics of reality'

  • @bobwelham8792
    @bobwelham8792 4 роки тому +1

    Good to hear the logic based programming language PROLOG mentioned.

  • @johnsharkey5255
    @johnsharkey5255 4 роки тому +4

    Hey lex, really interesting episode. A guest I think you should have on your podcast is Leo Gura. His work is more particularly focused on the nature of consciousness and he is for me one of the most insightful people I have ever listened to.

  • @JT-xb6zs
    @JT-xb6zs 4 роки тому +3

    Thanks for putting the ads in the beginning !! It's way better than getting your concentration broke mid interview

  • @devonk298
    @devonk298 3 роки тому +2

    David is adorable, I have watched his RL Course 3-4o times. Brilliant guy and funny too

  • @iwanjones7334
    @iwanjones7334 4 роки тому

    I am struck by how small the audience is for this astonishing talk. It is so important that it should number in the millions, even billions.

  • @people93
    @people93 4 роки тому +3

    David Silver is a real legend

  • @egorpanfilov
    @egorpanfilov 4 роки тому +2

    This is an instant like from me :)! Many thanks Lex!

  • @josephsantarcangelo9310
    @josephsantarcangelo9310 4 роки тому +2

    his course on youtube is amazing

  • @shivamkushwaha9730
    @shivamkushwaha9730 4 роки тому +1

    This is the best of all episodes and I know I am biased. Thanks Lex.

  • @typo44
    @typo44 2 роки тому

    Well done. Its great how you went into the deep background at the end there/

  • @msulemanf
    @msulemanf 4 роки тому +70

    This was the AI interview I've been waiting for - it did deliver. It could have been a bit longer and included the protein folding work, though. Perhaps that's ongoing and still a competitive area. There is a certain clarity of articulation from the guests I enjoy most - reminds me of Jeff Hawkins. Also a sense of practical application.

  • @emmanuelboakye1124
    @emmanuelboakye1124 4 роки тому

    This interview is eye opening👍👍

  • @jordanjennnings9864
    @jordanjennnings9864 3 роки тому

    Thank you lex David you seem like a real gamer very competitive. Great podcast

  • @Voke
    @Voke 4 роки тому

    Great stuff, guys! Keep up the hustle

  • @corkkyle
    @corkkyle 2 роки тому

    What a fantastic conversation!!!

  • @tristonedwards7094
    @tristonedwards7094 4 роки тому +1

    Mate thank you for your videos. your channel is great.

  • @brixtoncruddy
    @brixtoncruddy 4 роки тому +93

    Get Demis on here please!

  • @Francisco-qh3qh
    @Francisco-qh3qh 4 роки тому +1

    You, Sir, are a gentleman and a scholar.

  • @kartikeydetha5582
    @kartikeydetha5582 Рік тому +1

    I learnt about New dimension of thinking and understanding things.

  • @mohamedarif604
    @mohamedarif604 4 роки тому +1

    I've been taking his rl lectures currently.Thanks

  • @PedroContipelli2
    @PedroContipelli2 2 роки тому

    Absolutely amazing.

  • @davida3922
    @davida3922 3 роки тому +1

    6 months ago I didn’t even know who Lex was, now I can’t get enough of his podcasts. The powers of the internet. I hope he does become a billionaire.

  • @rahulsagarpv
    @rahulsagarpv 4 роки тому +4

    Hey man, awesome interviews! You seems to be a really good person. Thank you for what you are doing.

  • @ishtar0077
    @ishtar0077 2 роки тому

    It's funny I got chance to watch it today again. Now this interview.

  • @muharremuguryavas9183
    @muharremuguryavas9183 4 роки тому +17

    Such an inspiring conversation, as a phd candidate who works on deep RL, I am quite motivated to try even harder! Thanks for your efforts Lex!

    • @smegmaprince314
      @smegmaprince314 3 роки тому +2

      such an annoying comment, as someone who hates humble bragger, I am quite motivated to downvote your comment! Thanks mr poo on road!

    • @DaDankStrafe
      @DaDankStrafe 10 місяців тому

      @@smegmaprince314??? He just said he's inspired because he's working toward entering the same field as the podcast guest. Don't be dumb and weird.

  • @olijones9953
    @olijones9953 4 роки тому

    Really enjoyed this one

  • @willikey
    @willikey 4 роки тому

    This talk is so inspiring.

  • @samvargas2868
    @samvargas2868 4 роки тому +4

    YES DEEPMIND!!! (I had decided to write in all caps when I saw the thumbnail)

  • @antoniolau8762
    @antoniolau8762 3 роки тому +1

    Man, David Silver is such a genius! I've enjoyed the interview so much.
    I wouldn't say Lex interview policy can be considerd as optimal yet, but the story you create through your questions, the way you try to go to the essence when you close your eyes and just the way you are make it be really close. If you read this, thank you

  • @wenhanzhou5826
    @wenhanzhou5826 2 роки тому

    What a cool view of the meaning of life, it was enlightning!

  • @dizbeefpvdizbeliefdizzy3612
    @dizbeefpvdizbeliefdizzy3612 4 роки тому

    Very enlightening thanks.

  • @hazemahmed8333
    @hazemahmed8333 4 роки тому +1

    Man i can’t thank you enough ❤️

  • @duderadley2383
    @duderadley2383 4 роки тому

    Those who don’t have sophisticated backgrounds in Programming can really appreciate the way you relate what the computers are doing and capable of doing to the romantic human narratives

  • @vigneshpadmanabhan
    @vigneshpadmanabhan Рік тому

    amazing episode

  • @viraatchandra8498
    @viraatchandra8498 3 роки тому

    changing the world, by bringing us the people who are changing them :) Thanks Lex! you rule :)

  • @james.arambam
    @james.arambam 4 роки тому +3

    I must say, one of the best podcasts. Thanks, Lex and David

  • @jonaspiva41
    @jonaspiva41 4 роки тому

    Haven't watched yet, just settling in for it but I really wanted to say something. Yay!

    • @jonaspiva41
      @jonaspiva41 4 роки тому +1

      Greatly enjoyed it, and I have a feeling there are more interviews with Deepmind team and I am sooooo stoked. Be safe & have fun.

  • @sabofx
    @sabofx 4 роки тому +2

    Anyone else get excited by Deepmind's latest "muzero" algorithm that David discussed, starting from about 1:28:00 into the video? Supposedly a new algorithm that is able to figure-out the rules and constraints of the environment by itself. I'd love to hear more in depth discussions about Muzero's capabilities in future talks with Deepmind's finest 😎!

  • @Chess_Intelligence
    @Chess_Intelligence 4 роки тому

    Good interview this one.

  • @aliancemd
    @aliancemd 3 роки тому +7

    1:06:48 That part implies that Lee Se-dol retired because of AlphaGo, while in reality he retired because of his dissatisfaction with the Korea Baduk Association, from which he quit in 2016. He mentioned AlphaGo but it is not the reason he quit.

  • @Dave-nz5jf
    @Dave-nz5jf 4 роки тому +1

    Lex I really admire your interviewing style, you made Silver really light up a number of times. Your voice is like liquid morphine, and you could see how easy it was for your subject to just 'let go', which is great. It's clear though that AlphaGo was his baby .. AlphaStar not so much. I was really hoping to have the same blow by blow for all of the Starcraft games.

  • @csswithmalikbedarbakht
    @csswithmalikbedarbakht 2 роки тому

    Great interview.

  • @pyshine_official
    @pyshine_official 4 роки тому

    We are in this together we will win!