Parables on the Power of Planning in AI: From Poker to Diplomacy: Noam Brown (OpenAI)

Поділитися
Вставка
  • Опубліковано 18 вер 2024
  • Title: Parables on the Power of Planning in AI: From Poker to Diplomacy
    Speaker: Noam Brown (OpenAI)
    Date: Thursday, May 23, 2024
    Abstract: from Deep Blue in 1997, to AlphaGo in 2016, to Cicero in 2022, games have long been used as a way to measure the frontier capabilities of AI systems and gain algorithmic insights that have wider applications. In this talk, I will cover research breakthroughs in games including poker, Go, and Diplomacy, and in particular highlight the key role that search/planning algorithms have played in all of these achievements. I will then point to potential future applications of this research to improving machine learning models more broadly.
    Bio: Noam Brown is an AI researcher at OpenAI investigating reasoning and self play. He co-created Libratus and Pluribus, the first AIs to defeat top humans in two-player no-limit poker and multiplayer no-limit poker, respectively. Noam was also the lead research scientist for Cicero, the first AI to achieve human-level performance in the natural language strategy game Diplomacy. He has received the Marvin Minsky Medal for Outstanding Achievements in AI, was named one of MIT Tech Review's 35 Innovators Under 35, and his work on Pluribus was named by Science as one of the top 10 scientific breakthroughs of 2019. Noam received his PhD from Carnegie Mellon University, for which he received the AAMAS Victor Lesser Distinguished Dissertation Award, the AAAI ACM-SIGAI Dissertation Award, and the CMU School of Computer Science Distinguished Dissertation Award.
    This video is in the process of being closed captioned.

КОМЕНТАРІ • 24

  • @DistortedV12
    @DistortedV12 День тому +24

    the architect of Cicero and "scaling inference time compute."

    • @windmaple
      @windmaple День тому +2

      Well, the talk actually took place in May if you look at the description. So he kind of hinted o1 3 months ago

    • @DistortedV12
      @DistortedV12 День тому

      @@windmaple ik my point exactly.. probably told UW to not release it until now

  • @pruff3
    @pruff3 День тому +8

    Never underestimate search. -Waldo

    • @smicha15
      @smicha15 День тому

      Oh my god brilliant.

  • @omadDev
    @omadDev 4 години тому

    Very interesting lecture. Thank you!

  • @RaviAnnaswamy
    @RaviAnnaswamy День тому +4

    His points on why people didn’t prioritize search is very illuminating
    The broader lesson here is that trained distilled knowledge is pattern recognition and good for perceptual take whereas adding a search and explore (as in GOFAI) is necessary for cognitive tasks
    I think there might be one more step: to distill the patterns discovered via search back into perceptual precepts which I think is what happens in grandmaster play in chess and genius such as Newton or Ramanujan
    If o1 already does this similar to alphazero I do not know as I am typing this half way the lecture

    • @masterchief7301
      @masterchief7301 День тому +1

      So, it'd be a loop of creating new patterns as it encounters novel situations.

    • @DistortedV12
      @DistortedV12 День тому

      Us cognitive scientists have known about this for a long time as well; "system 1" and "system 2."

    • @RaviAnnaswamy
      @RaviAnnaswamy День тому

      @@DistortedV12 yes I am aware of that and read Kahnemans great book on that topic too but what is fascinating is how facing human players beat the system 1 version of their bot forces them to add search

  • @RaviAnnaswamy
    @RaviAnnaswamy День тому +1

    Search means find a series of actions that lead from the current state to end state that you would
    Like
    Or alternatively avoid potentially bad states for you in future

  • @ankitkumarpandey7262
    @ankitkumarpandey7262 День тому +3

    The way AI is progressing is so closely related to evolution..just at a much faster time scale.

  • @triplea657aaa
    @triplea657aaa День тому

    Would love if some of these papers were in the description for easy reference!

  • @Erick_BMG
    @Erick_BMG 22 години тому

    COOL

  • @fil4dworldcomo623
    @fil4dworldcomo623 9 годин тому

    I have been listening for a while now, though I agree that enabling search is a big factor for GenAI intellect, it's still not clear from the context of poker game if why. I can only assume you taught the model to read people's faces and then search on their historical game record to know when they are bluffing and when they do really have a strong hand?

  • @pruff3
    @pruff3 День тому

    TGI MCTS

  • @IsaiahUla-r6w
    @IsaiahUla-r6w Годину тому

    Williams Helen White Mark Walker Amy

  • @FullChick-h4l
    @FullChick-h4l День тому

    Williams Helen Anderson Jessica Robinson Larry

  • @TempleElaine-z4l
    @TempleElaine-z4l День тому

    Wilson Daniel Robinson Scott Brown Jeffrey

  • @MalachiSpring-s1t
    @MalachiSpring-s1t День тому

    Brown Christopher Johnson Edward Young Eric

  • @HopkinsDean-r8i
    @HopkinsDean-r8i День тому

    Martinez Michael Lopez Thomas Moore Eric

  • @RossettiAries-s5w
    @RossettiAries-s5w День тому

    Robinson Karen Rodriguez Maria Walker Brian