LLMs will hit the data wall if they can’t generalize - OpenAI cofounder John Schulman

Поділитися
Вставка
  • Опубліковано 12 тра 2024
  • Full Episode: • John Schulman (OpenAI ...
    Apple Podcasts: podcasts.apple.com/us/podcast...
    Spotify: open.spotify.com/episode/1ivz...
    Transcript: www.dwarkeshpatel.com/p/john-...
    Me on Twitter: / dwarkesh_sp
  • Наука та технологія

КОМЕНТАРІ • 44

  • @oscbit
    @oscbit 15 днів тому +98

    Dwarkesh pls stop uploading teasers before the actual show.. seeing shortform content suggests that the episode exists and there is no way to know until visiting you channel, only to then get disappointed.

    • @daniellawson9894
      @daniellawson9894 15 днів тому +9

      Could keep it but put teaser / preview in the title

    • @radekwarowny
      @radekwarowny 15 днів тому +4

      Yeah I hate that too

    • @aazzrwadrf
      @aazzrwadrf 15 днів тому +1

      The full ep is probably not done editing yet. I don’t mind it tbh.

    • @forthehomies7043
      @forthehomies7043 15 днів тому +2

      such an entitled take bro. just sub and keep notis on

    • @noone-ld7pt
      @noone-ld7pt 15 днів тому +1

      @@forthehomies7043 not an entitled take at all, he shared his opinion an a lot of people agreed. that's useful constructive feedback.

  • @hdhgdhgdhfhjfjhfjh
    @hdhgdhgdhfhjfjhfjh 15 днів тому +32

    this guy AIs.

  • @charleshetterich8514
    @charleshetterich8514 15 днів тому +49

    co-founder ?? i swear they're just writing new characters into this OAI plot-line

    • @MatRuizMat
      @MatRuizMat 15 днів тому +15

      this guy is a scientific legend in the AI/RL field bro

    • @user-jf5uv9ir5k
      @user-jf5uv9ir5k 15 днів тому +4

      Exactly, he must be the 10th person to claim cofounder status

    • @kevinamiri909
      @kevinamiri909 14 днів тому

      Bro this is the real person behind all OpenAI innovations I swear.

  • @hemanthkorrapati1412
    @hemanthkorrapati1412 15 днів тому

    When will be the uploaded of full podcast link

  • @craiginzana
    @craiginzana 12 днів тому +2

    Didn't really age well with Claude 3 and GPT 4o

  • @user-bp2ol4wi1c
    @user-bp2ol4wi1c 15 днів тому +6

    what is with the sound mixing, something is off

    • @BadWithNames123
      @BadWithNames123 14 днів тому +2

      they use ai to "clean" the audio track.. I hate it

    • @user-bp2ol4wi1c
      @user-bp2ol4wi1c 14 днів тому

      @@BadWithNames123 it sounds shit , raw would do better i think

  • @nitap109
    @nitap109 14 днів тому +1

    Wow, great topic

  • @DynamicUnreal
    @DynamicUnreal 15 днів тому +5

    They will never run out of data. What they will likely run out of is captured data. Humans collectively likely produce massive amounts of _text data_ just by talking to each other every day, the question is how to capture it in a voluntary manner?
    Even if LLMs on their own can’t get us to AGI by themselves, they can serve as a sophisticated foundation on which to train other modalities on top of.

    • @dovekie3437
      @dovekie3437 15 днів тому +1

      How much of the human corpus of knowledge and history and science and literature are LLMs actually trained on? I would guess that it's less than 1/50th of existing books given the training size vs total amount of terabytes of text data all the books would require.

    • @squamish4244
      @squamish4244 7 днів тому

      @@dovekie3437 Not to mention the five million scientific papers produced every year, a number that has soared in recent years.

    • @dovekie3437
      @dovekie3437 6 днів тому

      @@squamish4244 Hopefully the LLMs put information gained from "scientific" papers from the humanities in the same place in its memory that it puts religious texts.

  • @junwang9927
    @junwang9927 15 днів тому

    Another legend. This is definitely my go-to AI podcast.

  • @groundcrewz
    @groundcrewz 8 днів тому

    and the game is back to algorithms and compute, again!

  • @jackbauer322
    @jackbauer322 15 днів тому +12

    it's not the data but the ARCHITECTURE that is a dead end

    • @kraithaywire
      @kraithaywire 15 днів тому

      What do you mean by dead? Will we not see any more progress for quite some time or what? I would really love to know. Thank you.

    • @IcySpicy3
      @IcySpicy3 15 днів тому

      You mean x86?

    • @JackLawrence-dn2jb
      @JackLawrence-dn2jb 14 днів тому

      @@kraithaywire People have been saying the ARCHITECTURE IS A DEAD END for years, but that continues to be disproven time and time again. Don't listen to the doomers and naysayers.

    • @egor.okhterov
      @egor.okhterov 13 днів тому

      ​@JackLawrence-dn2jb how is it disproven? By fancy UI? 😂

    • @JackLawrence-dn2jb
      @JackLawrence-dn2jb 13 днів тому +1

      @@egor.okhterov The fact that the models are getting better year by year. Elo scores going up, now we have multi-modality, improved text to video, improved text to image.
      People like you been saying these are a dead end for years. Clowns lmao

  • @tusharjain9366
    @tusharjain9366 15 днів тому +2

    My hypothesis ( yet don’t have data to support it) : Current generative AI technologies (LLMs ) will reach at plateau soon(again lacks data) due to at least three reasons. Reason 1: underlying models zero in on a single value which makes cross domain generation of text (or images, videos, or data points) very limited and sometimes awkward. Reason 2: post 2022/23 distinction between naturally occurring (as well generating data) and synthetic data is blurring very fast which puts learning data in downward self spiral. Reason 3: Limited labeled data availability with respect to niche . For example images about various trees vs images of tree.

    • @Hexanitrobenzene
      @Hexanitrobenzene 13 днів тому

      You might be right. Mike Pound on Computerphile discusses a new paper:
      ua-cam.com/video/dDUC-LqVrPU/v-deo.html

  • @kevinamiri909
    @kevinamiri909 14 днів тому

    I found someone that makes sense, please release the full interview, I cannot wait to watch his interview.

  • @vsma6517
    @vsma6517 15 днів тому +7

    "uhm"

  • @matiasortizxxi
    @matiasortizxxi 15 днів тому +10

    Well this aged like milk.

    • @assgoblin3981
      @assgoblin3981 15 днів тому

      what the fuck happened

    • @aloysius_music
      @aloysius_music 15 днів тому +20

      Did it? GPT-4o is super impressive (and uncanny), but the core reasoning isn't a massive step up. There's a reason they didn't call it GPT-5.

    • @squamish4244
      @squamish4244 7 днів тому

      @@aloysius_music It reveals the potential of where we can go from here, though. LLMs have a limit, but it's not GPT-4.

  • @CrunchyAI-fu6de
    @CrunchyAI-fu6de 14 днів тому

    REALLY don't like seeing clips of a full length interview that doesn't exist. Please stop doing this.

  • @Natron1time
    @Natron1time 15 днів тому +2

    uhhhhhh ummmmm uhhhhh

  • @alanrobertson3172
    @alanrobertson3172 14 днів тому +1

    He’s not a good public speaker.

    • @Derick99
      @Derick99 6 днів тому

      I think he's having trouble answering without saying to much