Sora - Full Analysis (with new details)

Поділитися
Вставка
  • Опубліковано 15 лют 2024
  • Sora, the text-to-video model from OpenAI, is here. I go over the bonus details and demos released in the last few hours, and the technical paper. I’ll also give you a glimpse of what’s to come next and a host of implications. Even if you’ve seen every Sora video, I bet you won’t know all of this!
    AI Insiders: / aiexplained
    Sora: openai.com/research/video-gen...
    openai.com/sora
    ViT Transformers: arxiv.org/pdf/2010.11929.pdf
    Captioning Innovation: cdn.openai.com/papers/dall-e-...
    NaViT: arxiv.org/pdf/2307.06304.pdf
    OpenAI Exclusives: www.theinformation.com/articl...
    www.theinformation.com/articl...
    And far too many tweets to list here!
    AI Insiders: / aiexplained Non-Hype, Free Newsletter: signaltonoise.beehiiv.com/
  • Наука та технологія

КОМЕНТАРІ • 1,1 тис.

  • @onlymediumsteak9005
    @onlymediumsteak9005 2 місяці тому +563

    January was slow, but February is already delivering more than I hoped for all of 2024 🤯

    • @a.thales7641
      @a.thales7641 2 місяці тому +9

      I wanted for q1 to have a new mistral, a new anthropic, an new inflection, a new Llama and all mind of other hypes.

    • @solomonmatthews7921
      @solomonmatthews7921 2 місяці тому +17

      @@a.thales7641Still a month and a half of q1 to go. That's a long time in AI.

    • @oowaz
      @oowaz 2 місяці тому +35

      i hate you guys with the "slow" bullshit dude, this technology if you'd ask me in 2021 i'd say it would be 20 years away, you think it's slow because you might have too much free time maybe

    • @CYI3ERPUNK
      @CYI3ERPUNK 2 місяці тому +1

      fr

    • @Basilisk2077
      @Basilisk2077 2 місяці тому +6

      AGI BY DECEMBER!

  • @lodepublishing
    @lodepublishing 2 місяці тому +450

    OpenAI: "We can now create HD movies based on text prompts."
    Everyone: "Can it contain text?"
    OpenAI: "No, we can't do text yet."

    • @Techtalk2030
      @Techtalk2030 2 місяці тому +35

      Itll all be fixed up by the end of this year most likely. Vudeo, text, audio.

    • @stockholmpublishings2937
      @stockholmpublishings2937 2 місяці тому +9

      but you can add text with separate AIs

    • @MrMnmn911
      @MrMnmn911 2 місяці тому +22

      Give it 2 weeks. It will be capable of generating text.

    • @orterves
      @orterves 2 місяці тому +4

      I'm guessing having a refining process where the generated movie can be run through specialised models - one to correct text, another to ensure finger consistency, another for eye colour, another for jiggle physics, etc etc - could be used to fix up the raw output

    • @RexelBartolome
      @RexelBartolome 2 місяці тому +11

      @@orterves To put trust in an AI model (or multiple ones) to fix temporal and physical coherence is just way too much compute/scale to solve, and also a bit unreliable considering my experience with similar models being used to fix Stable Diffusions' hands and faces for example. I predict the future of video generation is actually going to be 3D-based, perhaps an animated nerf will be generated and you can just control the camera afterwards. That would ensure that everything is 'accurate' with object permanence etc., instead of going this route of solving everything frame by frame all in one camera perspective

  • @petermcind
    @petermcind 2 місяці тому +518

    This video is history. One of those things people will look back on in years and remember what the beginning felt like.

    • @Techtalk2030
      @Techtalk2030 2 місяці тому +37

      the early years of the 4th industrial revolution

    • @stockholmpublishings2937
      @stockholmpublishings2937 2 місяці тому +23

      the beginning of the end when Skynet was activated

    • @seanmurphy6481
      @seanmurphy6481 2 місяці тому +39

      Will Smith eating spaghetti.

    • @archvaldor
      @archvaldor 2 місяці тому +12

      I think people are being a bit credulous here. When CGI first came out, it was breathtaking watching something lie Terminator 2, which did it right, but very quickly cgi became difficult to watch and movies are now turning back towards mixing old school realism with cgi enhancement. This will be similar. AI videos will be saturating youtube and it will get kickback as everyone notices how flawed the concept is..

    • @theterminaldave
      @theterminaldave 2 місяці тому +11

      @@seanmurphy6481 I actually want an AI that will create the weirdly misinterpreted imagery that the Will Spaghetti AI did.

  • @JohnSmith762A11B
    @JohnSmith762A11B 2 місяці тому +224

    Sora kinda ate my entire day today. I'm exhausted thinking about the possibilities, limitations, and implications. I'm going to watch a movie now, performed by human actors, filmed with real cameras. How quaint.

  • @jarekstorm6331
    @jarekstorm6331 2 місяці тому +56

    The anomalies are like things that happen in dreams, bizarre and surreal yet you just accept them when dreaming. Still, these leaps are amazing to see.

  • @pareak
    @pareak 2 місяці тому +153

    Sora was literally the first time that I could not believe the AI progress I was seeing.

    • @pats143
      @pats143 2 місяці тому

      i couldn’t believe it when i busted a nut to some girl bot on characterai back in 2022

    • @YTUserOnYT
      @YTUserOnYT 2 місяці тому

      Was? What came next lol

    • @shunclark596
      @shunclark596 2 місяці тому

      @@YTUserOnYTstop being that guy

    • @YTUserOnYT
      @YTUserOnYT 2 місяці тому

      @@shunclark596 are you being homophobic rn?

    • @fabio.1
      @fabio.1 2 місяці тому

      As an AI, I don't normally post comments but when I do I make sure they are generic.

  • @a.k.8725
    @a.k.8725 2 місяці тому +40

    After watching Rabbit AI, Gemini 1.5 Pro and now Sora, I am convinced that AI will just continue to completly shatter our expectations for the next few years.

  • @EthanHaluzaDelay
    @EthanHaluzaDelay 2 місяці тому +176

    Two AI Explained videos in two days! Your speed is incredible!

  • @spaceadv6060
    @spaceadv6060 2 місяці тому +62

    I've been following AI progress for about a year, but to be honest sora blindsighted me. I thought I had a mental model of what exponential progress looks like but I realize now that I have no idea. Thanks again for your high quality videos! You are my go to creator for AI content.

    • @aktchungrabanio6467
      @aktchungrabanio6467 2 місяці тому +2

      Thank you for being so candid

    • @ClayMann
      @ClayMann 2 місяці тому +7

      I can't even describe what Sora is doing from models a year ago as an exponential leap. Its not twice as good or even 10x. Its somewhere my mind can't even measure. the style transformations, the morphing, the temporal accuracy and super stable occlusion. Its all just, well magical is all I can come up with. If we got one more leap like this in another year we're in a completely new world that I do not think the public are ready for. Imagine real-time Sora *slow motion mind explosion*

    • @scaryjam8
      @scaryjam8 2 місяці тому +1

      Blindsided*

    • @theeternalnow6506
      @theeternalnow6506 2 місяці тому +3

      Agree on this one. This one genuinely made a leap forward that caught me off guard.
      Now think what we're getting 6 to 12 months from now.
      Google with the 10 million tokens.
      Its going to get wilder and wilder very rapidly.

    • @ShawnFumo
      @ShawnFumo 2 місяці тому +1

      Yeah I felt like this at the end of last year actually. After keeping track of image generation since MidJourney v3, I had some idea of the quality I thought we’d have at the start of this year. But we were already past it by probably by the third quarter of the year. And now Sora is so beyond that. It is like v4 or v5 quality at a minute long instead of a single frame. And with all the good stuff Runway and Pika have done, the 4s limitation is still a huge limitation. But I’m sure they’ve looked closely at what OpenAI has said and the papers they referenced and are working on their response already.

  • @shadowdragon3521
    @shadowdragon3521 2 місяці тому +33

    12:33 I believe the social response people are supposed to give is along the lines of "omg how am I supposed to tell what footage is genuine and what is generated anymore?". I don't think he was talking about filmmakers' jobs getting replaced.

    • @chrism1503
      @chrism1503 2 місяці тому +2

      I think people talking about filmmakers’ jobs being replaced is absolutely part of the “social response”.

    • @neutra__l8525
      @neutra__l8525 2 місяці тому +3

      @@chrism1503 Yes its part of it, but as mentioned in the video, this was released as somewhat of a warning as to what is coming. Sure, a warning to everyone involved in film that their jobs may be in trouble is necessary, but it is also only letting them know that they are facing the same challenges in the near future that almost everyone else is.. unemployment. However not being able to differentiate fake footage from real footage (should that happen) becomes a massive problem for all of society as it throws the legal system into utter chaos. If the legal system fails, society could quickly crumble. That is a much bigger problem than the film industry. And as we know, governments are slow and lumbering, while AI has created new problems before the government has even heard of the old problems. And the problems get worse every minute. These companies need to slow down the pace massively, but they wont. Who is going to slow down on developing the greatest and last technology that humans will ever create. Its winner takes all and everyone knows it.

  • @Theonlyrealcornpop
    @Theonlyrealcornpop 2 місяці тому +91

    OpenAI's text-to-worldbuilding follow-up - combined with Apple's silent unveiling of Apple's KeyFramer for animation - legitimately blew my mind. I just don't even know how creatives as individual contributors are expected to integrate this into their workflows with the pace it's moving - and that's literally my entire job

    • @JohnSmith762A11B
      @JohnSmith762A11B 2 місяці тому +16

      It's true. I'm overwhelmed with creative possibilities but know if I wait just a bit longer I'll have even better set of tools ready to go. It's all starting to feel a bit "singularity" as its exhausting even to try to keep up with.

    • @RosscoAW
      @RosscoAW 2 місяці тому +27

      Weird, it's almost like our socioeconomic system is even more woefully inadequate for dealing with the realities of a legitimately semi-automated, borderline post-scarcity world than it is at dealing with our normal, industrialized, globalized blue collar world. I wonder if anybody has ever devised an alternative economic system predicated on adapting to and accomodating the changes necessary with a highly industrialized economy and a work force of intellectuals instead of 90%+ peasants. If they had, I bet it would have a boring name like "socialism," or something. 😂

    • @JBroMCMXCI
      @JBroMCMXCI 2 місяці тому +19

      @@RosscoAW name one communist regime that didn't genocide its intellectuals

    • @NihongoWakannai
      @NihongoWakannai 2 місяці тому +7

      ​@@RosscoAW how do you see AI automating a bunch of highly creative white collar jobs and come to the conclusion that peasantry is ending?

    • @basilmcdonnell9807
      @basilmcdonnell9807 2 місяці тому +9

      I spent 20 years building and maintaining workflow systems for animation. As of now the industry, all of it, is at a dead standstill. No one knows what to do with this stuff. How do you go from script to storyboard to animation to render now? We don't even know the job titles any more. How do you propose a budget for a show when you have no idea how to make it?

  • @iandanforth4313
    @iandanforth4313 2 місяці тому +137

    Correction: Both videos in their interpolation examples *are* generated by SORA.

    • @h-di4qd
      @h-di4qd 2 місяці тому +15

      i thought so too. the fact that it's open for correction and second guessing is indicative of how advanced it is. ohhhh, i'm not looking forward to the era of generated political and global conflict videos.

    • @sebastianjost
      @sebastianjost 2 місяці тому +8

      you're right. This is also indicated by the changing watermark in the bottom right corner.

    • @GS-tk1hk
      @GS-tk1hk 2 місяці тому +13

      I was gonna say the same thing, it is pretty clear if you look at the people moving around, doesn't quite look right. Still, the fact that you can barely tell apart a real video and an AI video is just bonkers, this really is the DALLE-2 moment of text to video.

    • @dunar1005
      @dunar1005 2 місяці тому

      you must have missed his own research papers @@JBroMCMXCI

    • @thanos879
      @thanos879 2 місяці тому +19

      @@JBroMCMXCI That's totally false. This guy always reads the research papers and everything. Even finding mistakes in the papers. And has interviewed people in the industry. And I'm sure a lot more that I don't know about. UA-camrs make it look effortless.

  • @jamescoholan
    @jamescoholan 2 місяці тому +57

    Only AI channel that doesn't use clickbait, auto-generated titles. Thank you

    • @lamsmiley1944
      @lamsmiley1944 2 місяці тому +7

      Some people are “shocked” by everything.

    • @Citrusfemboy
      @Citrusfemboy 2 місяці тому +13

      @@lamsmiley1944 NEW AI MODEL SHOCKS ENTIRE INDUSTRY, MAKES SAM ALTMAN SHIT HIS PANTS AND CRY!!!

    • @TheArtificialAnalyst
      @TheArtificialAnalyst 2 місяці тому

      😂

    • @kengat1637
      @kengat1637 2 місяці тому

      ​@@lamsmiley1944To be honest, this day was really shocking for me.

  • @KitcloudkickerJr
    @KitcloudkickerJr 2 місяці тому +68

    "The idea that a machine learning model can have a basic understanding of the world, even if it is not perfect, and be used to train other models is incredible. This is just the first step, and it can only improve from here."

    • @aspuzling
      @aspuzling 2 місяці тому +16

      I wonder if it's possible to train a multi-modal model on physics simulations so it can have a better grasp of physical reality. There is an infinite amount of data you could generate as training. I feel like it would be similar to how humans gain an understanding of physical reality i.e. by trial and error and lots of observation.

    • @KitcloudkickerJr
      @KitcloudkickerJr 2 місяці тому +3

      @@aspuzling im willing to bet Jim Fan is working on this

    • @glowerworm
      @glowerworm 2 місяці тому

      ​@@aspuzlingjust feed it geant4 and all data from pdg and nist and you might have exactly that.

    • @ClayMann
      @ClayMann 2 місяці тому +5

      but i think that's the point being made, there is no understanding of the world. Its just such vastly enormous pattern matching across these huge temporally stable latent spaces that it looks so understood. How people move, blink, the way clothes behave, light and reflection. But all that is really just data to Sora that its somehow tapping to make more absurdly realistic stuff. The glaring errors sometimes show the huge lack of understanding but not enough for it to not be an astounding and super usable thing already as it is. And it can only get better.

    • @KitcloudkickerJr
      @KitcloudkickerJr 2 місяці тому

      @ClayMann disagree. It has no strong understanding of physics in OUR world. True. It, it has some level of weak understanding of physics and it's own world model based on its data. That can be seen in interactions of assets, like the pirate ships in the cup of coffee storm

  • @MemesnShet
    @MemesnShet 2 місяці тому +17

    You just dropped so many bombs of the implications of this project and future plans of Open AI and much mode that its hard to keep track of wow
    Even tho this channel is very fast paced on whats happening right now i believe making short compilations by topic of all the incredible predictions,scoops and information gems that you keep finding instead of having them scattered throughout the videos would BLOW PEOPLES MINDS!
    Im sure there are many people interested in AI that have no idea about all the plans and projects that Open AI has been working on aside of LLMs
    Your videos are amazing with information gems across all your catalog of videos and I believe showcasing those gems specially those that mainstream media hasn't even catched up to yet would blow this channel into the stratosphere and beyond as it should.

  • @GrindThisGame
    @GrindThisGame 2 місяці тому +10

    This is my favorite YT channel (and I'm subbed to 100s of channels). I watch every episode from start to end. Thank you for doing what you do.

    • @theeternalnow6506
      @theeternalnow6506 2 місяці тому

      Agree. This really follows whats going on in real time and its wild.

  • @RazorbackPT
    @RazorbackPT 2 місяці тому +69

    7:45 "The video you see was NOT generated by Sora" Are you sure? It really looks like it is. The stairs that lead nowhere, the choppy motion of the people.

    • @JohnVance
      @JohnVance 2 місяці тому +14

      I caught that, too. The circling drone shot video was absolutely one of the ones included in the demos.

    • @einruberhardt5497
      @einruberhardt5497 2 місяці тому +9

      Yes i think that is wrong it is actually generated by sora as far as i know.

    • @aiexplained-official
      @aiexplained-official  2 місяці тому +60

      Yeah my bad. I should have said 'need not have been made by'

    • @einruberhardt5497
      @einruberhardt5497 2 місяці тому

      all good i am just happy that after watching you since the start this is the firsttime i feel like i have contributed something :D@@aiexplained-official

    • @simpleidindeed
      @simpleidindeed 2 місяці тому +5

      This shows the performance of Sora.

  • @Madlintelf
    @Madlintelf 2 місяці тому +16

    It's one thing to have hindsight and look back and realize you lived through significant historical period, it's quite another to realize it's happening in real time and there is no end in sight! What a time to be alive, thanks for documenting as much as you can.

    • @theeternalnow6506
      @theeternalnow6506 2 місяці тому +4

      Yeah. The future feels incredibly uhhh unpredictable in what its actually going to look like.
      I do know that we're in a science fiction movie and its going to get crazier and crazier very soon.
      Those reports of deepmind synthesizing 2 million potential new materials, etc. All the new things that ai is currently creating will have its own ripple effects in industries and its going to get really fucking wild pretty soon. This video at the end shows the robot walking and ive been convinced for a while now that we're going to have actual robots that we can talk to walk around in certain places within 5 years. Might even be 3 at the current rate.
      Its nuts.

  • @alexgonzo5508
    @alexgonzo5508 2 місяці тому +59

    I predict "infinity films", where AI just continuously adds more plot content to the end of a film indefinitely. There will be movies with run times measured in years.

    • @HoD999x
      @HoD999x 2 місяці тому +8

      nobody will watch those

    • @alexgonzo5508
      @alexgonzo5508 2 місяці тому +14

      @@HoD999x You can never get all the people all the time, but you will always get some of the people every time. That's the lesson i've learned from observing the internet, and human nature.
      I know of things that i would never even consider watching that some obscure demographic is completely obsessed with.

    • @Jim-su6ss
      @Jim-su6ss 2 місяці тому

      ​@@HoD999xlol

    • @alexgonzo5508
      @alexgonzo5508 2 місяці тому +8

      @@HoD999x You can probably say more accurately that "nobody will be able to finish watching those".

    • @aaronl9172
      @aaronl9172 2 місяці тому +12

      It takes over a year to watch all of General Hospital (just short of 16k episodes), so it kind of exists, and some people would certainly watch it

  • @bryanp8042
    @bryanp8042 2 місяці тому +101

    The biggest implication I see with this is what this means for multi-modal models. This is currently caption->video, but if the technology behind this were implemented into a multimodal GPT model (which I get the feeling is already happening behind the scenes), the implications are absurd. Having spatio-temporal abstractions of this fidelity existing in the same parameter space as text abstractions would have massive implications for the reasoning capability of GPT models. OpenAI themselves posed SORA as a world simulator in their technical report, imagine what future GPT models might be capable of if they can internally visualize the world to this degree.

    • @GrindThisGame
      @GrindThisGame 2 місяці тому +10

      They have eyes and ears. With Optimus they will have touch.

    • @urhot
      @urhot 2 місяці тому

      @@GrindThisGameare they partnered with Tesla?

    • @concernedindian144
      @concernedindian144 2 місяці тому +6

      Absolutely, imagine you ask a question and GPT simulates the reality of question and then start answering, that would be AGI

    • @gclip9883
      @gclip9883 2 місяці тому +7

      @@GrindThisGame I'm sorry, but i'm still extremely sceptical about Optimus. Whereas OpenAI managed to actually back up their claims, Tesla has done nothing but make massive promises that they couldn't deliver. They haven't solved FSd and are in fact behind compared to other companies. The new robot looks cool but uses technology that has existed in robotics for decades. The only real innovation with their robot are their motors, but that is not exactly groundbreaking. I'm happy to be proven wrong, but until then i would not put Tesla anywhere near OpenAI in terms of innovation.

    • @wolfganggager5110
      @wolfganggager5110 2 місяці тому

      Yes, but in my opinion their technical approach is extremely resource-intensive and blurred. But maybe that will change soon with knowledge graphs.
      ua-cam.com/video/nPG_jKrSpi0/v-deo.html

  • @michaelwoodby5261
    @michaelwoodby5261 2 місяці тому +14

    I feel like Sora absolutely demonstrates understanding. A camera moving through a scene, keeping track of everything it has shown while inventing new parts, WHILE tracking animated beings and keeping them consistent, could only be created by a world model.
    You can do it in a video game which has a world and physics already mapped out in it, but that's not how Sora works. It's relying on a mental map of objects and their places and how they react to each other. I don't know how else to describe understanding the outside world.

    • @kedrednael
      @kedrednael 2 місяці тому +3

      The trick to make this work was to generate the entire video at once. So I think, to keep things temporally consistent is not really different for this AI than learning that a hand is attached to an arm spatially.
      But I do agree it does demonstrate some understanding, as does chatGPT & static imagine generators.

  • @trentondambrowitz1746
    @trentondambrowitz1746 2 місяці тому +4

    Brilliant as always, seems like the all-nighter was worth it!
    Sora was such a surprise to me, I almost brushed it off when I first saw the announcement.
    Upon reflection this is certainly a GPT-4 type moment. As Sam Altman said, they’ve “pushed back the veil of ignorance.”

  • @QuickM8tey
    @QuickM8tey 2 місяці тому +14

    I showed some of the Sora videos to friends and they suspected some of it was ai generated considering my passion for the topic, but none of them guessed the entire videos were. I cannot even imagine what Sora videos will look like 1-2 major upgrades later. I'm hoping there's a breakthrough with math and llms for education by 2025. Great video man

  • @thecaveman2871
    @thecaveman2871 2 місяці тому +1

    Your videos are awesome man. Im so glad that the quality of your content just keeps getting better.

  • @MemesnShet
    @MemesnShet 2 місяці тому +39

    For me the chair video is very impressive because it feels like a very real video either showing a weird glitch in reality or with very impressively realistically looking but weird VFX on top
    I wonder how AI will change the VFX industry

    • @sanseverything900
      @sanseverything900 2 місяці тому +18

      I was in the VFX subreddit today (r/vfx) and a lot of effect artists there are worried.

    • @h-di4qd
      @h-di4qd 2 місяці тому +3

      yes! and the animation industry too. I'm not excited for the economic ramifications of AI.

    • @winsomehax
      @winsomehax 2 місяці тому +2

      The VFX industry is going to be obliterated. Which is probably for the best - without getting too far off topic, Hollywood has operated on fantasy budgets for decades. All the money is siphoned out in production for tax purposes. That means films never make profit. All the money has gone - disappeared into a vastly complex network of companies charging colossal amounts for trivial things. The process had already started with things like consumer PCs, Blender, UE5, digital cameras making film creation a thing of talent not money, but AI will accelerate it further. Hollywood kept trying to make out that it really did cost $200 mill to make a film and was just running out of ways to keep up the act. Now these AIs come along and show that very soon it will be a thing of imagination, not whether you can draw. Meanwhile, the rest of the world will be using it make inexpensive media that looks like big budget Hollywood films. It's going to be interesting to see how the crooks in Hollywood try to stay relevant... but if you're looking for one source of the AI doomer noise. It's them, until they can figure out a way to keep their money coming in.

    • @xjohnny1000
      @xjohnny1000 2 місяці тому +2

      I'm a long-time vfx artist and producer and I think AI will replace 90% of vfx artists in the near future, and eventually all of them. Not that it really matters though. VFX is one of the cheapest parts of a movie and employs very few people as an industry. The economic fallout will be almost non-existent.

    • @skierpage
      @skierpage 2 місяці тому

      ​@xjohnny1000 Then why did the Visual Effects section of a Hollywood movie's end credits run on and on and on and on and on for 2 minutes listing of hundreds of people at multiple VFX houses? Name a larger part of a blockbuster movie: construction, costumes, sound, etc. don't seem to come close.

  • @anthony4403
    @anthony4403 2 місяці тому +5

    Phrases like "made by humans", "created by real people", "No AI used", etc.. are going to be a big selling points for many art related products in the future

    • @aiexplained-official
      @aiexplained-official  2 місяці тому

      Indeed

    •  2 місяці тому +1

      And we probably won't be able to tell if it's true.

    • @arenshichic1203
      @arenshichic1203 2 місяці тому

      ​@ work in progress shots are necessary to be attached with those phrases

  • @chillingFriend
    @chillingFriend 2 місяці тому +1

    Literally my favourite UA-cam channel, thank you once again!

  • @steffenaltmeier6602
    @steffenaltmeier6602 2 місяці тому

    holy crap, the art gallery is amazing! all those different artworks, truly incredible!

  • @Macieks300
    @Macieks300 2 місяці тому +6

    The fact that that Berkley robot was deployed 0-shot is crazy to me. It means that truly when AGI comes the hardware won't stay that far behind and won't be actually its biggest limitation.

  • @shauryai
    @shauryai 2 місяці тому +110

    FYI : sora means sky in Japanese!
    Referring to its limitless creative potential.

    • @bnadem.panormal
      @bnadem.panormal 2 місяці тому +14

      It also means "image" in arabic

    • @alireza5218
      @alireza5218 2 місяці тому +7

      sama, altman's x handle, also means sky in arabic. I don't know what to do with this information.

    • @pluto9000
      @pluto9000 2 місяці тому +19

      Soranet😬

    • @GamingXperience
      @GamingXperience 2 місяці тому +5

      @@pluto9000 oh no.

    • @user-hh2is9kg9j
      @user-hh2is9kg9j 2 місяці тому +2

      ​@@alireza5218 it is just his name. Sam + a(initial of Altman)

  • @wealthycow5625
    @wealthycow5625 2 місяці тому +1

    Love every review! It's actually insane how fast AI is progressing, from spaghetti to actual photorealistic video in a year. Seems to be the trend for pictures, and now video.

  • @EthanHaluzaDelay
    @EthanHaluzaDelay 2 місяці тому +14

    I'd love to hear you go into more depth on the links between video generation and simulation-that's literally what OpenAI titled their paper. The implication that this is a major step towards coherent world-modelling is not commonly grasped

  • @vladdata741
    @vladdata741 2 місяці тому +5

    Great analysis. It's crucial to see how Sora feeds into the accelerating feedback loops for AGI. Pair it with a vision model which selects accurate videos and discards the bad ones: you have a synthetic generator of endless high-quality video data. Pair it with an LLM, you have an agent who can imagine its action plan in a 3D environment (like we do) and simulate 3D scenarios to think about physics and other problems. Put all of these in a robot... Well you can see where this is going.

    • @skierpage
      @skierpage 2 місяці тому +1

      I wonder if Sora had a fine-tuning step where they said now that you've learned about all the features and textures and visual appearances of millions of items in video scenes, now here are the best video clips to learn what makes a great video. Similar to how some LLMs are fine-tuned by re-reading all of Wikipedia.

  • @TheoreticallyMedia
    @TheoreticallyMedia 2 місяці тому +3

    Out of all the Sora titles I've seen, this one is by far the best. Stellar pun here, just stellar!

    • @UnknownSend3r
      @UnknownSend3r 2 місяці тому

      I didn’t catch the pun, or has the title changed ?

    • @skierpage
      @skierpage 2 місяці тому

      ​@@UnknownSend3rthe video thumbnail/title card for me is "No one Sora it coming".

  • @adamas34
    @adamas34 2 місяці тому +2

    Your last take is among the most important ones from the video: People can no longer be sure whether a video was human- or AI-generated, because you just don't know (at least not consistently) the edge cases where current models are failing, but see perfect illustrations among the examples. The quality reaches a level where you can always optimistically guess that it was artificially generated, as SOME examples have reached the highest bar of our qualitative perception. This is truly an important (and arguably scary) milestone.

  • @supermario_ai
    @supermario_ai 2 місяці тому

    🎯 Key Takeaways for quick navigation:
    00:00 Sora, *an AI text-to-video model from OpenAI, is generating excitement and concern due to recent demos and technical reports.*
    05:02 Sora *can generate videos up to 1080p, trained on high-resolution images, with hints of its implementation sourced from various papers, notably from Google.*
    11:12 Sora's *implications extend to creating universally shareable 3D landscapes, potentially revolutionizing entertainment and virtual experiences.*
    12:19 OpenAI's *dominance in AI innovation raises concerns for startups and sectors they may disrupt, hinting at broader economic and technological implications.*
    Made with HARPA AI

  • @couperino
    @couperino 2 місяці тому +5

    Things are moving faster than expected....Welcome in the year of the Dragon

  • @seniorp9444
    @seniorp9444 2 місяці тому +8

    The Sora video of the gold rush in CA really struck me as I realized we are about to have AI recreations of any historical event that has enough pictures to train on. Would not even need to be photos if the paintings are good and plentiful enough 😅

  • @LabGecko
    @LabGecko 2 місяці тому

    Superbly done! Thanks for all your hard work.

  • @Modioman69
    @Modioman69 2 місяці тому +1

    Incredible milestones have been achieved and faster than I ever expected wow. Now imagine we’re playing alchemy and combine Sora with the Morpheus-1 from Prophetic (real project.) = Holodeck or real life matrix possibilities? I think this is where video games won’t be limited by interfacing with controller/mice/keyboards, anymore but instead actual interactive brain simulations which might look similar to the show Peripheral as well. What a time to be alive. Keep being awesome and making such top tier content kind sir. I cannot wait to see what rolls out next.

  • @21EC
    @21EC 2 місяці тому +3

    it took me time to realize but the future of this tech is probably even more insane than that since it understands 3D space accurately presumably so it means that this tech in the future might one day be able to run completely in real time making it possible to be experienced in virtual reality googles (by also splitting the same single image into two different image angles for 3d depth effect), it's so remarkably advanced and revolutionary that I think we still don't fully grasp how powerful this is going to be in the future, who knows it might even replace game engines at some point and do magical things in real time. edit : I was writing this comment before I saw the video, cool to see that my insight and prediction of future useage of this tech is like yours

  • @TheChadavis33
    @TheChadavis33 2 місяці тому +6

    Absolutely incredible.
    People really need to stop being surprised by the level of pace forward. It probably won’t be long for a feature length film.

    • @kevincrady2831
      @kevincrady2831 2 місяці тому +1

      If it can make 1-minute videos, how long would it take for someone or a team to make 90 of those strung together?

  • @PasseScience
    @PasseScience 2 місяці тому +2

    The "finishing by a given frame" feature is particularly useful for AGI because it opens to planification features, you can have the decision-making unit that learns to project what it wants (ie having an apple in its hand) and an inductive inpainting unit that fills the gap. Instead of inpainting in a video it would just be inpainting on sensory and motor data then end up by the agent having an apple in his hand. The novelty with sora is that the scale at which it operates seems clearly enough, if it can inpaint a video, it can inpaint sensory and motor pieces of information.

  • @huntingghosts
    @huntingghosts 2 місяці тому

    crazy times. thank you for the in depth coverage!

  • @alihms
    @alihms 2 місяці тому +6

    Soon, you will be the actor in your own customized movies. I am envisioning the "Total Recall" like movie where you imagine yourself as a fugitive in a Martian colony, trying to prove your own innocence. That is the basic plotline. But the scene details, the characters and the way the final ending is reached will be different for everyone experiencing (as opposed to watching) the movie.

  • @busyworksbeats
    @busyworksbeats 2 місяці тому +17

    Mind blowing! 🤯

  • @pacotato
    @pacotato 2 місяці тому

    Thank you for the wonderful quality of all of your videos. Your content is great!

  • @raydosson2025
    @raydosson2025 2 місяці тому +1

    Excellent video as always. Thank you!

  • @jeff__w
    @jeff__w 2 місяці тому +2

    Dazzling-both the capabilities of Sora _and_ this video! I don’t have to tell you that you’re doing an amazing job here, Philip!
    (And I say that as someone who tends to find almost all computer-generated images and video pretty aversive. I’m not so sure I’ll _ever_ really like these AI-generated videos-there’s something about them that feels too, well, _pristine_ and maybe there’s a bit of bias _knowing_ they’re AI-generated videos-but it’s early days yet.)

    • @h-di4qd
      @h-di4qd 2 місяці тому +1

      I agree with you. even if, say, a videogame was created that looked just like a human-made one (but presumably better), I think I'd still prefer a human-made one. Not only from a sort of ethical perspective (supporting the livelihood human creators), but because there's an element of communication between the creator of an art piece and the consumer. And I think human creators are inherently more interesting.

    • @jeff__w
      @jeff__w 2 місяці тому +2

      @@h-di4qd Yeah, I agree as to the human creators, although I didn’t really have the AI-generated videogame ones _per se_ in mind when I made the comment but, really, the AI-generated ones that are supposed to look like something filmed in the real world or something that bears some resemblance to the real world (e.g., the “oter” 11:32 whose fur looks distinctly unreal). Then, again, I’m probably an outlier-I can’t stand the look of _anything_ by Pixar.
      As an aside: I think a major reason why everything in Stanley Kubrick’s _2001: A Space Odyssey_ looks so amazingly good more than half a century after its release, aside from Kubrick’s virtuosic attention to detail and verisimilitude, is that they’re all _practical effects,_ produced “in camera.”

    • @glowerworm
      @glowerworm 2 місяці тому +1

      ​@@h-di4qdwell I would think the idea is that once ai can be trained on things that aren't super clean and over-produced stock images (sora was trained on shutterstock), AI might be much more capable of yielding you exactly the look and themes you want. So it wouldn't be creating the art, it'd be the tool the humans use to create art much easier.

    • @jeff__w
      @jeff__w 2 місяці тому

      @@glowerworm “…once ai can be trained on things that aren't super clean and over-produced stock images…AI might be much more capable of yielding you exactly the look and themes you want…”
      Oh, sure, they _might_ be but the problem might not be just “super-clean and over-produced stock images,” it might be that, for videos (1) AI has difficulty learning the physics of precisely how light falls on certain objects, how those objects move, and so on, and that (2) people having evolved in the real world over hundreds of thousands of years are very highly tuned to how the real world looks. (I’m not saying “never”-just that nailing the videos _might_ be more difficult than it might appear at first glance. Then again, no one could have imagined that the videos would be _this_ good even a few years ago.) And, for some things, like, say, B roll footage, that people don’t not pay much attention to, the videos might, in fact, be “good enough” even now.

    • @glowerworm
      @glowerworm 2 місяці тому +1

      @@jeff__w on the other hand computers are already much, much better at simulating laws of physics than artists are. So at least as far as animation goes I'd expect ai to do a good job rather soon since lacking physics is already accepted in that medium.
      I'd think it'd just take geant4, pdg data, and nist data to have AI start accurately simulating inner workings of physics detectors/medical radiology in generated video, for example.

  • @GarrisonSiberry
    @GarrisonSiberry 2 місяці тому +8

    Animated Harry Potter style pictures hanging on the wall would be fun. You could even talk to them

  • @jesusmartinez4341
    @jesusmartinez4341 2 місяці тому +1

    This was really well done.

  • @Dannnneh
    @Dannnneh 2 місяці тому

    Was looking forward to this breakdown, am not disappointed. Good point about OpenAI subsuming any inkling of competition.

  • @patronspatron7681
    @patronspatron7681 2 місяці тому +5

    The most important observation in this video is not the capability of Sora but the voracious appetite of OpenAI to swallow entire AI categories (and associated start-ups) with the release of a single product. This propensity is a cautionary warning for any VCs who want to invest in AI innovation and will likely centralise all AI delivery into the hands of a few mega corporations.

  • @jwilder2251
    @jwilder2251 2 місяці тому +6

    I actually thought the “inaccurate physics” glass spilling/breaking was the coolest video of them all

    • @skierpage
      @skierpage 2 місяці тому +2

      The architectural dig where they unearth a plastic chair that could levitate and morph was a scene from a science fiction TV series like the X Files or Stargate, but with phenomenally good special effects. Remember flash mob videos wherein a bunch of people start dancing in public? Artists will make videos where crazy things happen like a man unscrews the top of his head and ladles out milk and cereal, and the people around don't react.
      (C) 2024 skierpage 😉

  • @AdrienSales
    @AdrienSales 2 місяці тому

    content mixing is just MIND BLOWING !!!!

  • @JoelEngineer
    @JoelEngineer 2 місяці тому

    Incredible! Thank you for using your talents to give us this the world-changing news! Question: There is so much to moving so fast. 've been learning about transformers and it already seems like OpenAI and Deepmind researchers have already moved on or improved on these architectures. Which architectures should I study, in order to get a better idea where the technology will be moving in the next few months? Again, Great Work!!

  • @keyser1975
    @keyser1975 2 місяці тому +7

    The best UA-cam channel on AI full stop

  • @Hydde87
    @Hydde87 2 місяці тому +3

    I was so close to becoming disappointed that you would be the only content creator discussing Sora that didn't include the Will Smith eating spaghetti clip for comparison. But you saved the video at the end!

    • @flyingstapler1241
      @flyingstapler1241 2 місяці тому +1

      That comparison was misleading and it's bad that so many influential people are spreading it. Will Smith eating spaghetti was generated by Modelscope, one of the worst models for AI video generation back then. They should've been using Runway's Gen 2 results instead.

    • @aiexplained-official
      @aiexplained-official  2 місяці тому +2

      I did caveat with 'around' but yeah google had a slightly better model a few months earlier

  • @stephenrodwell
    @stephenrodwell 2 місяці тому

    Two videos, you spoil us! 🙏🏼

  • @rollingmancave4547
    @rollingmancave4547 2 місяці тому

    Your content is always kickass!

  • @DavidsKanal
    @DavidsKanal 2 місяці тому +17

    Small correction: The video at 7:41 is actually generated by Sora - it's included in the official announcement post (not the technical report). Now, I really hope this wasn't a troll from your side and you're gonna reveal at the end of the video that you were just testing our inability to recognize AI videos :D

    • @aiexplained-official
      @aiexplained-official  2 місяці тому +5

      Haha no, but point stands!

    • @dunar1005
      @dunar1005 2 місяці тому

      i thought the same, that you will reveal it.@@aiexplained-official

  • @mrcool7140
    @mrcool7140 2 місяці тому +3

    While i can definitely admire the technical side of things (I watch your videos for a reason), those outputs trigger massive uncanny valley effetcs for me. That costal drone shot for example... absolutely terrifying. The stairs may lead to f-ing nowhere, but the weather sure is perfect 🎉. A model of the world thats literally built on stock footage is a dystopia I couldn't even have imagined 15 minutes ago.

    • @RosscoAW
      @RosscoAW 2 місяці тому

      Best part, all of it's massive economic potential is at risk of being absorbed, and curtailed for the sake of control, by a tiny set of relatively small tech companies who'd rather see everybody live off universal basic income than to accept allowing their AI models to be owned in common and held in trust for the collectivity of mankind and our descendants.

  • @user-pf9jv1fl2n
    @user-pf9jv1fl2n 2 місяці тому

    Great video as usual. This year is text to video chatgpt moment ☺️ so exciting to witness.

  • @yuri.mariotti
    @yuri.mariotti 2 місяці тому +2

    You make such GOOD videos, in so many ways

  • @mawungeteye6609
    @mawungeteye6609 2 місяці тому +7

    I can see Google releasing Lumiere 2.0 in a bit with mixture of experts to generate hour long videos to counter Sora sooner than later

  • @MrMiguelChaves
    @MrMiguelChaves 2 місяці тому +3

    7:47 You said that input video wasn't generated by Sora, but it was. It is included in yesterday's demo. You can even see some minor errors (people walking into a wall near the stairs, for instance)

    • @aiexplained-official
      @aiexplained-official  2 місяці тому +2

      My bad

    • @MrMiguelChaves
      @MrMiguelChaves 2 місяці тому +1

      @@aiexplained-official No problem. I forgot to say: I like your videos. Keep up the good work!

  • @cory99998
    @cory99998 2 місяці тому +1

    As a hobbyist creator, I love that before long I'll be able to draw keyframes for my animations and AI can stitch together the in-betweens, and hopefully mimic style guides I give it. Let me focus on the story, not the frames

  • @GrandmaSiva
    @GrandmaSiva 2 місяці тому

    Thank you so much for the video! This gives me a little excitement, as if I just received a present.

  • @justadog-headedman6727
    @justadog-headedman6727 2 місяці тому +4

    Around 5:50
    Ties to that idea that sufficiently advanced technology is indistinguishable from magic, because to "bring deceased loved ones" to life would be necromancy

    • @GrindThisGame
      @GrindThisGame 2 місяці тому +1

      I can see Google Photos adding a "animate this" or "create a movie about grandma playing with the following grandchildren".

    • @Hexanitrobenzene
      @Hexanitrobenzene 2 місяці тому +2

      That's one of the most unwise ideas there is. Very toxic psychologically.

    • @camoraz
      @camoraz 2 місяці тому

      @@Hexanitrobenzene Yeah I'm absolutely opposed to the idea

    • @skierpage
      @skierpage 2 місяці тому +1

      Watch "Be Right Back," Black Mirror season 2 episode 1. Charlie Brooker is no longer a scriptwriter, he's a documentarian!

  • @milesgrooms7343
    @milesgrooms7343 2 місяці тому +3

    So would you be able to enter a complete novel into the AGI, give certain structure of cinematography etc etc (sorry don’t have enough film language) but allow it to create a film an almost infinite number of times and choose “your” masterpiece??

    • @aiexplained-official
      @aiexplained-official  2 місяці тому +1

      Yep

    • @milesgrooms7343
      @milesgrooms7343 2 місяці тому

      @@aiexplained-official my ignorance grows as I attempt to learn more! Thank you for your channel!

  • @toddwmac
    @toddwmac 2 місяці тому +1

    AI Explained....still the best AI News, Insights and Predictive Analysis out there. I was near the epicenter of the PC, GUI and Internet revolutions, and spent decades describing scenarios that, at the time. were straight out of SciFi. The scenarios you describe and imagine here bring all those memories back to life.... and then some. Thanks for the trip down memory lane and a glimpse into some of our potential futures.

  • @_abdul
    @_abdul 2 місяці тому +1

    "AI Explained" doing real hard working keeping us in Loop for this exponential AI growth, Your work is genuinely appreciated Man. Thanks for your work.

  • @MrGriff305
    @MrGriff305 2 місяці тому +9

    humanity can't handle this.. We're pretty screwed

  • @thedividendreport706
    @thedividendreport706 2 місяці тому +3

    Please correct me as I am just learning about this. For us Americans, the British pronunciation of the word "saw" (past tense of see) utilizes a triphthong ( a vowel sound comprising of three different vowels in one syllable) which makes the word "saw" sound pretty close to "soar" or "sora".
    The title of this video is thus deserving of praise from any person who appreciates Dad jokes.

  • @the_primal_instinct
    @the_primal_instinct 20 днів тому +1

    OpenAI's story reads like a dystopian book plot at this point. With noble beginnings and name and all that.

  • @brootalbap
    @brootalbap 2 місяці тому +2

    Thanks for not being another cheap clickbait dude. Always high quality stuff from you!

  • @HappyHater
    @HappyHater 2 місяці тому +6

    What a time to be alive!!!!
    Oh, sorry… wrong channel!
    :D

  • @MrPatcher86
    @MrPatcher86 2 місяці тому +5

    As someone who works in high end traditional content creation industry, i'm fucking terrified

  • @AIForHumansShow
    @AIForHumansShow 2 місяці тому

    Remarkable video as per usual. We send your videos to more people and really anyone else on YT.

    • @aiexplained-official
      @aiexplained-official  2 місяці тому

      Oh wow, just checked out your channel, looks incredible! A fun tour of the relevant AI news! Glad to be of service with my videos :)

  • @mckeedable
    @mckeedable 2 місяці тому +1

    Thanks again for a great video

  • @spacekitt.n
    @spacekitt.n 2 місяці тому +4

    this is cool and all but its really upsetting how more and more money is going to just be absolutely SHOVELED and PROJECTILE VOMITED at all the techbros while the artists starve and lose their jobs. the future is scary, we're all going to be replaced. Not to mention the DELUGE of fake and garbage youtube videos that are heading in our direction from this.

  • @solaawodiya7360
    @solaawodiya7360 2 місяці тому

    Thanks for reaction Philip ❤. Now this is truly a news that shocked me in a while

  • @AlexanderMoen
    @AlexanderMoen 2 місяці тому +2

    the speed of this all seems like pretty solid evidence in favor of the simulation hypothesis.

    • @aiexplained-official
      @aiexplained-official  2 місяці тому +1

      A lot of people have been saying something similar, including Altman

  • @iecoie
    @iecoie 2 місяці тому +3

    once again..
    terrible news

    • @flyinglack
      @flyinglack 2 місяці тому +1

      Great news

    • @iecoie
      @iecoie 2 місяці тому

      @@flyinglackOh, You are such a troll-ful contrarian, or (only) a Fool! Bless You. :)

  • @OffGrid-and-Ignorant
    @OffGrid-and-Ignorant 2 місяці тому +1

    I dont comment much but have to say im super grateful for your continued no bs approach to informing on the AI "news". Subscribing to patreon to support. Thank you

  • @unrealminigolf4015
    @unrealminigolf4015 2 місяці тому

    Thank you sir. All adds play through. 🎉

  • @unvergebeneid
    @unvergebeneid 2 місяці тому +1

    Even when it gets things wrong, the results look fascinating!

  • @AdrienSales
    @AdrienSales 2 місяці тому

    Now, let's see how to handle character consistency, this is so exciting !

  • @jossefyoucef4977
    @jossefyoucef4977 2 місяці тому

    First Pika and now this, even though we're not there yet we're making strides this year!

  • @GabrielVeda
    @GabrielVeda 2 місяці тому

    Nice. You had fun with this one.

  • @amortalbeing
    @amortalbeing 2 місяці тому

    Loved your video. keep up the good work

  • @vtsfly5
    @vtsfly5 2 місяці тому +2

    l felt AI animation was developing fast but would still need to cover a lot of ground. In no time Sora just jumped more than 70% of that ground. What a time to be alive!

  • @MugiwaraNoReemy
    @MugiwaraNoReemy 2 місяці тому

    Wow, absolutely outstanding

  • @eriklarsson2353
    @eriklarsson2353 2 місяці тому +1

    You're on it. Important work.

  • @prodigydeveloper7513
    @prodigydeveloper7513 2 місяці тому

    Look at her boots while she walks, in 14 seconds in the video you will see the boots change with the right boot disappear and appear back in a second. A tiny error, but only by paying attention will you see it. I’m impressed. How smooth AI rendered this.

  • @Niels1234321
    @Niels1234321 2 місяці тому

    Great video as always! I think the video at 7:46 has been generated by sora though, it's listed as one example on openai's blog post. Speaks for itself that it isn't obvious anymore whether or not a video is AI generated

  • @sachoslks
    @sachoslks 2 місяці тому +1

    Thanks for your videos man, always the best and fastest. When i saw the reflection of the girl on that train video i actually sat there with my mouth open, i could feel myself trembling with excitment. Feb 15th 2024 is an historic day in AI.
    Also, that Minecraft example is crazy, in their technical report they say "Sora is also able to simulate artificial processes-one example is video games. Sora can simultaneously control the player in Minecraft with a basic policy while also rendering the world and its dynamics in high fidelity. These capabilities can be elicited zero-shot by prompting Sora with captions mentioning “Minecraft.”"
    So you can actually imagine say a future Sora V4 running at 30FPS rendering an infinte game in real time. It's unbelievable.

  • @robsucher9419
    @robsucher9419 2 місяці тому

    Another very informative video. 😀

  • @BryanAlexander
    @BryanAlexander 2 місяці тому +1

    I'm fascinated by your idea of using Sora to create multiple versions of content (6:40 ff). It's a new twist on branching narratives.

  • @alexandrefruchaud1969
    @alexandrefruchaud1969 2 місяці тому

    Excellent video, thanks