New open-source AI video generator is INSANE

  • Published 11 Oct 2024
  • Pyramid Flow is now the BEST open-source AI video generator. Approaching Sora & Kling quality
    #ainews #ai #aivideo #soraai
    Thanks to uPix for sponsoring this video: Generate AI selfies in just 1 click.
    upix.app/
    Pyramid Flow: Pyramidal Flow Matching for Efficient Video Generative Modeling
    pyramid-flow.g...
    github.com/jy0...
    Newsletter: aisearch.subst...
    Find AI tools & jobs: ai-search.io/
    Support: ko-fi.com/aise...
    Here's my equipment, in case you're wondering:
    Dell Precision 5690: www.dell.com/e...
    GPU: Nvidia RTX 5000 Ada nvda.ws/3zfqGqS
    Mouse/Keyboard: ALOGIC Echelon bit.ly/alogic-...
    Mic: Shure SM7B amzn.to/3DErjt1
    Audio interface: Scarlett Solo amzn.to/3qELMeu

COMMENTS • 216

  • @theAIsearch
    @theAIsearch  2 days ago +11

    Thanks to uPix for sponsoring this video: Generate AI selfies in just 1 click.
    upix.app/

  • @andersonluciano3
    @andersonluciano3 2 days ago +54

    No Will Smith eating Spaghetti? Useless

  • @TheNjordy
    @TheNjordy 1 day ago +44

    I'm sorry, this is a good result for opensource, but when you compare video with Sora saying "not that much difference...". No. They are lightyears apart.

    • @hdfsgervda
      @hdfsgervda 1 day ago +3

      It's true, for the same prompt Pyramid Flow has worse results than what OpenAI Sora showed.
      But is it an apples-to-apples comparison on training compute? Or apples-to-apples on inference compute? What about the training data sets and algorithms?
      According to the research paper, Pyramid Flow was trained in China for 20,700 Nvidia A100 GPU hours.
      By the way, China is supposed to be sanctioned for Nvidia chips including A100 so it's interesting they advertise this.
      But compare that to Sora. I saw an estimate that "Sora used between 4,200 and 10,500 NVIDIA H100 AI GPUs for one month, with a single H100 AI GPU capable of generating a one-minute video in about 12 minutes, or around 5 x one-minute videos per hour".
      So the H100 is way more powerful than A100. And there's 730 hours in 1 month.
      So by that math (with likely several incorrect assumptions), it appears Pyramid Flow has been trained on a very tiny fraction of the compute used for OpenAI's Sora.
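
The back-of-the-envelope comparison in the comment above can be reproduced in a few lines of Python. This is only a rough sketch: it takes the quoted figures at face value (20,700 A100 GPU hours for Pyramid Flow, and an unverified estimate of 4,200 to 10,500 H100 GPUs running for one 730-hour month for Sora), and it counts an A100 hour as equal to an H100 hour, which if anything overstates Pyramid Flow's share. All variable and function names are illustrative.

```python
# Figures quoted in the comment above; none of them are independently verified.
PYRAMID_FLOW_A100_GPU_HOURS = 20_700
SORA_H100_GPUS_LOW = 4_200
SORA_H100_GPUS_HIGH = 10_500
HOURS_PER_MONTH = 730


def gpu_hours(gpu_count: int, hours: int) -> int:
    """Total GPU hours for `gpu_count` GPUs running for `hours` hours."""
    return gpu_count * hours


sora_low = gpu_hours(SORA_H100_GPUS_LOW, HOURS_PER_MONTH)
sora_high = gpu_hours(SORA_H100_GPUS_HIGH, HOURS_PER_MONTH)

# Simplifying assumption: one A100 hour counts the same as one H100 hour.
share_vs_low = PYRAMID_FLOW_A100_GPU_HOURS / sora_low
share_vs_high = PYRAMID_FLOW_A100_GPU_HOURS / sora_high

print(f"Estimated Sora budget: {sora_low:,} to {sora_high:,} H100 GPU hours")
print(f"Pyramid Flow share: {share_vs_high:.2%} to {share_vs_low:.2%}")
```

Under these assumptions, Pyramid Flow's training budget comes out at roughly 0.3% to 0.7% of the estimated Sora budget, before accounting for the H100 being faster than the A100.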

    • @Steger13
      @Steger13 1 day ago +1

      Sora doesn't exist! It never came out. The best at this moment is Kling.

    • @TheNjordy
      @TheNjordy 1 day ago

      @hdfsgervda in terms of numbers -- maybe. But what about the underlying tech? You can invest 100x more time and compute into a bad architecture for subpar results.

    • @PeterStrmberg007
      @PeterStrmberg007 13 hours ago +2

      I agree, in my tests this is pretty much unusable, now we're used to the quality of Minimax and Kling. Paid quality beats free sub-par at the moment (I'm not including you Runway, unusable footage AND extortionate prices is the worst combo)

  • @vytah
    @vytah 2 days ago +69

    It's not open-source as it doesn't allow unrestricted commercial use.

    • @theredknight9314
      @theredknight9314 2 days ago +32

      Thats only if they catch you….🤫

    • @vytah
      @vytah 2 days ago +29

      @theredknight9314 In that case everything is open-source until they catch you.

    • @xuimod
      @xuimod 2 days ago +6

      @vytah not if you have to pay for a license.

    • @theredknight9314
      @theredknight9314 2 days ago

      @vytah yep

    • @hqcart1
      @hqcart1 2 days ago +1

      @vytah how can they catch you????

  • @1sava
    @1sava 1 day ago +29

    Come on, this is not Sora level! Sora doesn't have as many morphing issues, and this isn't as realistic.

    • @beyounickvlog5285
      @beyounickvlog5285 1 day ago +4

      SORA doesn't exist Lol

    • @1sava
      @1sava 1 day ago +2

      @beyounickvlog5285 So all the film makers and artists that have been given access are just lying, right?

    • @beyounickvlog5285
      @beyounickvlog5285 17 hours ago +3

      @1sava that was a joke btw. And yeah, those artists create 100s of generations from the same prompts and cherry-pick the best one. You can watch their interview.

    • @1sava
      @1sava 15 hours ago

      @beyounickvlog5285 Fair enough. Cherry picked or not, they did generate hyper realistic generations. But this model is great for the Open Source industry

    • @angryox3102
      @angryox3102 8 hours ago

      @1sava it practically doesn’t exist. None of us have access to it.

  • @jzwadlo
    @jzwadlo 1 day ago +7

    The industry has been waiting for an open-source kebab video generator.
    That wait is clearly over

    • @theAIsearch
      @theAIsearch  1 day ago +2

      we must make the most realistic kebabs

  • @johnzach2057
    @johnzach2057 2 days ago +26

    Hollywood level movie with a simple prompt in the next 5 years. Not impossible.

    • @Z3DMANN
      @Z3DMANN 2 days ago +5

      It'll be done before Christmas. Initially AI films will have continuity issues but clips will be compiled together to resemble full length feature film formats by indie devs, probably within this month.
      Edit: As @MartinZanichelli mentioned, the audio will be a hurdle but Ik my statement to be true because I am going to do it.

    • @AntonioSorrentini
      @AntonioSorrentini 1 day ago +2

      3 years

    • @frankstrawnation
      @frankstrawnation 1 day ago

      At least 20 years. But Hollywood will become superfluous.

    • @dgomez611
      @dgomez611 1 day ago +6

      Ya'll are underestimating AI. Mark my words, it will be no more than 6 months.

    • @MartinZanichelli
      @MartinZanichelli 1 day ago +3

      Ok, but it will take you a lot of time to elaborate a really good script. Plan the scenes, arrange the footage with the sound. It will take you a lot of time and work, but you can do it yourself alone at home.

  • @linolino9306
    @linolino9306 1 day ago +12

    26 gig memory. Are you kidding me 😂😂😂

  • @favesongslist
    @favesongslist 1 day ago +14

    Maybe we could see book to video in the next couple of years.

  • @labmike3d
    @labmike3d 1 day ago +1

    The potential of AI generated videos is truly remarkable, particularly for architectural visualizations and establishing shots in zones where drone flights are restricted. Keep up the excellent work and enjoy the creative journey!

  • @MartinZanichelli
    @MartinZanichelli 1 day ago +5

    3:57 Sora still the best. However we do not know how much cherry-picking they have done.

  • @nehemiasvasquez8536
    @nehemiasvasquez8536 2 days ago +6

    Well, seems like at the end, technology came here to be OpenSource... The sora was left behind.

  • @brian_belanger
    @brian_belanger 1 day ago +3

    What is going to be fun is when AI video gets to the level that it can be fed a book and create a movie from it. And that will be here soon.

    • @JaBigKneeGap
      @JaBigKneeGap 1 day ago +1

      and i am a proponent of that. TOASTS TO FILMMAKING WITH AI TOMORROW!

  • @AdvantestInc
    @AdvantestInc 1 day ago

    The tech behind Pyramid Flow is a major step forward. Imagine the creative potential once the consistency improves. Can’t wait to see where it goes from here.

    • @theAIsearch
      @theAIsearch  1 day ago

      yes! plus since its open source and tunable, im sure the community will improve this fast, like they did w stable diffusion

  • @HolidayAtHome
    @HolidayAtHome 1 day ago +4

    "Sora Level Quality"? I think you have to rewatch the old Sora videos again. It's clearly far behind. You even show the astronaut video, and it's so obvious that it's morphing all over the place and getting blurry with strange double lines over time, while Sora is super stable and clear ;P But being open source is of course super interesting!

  • @DrayNoR1
    @DrayNoR1 2 hours ago +1

    For open source is good. But is not competitive right now

  • @ИванИванов-б8у4и
    @ИванИванов-б8у4и 2 days ago +7

    Tried a couple of prompts and making a video from a photo; not impressed so far.

  • @Roximoe
    @Roximoe 1 day ago +1

    Ai search knows its not as good but he has to be subtle and promote this stuff to keep the channel going yall dont get whats really going on here (thats why hes showing them side by side) hes actually showing us how good sora is shhhhh!!

  • @povang
    @povang 1 day ago +1

    The future AI video tech is clearly targeting the future and upcoming Nvidia 5090 cards at 32 GB. I have a 4090, but it looks like Im going to have to sell it soon and upgrade, wasnt planning on upgrading for many years...

  • @KingDeadMan
    @KingDeadMan 1 day ago +3

    2:04 Has four legs. 💀

  • @playthisnote
    @playthisnote 10 hours ago

    I do believe it takes Sora more compute. The latest models don’t need as much. Sora is probably almost two years old. But when they release their latest trained Sora it will be top notch. They raised compute power to a mega scale on a vid gen with less training sora 1.0 and you see how it looked.

  • @elgodric
    @elgodric 1 day ago +1

    This footage reminds me of the early versions of DALL-E!
    It's only gonna get better

  • @trevorama
    @trevorama 1 day ago +21

    You lost me at “It’s not much different” [from Sora]. Sadly, if I can’t trust your judgment, I can’t trust your channel.

    • @cajampa
      @cajampa 1 day ago +6

      And that "cat" looked pure nightmare fuel. I have to assume this dude have never been close to a real cat.

    • @trevorama
      @trevorama 1 day ago +2

      @cajampa Ha! Agreed.

    • @Darvid08
      @Darvid08 1 day ago

      You act like Sora is a piece of crap.

    • @hdfsgervda
      @hdfsgervda 1 day ago +4

      Sora isn't publicly available. It's vaporware and if ever released will be wrapped in layers of OpenAI censorship.
      There's also no indication of how cherry-picked the Sora examples were (nor of how cherry-picked the Pyramid Flow examples are).
      We also don't know how much compute each Sora example takes.
      Algorithmically this new approach may be equally strong as Sora, just they might not have the compute to make a bigger model

    • @xenn2996
      @xenn2996 1 day ago

      Then don’t watch this.

  • @Aerospace_Education
    @Aerospace_Education 1 day ago

    Looking forward to GTA VI, but we will probably be able to live in GTA VII and live entire secondary lives.

  • @vi6ddarkking
    @vi6ddarkking 2 days ago +5

    Well, the boys at Black Forest Labs will have quite the bar to reach once they release their AI video tools.
    Hopefully alongside all 3 versions of Flux 2.0.
    That's quite the one Two Punch.

    • @theAIsearch
      @theAIsearch  2 days ago +1

      all of this is just so exciting!

  • @Tobi_A
    @Tobi_A 1 day ago

    By the end of the year this thing is gonna be crazy

  • @pretoasted
    @pretoasted 2 days ago +1

    It's not bad at all, but still rough around the edges/details. The general concept is communicated clearly, it's just the details that need some work.
    Very happy about it being open source; Now it's your turn Meta/Llama ;)

  • @timothycook
    @timothycook 2 days ago +2

    I wonder if one of the Apple M3 Max chips with 128GB VRAM would run this?

  • @MabelYolanda-c9i
    @MabelYolanda-c9i 1 day ago +1

    Free looks always better 😂

  • @hqcart1
    @hqcart1 2 days ago +12

    Why the chinese are open sourcing it???

    • @DjHazardous
      @DjHazardous 2 days ago

      *I wouldn't be surprised if its to collect data since well a war with them is on the horizon*

    • @TheNjordy
      @TheNjordy 1 day ago +2

      Because of communism :)

    • @AnimagicToons
      @AnimagicToons 1 day ago +1

      @TheNjordy They are not as greedy as the Americans and smarter.

    • @frankstrawnation
      @frankstrawnation 1 day ago +1

      @AnimagicToons That's not the case for sure.

    • @MartinZanichelli
      @MartinZanichelli 1 day ago +1

      Because they are smart and think long term. They are better because they have no mixture, no impureness. 🙋🙋‍♂🙋‍♀✋

  • @jonmichaelgalindo
    @jonmichaelgalindo 1 day ago +1

    Hyped! DiTs quantize great, so the FP4 version should fit in 26/4 or about 8GB of VRAM. 😊🎉
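
The estimate in the comment above amounts to scaling memory by the ratio of bits per weight. A minimal sketch, assuming the 26 GB figure refers to 16-bit weights (the comment's division by 4 implies this, but the thread does not confirm it) and ignoring activations, the VAE, the text encoder and any quantization overhead, which do not shrink by the same factor. The function name is hypothetical.

```python
def scaled_memory_gb(baseline_gb: float, baseline_bits: int, target_bits: int) -> float:
    """Scale a memory footprint linearly with the number of bits per weight."""
    return baseline_gb * target_bits / baseline_bits


BASELINE_GB = 26  # figure quoted in the thread; assumed here to be 16-bit weights

fp8_gb = scaled_memory_gb(BASELINE_GB, baseline_bits=16, target_bits=8)
fp4_gb = scaled_memory_gb(BASELINE_GB, baseline_bits=16, target_bits=4)

print(f"FP8 estimate: {fp8_gb} GB")
print(f"FP4 estimate: {fp4_gb} GB")
```

The weights-only estimate for 4-bit comes to 6.5 GB, so the comment's "about 8GB" leaves some headroom for whatever is not quantized.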

  • @joels7605
    @joels7605 1 day ago +1

    Not Sora level, but still veeeeerry cool.

  • @mainaccount888
    @mainaccount888 2 days ago +1

    TBH most of the non-cherry picked outputs I've seen have some pretty bad decoherence, artifacts, and blending

  • @holdthetruthhostage
    @holdthetruthhostage 1 day ago +1

    Hmm open Source oh man it has begun

  • @antoniok.4160
    @antoniok.4160 1 day ago +1

    Why are you showing us something that we can‘t even run locally? What‘s the point?

  • @renovacio5847
    @renovacio5847 1 day ago

    "When open source catches up"

  • @Sujal-ow7cj
    @Sujal-ow7cj 1 day ago

    Waiting for meta video generator to be open source

  • @Selyan_
    @Selyan_ 1 hour ago

    Doesn't work as advertised. The videos I've generated don't really make sense. Anyone with a solution? I'm running it on a 3090.

  • @JuliaGuimaraes992
    @JuliaGuimaraes992 1 day ago

    OpenAI with Sora showcase videos: Do you want it? Do you want it? 🤭🤭 * never releases it *
    That one chinese: 🗿

  • @Charles-Darwin
    @Charles-Darwin 1 day ago

    Test driving NotebookLM to do your voiceover?
    And how many hours of barbeque video would it take to train a model to output barbeque with this much freaking fidelity? I mean I can literally taste the peppers and shicken

  • @daoyuzhang1648
    @daoyuzhang1648 1 day ago

    5B/flux “1.1” model release date?

  • @Flightman453
    @Flightman453 17 hours ago

    Am I tripping? What are these comments lmao. Runway and Minimax have far surpassed Sora for a while now. Minimax, especially with its new IMG to Video tool, is by far the best; Sora isn't close. Why do you keep talking about some video generator that still isn't out when there have been like 5-6 different ones released since then?

  • @meadbrow8479
    @meadbrow8479 1 day ago

    All it took was someone with brains and another game changing of A.I industry falls in the hands of the people.

  • @jovanniagara
    @jovanniagara 2 days ago +6

    can we use it locally?

    • @theAIsearch
      @theAIsearch  2 days ago +3

      yes (if u have enough vram)

    • @ielohim2423
      @ielohim2423 2 days ago

      Yup. I'm installing it with Pinokio > Gepeto right now

  • @Comic_Book_Creator
    @Comic_Book_Creator 1 day ago

    yes, nice, but how we test this?

  • @xuimod
    @xuimod 2 days ago +2

    The results are pretty good, but there are significant errors. For example, in the video of the astronaut, his eyes are messed up. And in the video of the cat waking up demanding breakfast, the cat's mouth is a bit deformed.

    • @theAIsearch
      @theAIsearch  2 days ago

      thanks for sharing!

    • @AlbertHardyJr
      @AlbertHardyJr 1 day ago

      And I can't tell that it's a steam train. Looks more like several flat cars followed by a pair of diesels.

  • @Steger13
    @Steger13 1 day ago

    I don't think sora ever existed lol now we should compare things to kling no more sora 😅

  • @Elefantoche
    @Elefantoche 2 days ago +1

    It would be interesting to translate a short tale into a sequence of clips using this

  • @fitnesstips4442
    @fitnesstips4442 2 days ago

    So, it's not available to try online, right?

    • @theAIsearch
      @theAIsearch  2 days ago

      they just added a hf space: huggingface.co/spaces/Pyramid-Flow/pyramid-flow
      I got 1 free video from it. 3s long

  • @kingKai2022
    @kingKai2022 1 day ago

    😅 Why mention Colab and then delete the comment? It works on Colab, but image-to-video is just over 40gb so an A100 won't do it.

    • @theAIsearch
      @theAIsearch  1 day ago

      cool. how many vids could you make in colab before the limit is exceeded?

    • @kingKai2022
      @kingKai2022 1 day ago

      @theAIsearch there is no limit.

  • @christophermoonlightproduction

    This is (again) one of the game-changers I've been waiting for. I have 32gigs of RAM, but this kind of install is beyond me, at the moment. I've already fbared my main drive with improper installs so I'm going to spend some time straightening that out and trying to learn a few more things before I dive this deep. Still, this is exciting and I can't wait for the updates. The future looks bright.

  • @AntonioSorrentini
    @AntonioSorrentini 1 day ago

    What do I think about this? I think all the good in the world! Long live open source.

  • @ROBOTRIX_eu
    @ROBOTRIX_eu 2 days ago

  • @Mfrt-e7n
    @Mfrt-e7n 2 days ago

    will rtx 4060 ti 16 gb be good enough for this?
    you think that card is good enough for ai generation and voice changers?

    • @theAIsearch
      @theAIsearch  2 days ago

      nope, not for now. i also have 16g
      this is good for images and voice though.

    • @Mfrt-e7n
      @Mfrt-e7n 2 days ago +1

      @theAIsearch I see. I will need something stronger. Can I get it to work despite being slower? Or at least can I use the image-to-video generator?

    • @theAIsearch
      @theAIsearch  1 day ago +1

      @Mfrt-e7n they will improve it for lower vram. gotta wait a few days hopefully

    • @Mfrt-e7n
      @Mfrt-e7n 1 day ago

      @theAIsearch thank you.
      let's hope they'll optimize it enough for 16 gigs at least lol

  • @todaychange5-7783
    @todaychange5-7783 2 days ago +2

    Sora is pathetic. They really thought they did something in February but showing off their THEN boom🎉 Runway, Kling, Hailuo and Pikalabs AND with Meta gen coming up, they all put Sora in hiding😂

    • @theAIsearch
      @theAIsearch  1 day ago

      yep

    • @hdfsgervda
      @hdfsgervda 1 day ago +1

      Give OpenAI a break. The censorship and political correctness filters won't code themselves

    • @Diogo85
      @Diogo85 1 day ago

      No. It's not pathetic.

  • @lasail6312
    @lasail6312 1 day ago +1

    Is it censored in any way? Can I generate hardcore waifus in action?

  • @amaesaeki7436
    @amaesaeki7436 2 days ago

    RTX 5090 32 GB will run it just fine (you just need to pay 2500+$ for it first ^^)

    • @theAIsearch
      @theAIsearch  2 days ago

      alright, this is what i'll save up for

  • @PantherDawg
    @PantherDawg 2 days ago

    Turn captions on?

  • @MabelYolanda-c9i
    @MabelYolanda-c9i 1 day ago +1

    Thanks for the video, thanks for sharing!

  • @tarcus6074
    @tarcus6074 1 day ago

    It's nowhere near Sora or Meta's video generator.

  • @Flaky111
    @Flaky111 2 days ago +1

    So how can I use it exactly?

    • @theAIsearch
      @theAIsearch  2 days ago +1

      they just added a hf space. literally just now: huggingface.co/spaces/Pyramid-Flow/pyramid-flow

    • @Noahperaudon
      @Noahperaudon 2 days ago +1

      @theAIsearch okay, but it’s not free? Hugging Face has a usage limit

    • @anubisai
      @anubisai 2 days ago

      Yaaay

    • @theAIsearch
      @theAIsearch  2 days ago

      @Noahperaudon i got 2 videos out of it before my free limit was exceeded

    • @Noahperaudon
      @Noahperaudon 2 days ago +1

      @theAIsearch Yes, but it’s a shame. Isn’t there another way to use it?

  • @bloodust7356
    @bloodust7356 1 day ago

    8:40.
    It's George.
    George Bush

  • @jolyonn5619
    @jolyonn5619 1 day ago

    should be able to run it locally using runpod, will try it out now

  • @elyakimlev
    @elyakimlev 1 day ago

    I have 2x RTX 3090, so I can probably run it on my PC in 768p. But I'm hesitant to install anything approved by the CCP on my PC. Kling as a web service is one thing, but this... I'll just wait for BFL to release their own model.

    • @theAIsearch
      @theAIsearch  1 day ago

      looks like i needa start stacking GPUs. one is not enough

    • @elyakimlev
      @elyakimlev 1 day ago

      @theAIsearch don't forget to get a good power supply (at least 1400w)

  • @XDgamer1
    @XDgamer1 2 days ago +1

    i think it's good for landscape videos

  • @EastEuropean_1938
    @EastEuropean_1938 2 days ago

    Could you do a video on how to do song covers etc. on mobile (Android and iOS), ideally for free? My PC broke, and that way you could also do it on the go outside the house. If you do it, thanks :)
    From Poland 🇵🇱

  • @MichaelSuperbacker
    @MichaelSuperbacker 2 days ago +1

    Hello

    • @theAIsearch
      @theAIsearch  2 days ago +1

      yo!

    • @darwish..
      @darwish.. 2 days ago +1

      yoo it's the legendary michael superbacker, didn't expect to see you here.

  • @matten_zero
    @matten_zero 2 days ago

    Replicate gonna be making beaucoup dollars

  • @itested1
    @itested1 1 day ago +3

    Very similar to LSD visuals :D

  • @Dr.Computer365
    @Dr.Computer365 1 day ago

    my 1 tb is full of ai help me!

  • @petergedd9330
    @petergedd9330 1 day ago

    Thank you.

  • @Smabverse
    @Smabverse 2 days ago +1

    Yo

  • @bravo1oh1
    @bravo1oh1 2 days ago

    Can it do nsfw

  • @Brandie-Michelle
    @Brandie-Michelle 2 days ago

    Yay !!!

  • @YeNLol10
    @YeNLol10 2 days ago

    Really good

  • @Web3V
    @Web3V 10 hours ago

    All looks fake

  • @SimpsOfHumanProgress
    @SimpsOfHumanProgress 2 days ago

    Awesome

  • @wonder111
    @wonder111 2 days ago

    Too many ads. Your channel is just not worth it.

  • @deepdive-bd
    @deepdive-bd 1 day ago

    Bad video quality.

  • @OfficialGundem
    @OfficialGundem 1 day ago

    It still looks like hot garbage, not really useful for anything yet unfortunately

  • @Anatolian_lovers
    @Anatolian_lovers 2 days ago

    First comment

  • @Too__bbb
    @Too__bbb 2 days ago

    third