How Stable Diffusion Works (AI Text To Image Explained)

Поділитися
Вставка
  • Опубліковано 13 січ 2025

КОМЕНТАРІ • 61

  • @justinwhite2725
    @justinwhite2725 Рік тому +2

    4:44 Midjourney does not use reactions to the images in production to train their model.
    It's a good example to explain it as a hypothetical, but it's untrue.

  • @websurferwizard
    @websurferwizard 9 місяців тому +3

    It's criminal that this only has 13k. Keep it up!!

  • @Pixelarter
    @Pixelarter Рік тому +16

    You should change the title, this is definitely not a "detailed explanation". It's more akin to a "summarized intuitive explanation".

    • @MistyB-yv1uw
      @MistyB-yv1uw 6 місяців тому

      Scott Morris works for the govt?? Lmao and Michael too?? And me?? Naw. I just call people like you out

  • @token4774
    @token4774 Рік тому +5

    I still don't understand how Stable Diffusion works, but now I know more. Maybe you can help me understand what's happening when I try to create some art: First, I upload an image to Stable Diffusion in the img2img tab and then I select Interrogate CLIP or Interrogate DeepBooru, then I copy/paste the prompt into txt2img -- Why don't I get an image that better resembles what I started with? How can I get better semblance to my original image? You seem to understand this stuff better than me, so maybe you can explore this in a future video. Thanks!

    • @allyourtechai
      @allyourtechai  Рік тому +6

      I will do a video on the subject. There are definitely some tricks to making it work and getting a decent result.

  • @RealmOfOk
    @RealmOfOk Рік тому +1

    It just clicked at 4:38 why midjourney and others are free to start, they need people to teach the system

  • @mihairusu
    @mihairusu Рік тому +4

    This was so informative! Thank you, love your videos!

    • @allyourtechai
      @allyourtechai  Рік тому +1

      Thank you so much, I really appreciate it!

  • @pranavshekhar9902
    @pranavshekhar9902 Рік тому +1

    Amazing explaination on such a short video !! Keep up the good work !!

  • @AllanGildea
    @AllanGildea 11 місяців тому +2

    Very well explained, thank you. And man, I love your studio! (D'oh - just noticed it is a fake background. Rather goes to your point).

  • @arturabizgeldin9890
    @arturabizgeldin9890 Рік тому +5

    such a good video, surpised to see so few likes. your explanation is great! since it works fine for a wider audience with minimal engineering or technical skills. please keep making the videos!

  • @oaahmed7515
    @oaahmed7515 Рік тому +2

    amazing. Thanks +wait for more ❤

  • @danilshubin5311
    @danilshubin5311 Рік тому +1

    after watching the video of the video, I still have questions. It turns out that we make Gaussian noise from the picture, and then we make noise back from the noise. But won't we be able to face the fact that the noise can be the same?

    • @allyourtechai
      @allyourtechai  Рік тому +1

      Pretty unlikely if you use a random seed to generate the noise, and you train it 1000 times per image. The odds of getting the same noise that many times are mathematically improbable.

  • @dpainter1526
    @dpainter1526 6 місяців тому

    But why is the "image" hidden in the "noise" to begin with? I kinda get what you're explaining, but I don't understand the starting point.

  • @hssp1534
    @hssp1534 Рік тому +2

    One basic question..What is the need to introduce noise in the first place?

    • @TheBigLeChowski
      @TheBigLeChowski 9 місяців тому +1

      The noise is the starting point when you reverse the diffusion process. It also provides randomness to the resulting image

    • @dpainter1526
      @dpainter1526 6 місяців тому

      OK, but I have the same issue; it sounds like the system requires one to 1) upload a bunch of images, 2) reduce those images to "noise" and then 3) use prompts to bring the images back....makes no sense

  • @stephanmodry1301
    @stephanmodry1301 Рік тому

    One of the best videos iv'e seen in a while. Thank you for taking the time and making such awesome content. Much appreciated.

  • @aldrinjenson
    @aldrinjenson Рік тому +1

    This was great. Thanks!

  • @iamritambhar
    @iamritambhar Рік тому +2

    Wow, such a great video man. Finally found the video that clearly explains how exactly images are made from text prompts. And the things you said in the end... yeah man... I agree with you. We should be careful on how to use these AI technologies.

  • @richie1027
    @richie1027 8 місяців тому

    Very well done. Many thanks

  • @omarei
    @omarei Рік тому +2

    Great video 👍 Subbed

  • @Howiefm28496
    @Howiefm28496 Рік тому

    So how does it know which pure noise image to use starting out with?

    • @allyourtechai
      @allyourtechai  Рік тому

      The software starts with a random number generator that is used as a seed to generate the noise.

    • @Howiefm28496
      @Howiefm28496 Рік тому

      @@allyourtechai let say the text prompt is “Rainnbow unicorn” . How does the process starts out ? Where does it get the noisy image of that in order to work back to the desired image?

  • @SHASHWATHPAIML--
    @SHASHWATHPAIML-- 11 місяців тому +1

    Great explanation!!

  • @alaad1009
    @alaad1009 11 місяців тому +1

    Awesome video !

  • @manimaran6582
    @manimaran6582 Рік тому +1

    Really awesome

  • @__-fi6xg
    @__-fi6xg Рік тому

    does it pull stuff only from the checkpoints used or also online?

    • @allyourtechai
      @allyourtechai  Рік тому

      You can define the source. In my video about how to “ai yourself”, I provided my own photos to train the model.

    • @krzysztofczarnecki8238
      @krzysztofczarnecki8238 Рік тому

      You can have a completely offline install, where you download the checkpoint and other files, run the Stable Diffusion server on your own computer and control it from the browser on that same computer. No one ever looks at what you generate or charges you for anything. And you can train your own checkpoints or embeddings locally, but that is really slow (several hours for like 10-50 images and a RTX2060).

    • @__-fi6xg
      @__-fi6xg Рік тому

      @@krzysztofczarnecki8238 i think thats what i got rn, its pretty cool running it locally. And yeah i pulled my internet plug and it was still able to draw somewhat accurate drawings of famous anime characters which is pretty awesome.

  • @trueintellect
    @trueintellect Рік тому +1

    It is an Erlenmeyer flask, not a beaker. ;)

  • @WifeWantsAWizard
    @WifeWantsAWizard Рік тому +1

    (9:08) Training the AI model with your custom data works better if you a) make sure each image is a square 512x512 pixels, and b) take the photos of your models specifically for this purpose in front of a solid color background. Also, I dare you to use "me from behind" in your prompts, as all of your photos appear to be selfies so it has no idea what the back of your head looks like.

  • @MistyB-yv1uw
    @MistyB-yv1uw 6 місяців тому

    Breeze?? That black guy at Walmart all crazy looking at me cause I didn't have on a mask - you know what's up don't ya?

  • @akila_the_third
    @akila_the_third Рік тому +2

    You made a strong point on the confusion between really and AI generated really. Not to be pessimistic, but this is a huge risk for humanity. I believe right from the start we should have regulatory institutions to force AI companies to put a disclaimer on any art or content that’s produced. Tools should be developed and make It available to people maybe through their phones, laptops, tv as an extension so they can clearly differentiate between both.
    With the consumption of content being already high for most people, these technologies can easily turn into tools of mass control if strong measures are not taken right from the start.

    • @allyourtechai
      @allyourtechai  Рік тому

      It’s something we need to all pay close attention to for sure.

    • @jimlthor
      @jimlthor Рік тому

      People will just remove those things.
      I'm sure something will happen, the govt will probably step in, and do something stupid because they're all old, ill-informed, and don't understand how e-mail even works
      Whatever they do will either be over the top, or a waste of time.
      I think most people already know fake images, video and audio are already circulating. People are already questioning anything they see, so I'd say awareness is already out there.
      We just have to hope that "trusted" mediums don't mislead people with fake stuff, and actually do a little research. Fortunately (and unfortunately), I think most Americans already don't trust the media as it is right now. Especially with all the law suits these companies have had to pay out over the last few years

  • @stevet.4820
    @stevet.4820 8 місяців тому

    I like my noise Gaazian

  • @MistyB-yv1uw
    @MistyB-yv1uw 6 місяців тому

    See. This is why my phone don't know what to do when we start stable diffusion
    Because the archives don't have this information
    It does,but it has to search in other places, and I don't know just how in the world I could do this on my phone??!! I don't think my " app" would have enough storage to do it. Or somethin somethin. Also,does my mother know her computer has a virus?? Just wondering if she knows that there has been a file downloaded to her computer and she might need to get it looked at by someone other than flight simulator or RAY saville. LMAO also,I guess American airlines don't allow this huh? Can we switch countries in tor and go to Singapore or something?? Lol wow. I'm not Anna btw. Y'all apparently have her in the pod...cast . This is how y'all make your dirty laundry money!! This right here!! Right?? Y'all do zoom calls during pandemics and get facial recognition and zoom call meetings and that's how you do your scams and make your dirty money. Wow
    Thanks for the info
    Did y'all do this to me????

  • @Adam-ui8iy
    @Adam-ui8iy Рік тому +1

    "my hope is that it brings us all closer together..." yyyeeaaaa....that's a no from me dawg

  • @airbawx
    @airbawx Рік тому +1

    Oh shit it's you brina 😂

  • @goghvonjohann2924
    @goghvonjohann2924 Рік тому

    You forget that this poses a huge problem for the legal system as well. Pictures or videos of you doing something are essentially worthless now given how easy it is to fake them.

  • @MatthewHolevinski
    @MatthewHolevinski Рік тому

    I have no idea who Drake is

  • @nienienie7567
    @nienienie7567 9 місяців тому

    jessus christ pliz mix yuour voice with some eq you have a terrible amount of sub bases (between 60 and 20 hz). Please ask your musician friend to show you how to do it bc it's unlistenable on many types of speakers

  • @gingercholo
    @gingercholo Рік тому +1

    Youre great