Stable Diffusion 3 IS FINALLY HERE!

Поділитися
Вставка
  • Опубліковано 18 гру 2024

КОМЕНТАРІ • 277

  • @MuckoMan
    @MuckoMan 6 місяців тому +32

    So much for the early release email I was supposed to get when SD3 came out. Thanks Sebastian!

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +6

      I think I beat the email by 30 minutes, give or take. Happy to help!

  • @MrDvneil
    @MrDvneil 6 місяців тому +35

    some sd1.5 paint/drawing styles, can do landscapes with a lot more detail per resolution than current sdxl models, at the end you use them all, for the parts they excel.
    i suppose sd3 will take some months to lift off, see how people use them and some good models appear, maybe in 6 months, will see.

    • @JustArtsCreations
      @JustArtsCreations 6 місяців тому +4

      Yeah I agree. I see far too many people comparing sd3 with fine tuned models like its a fair comparison. (I like how Sebastian compared to base models here. Much better IMO
      Will take time to see the true strength of the new stuff )much the same happened with SD1.5 too)
      Great comment yo!

    • @Eleganttf2
      @Eleganttf2 6 місяців тому +2

      ​@@JustArtsCreations i agree just like how SDXL first came out its rough especially lots of great 1.5 fine tuned models out there at the time

    • @szach-i-mat
      @szach-i-mat 5 місяців тому

      But problem is with license - no commercial license in version 3

    • @JustArtsCreations
      @JustArtsCreations 5 місяців тому +1

      @@szach-i-mat you must have missed the new license the other day. its exactly the same as the other models now.

  • @mauriciolee7349
    @mauriciolee7349 5 місяців тому +1

    Thank Sebastian for comparing Stable Diffusion, Midjourney and Dalle3 in such details. Your video helps me a lot in making an informed decision of selecting the app that meets my needs.

  • @mastertouchMT
    @mastertouchMT 6 місяців тому +15

    Great introduction. Cant wait to see what the community models bring to the table now

  • @jameshughes3014
    @jameshughes3014 6 місяців тому +86

    That mid journey character cracked me up.

    • @skycladsquirrel
      @skycladsquirrel 6 місяців тому +7

      Love his cinematic lighting! lol

    • @goodie2shoes
      @goodie2shoes 6 місяців тому +3

      he needs his own show

    • @Chyrre
      @Chyrre 6 місяців тому +3

      "Heay guuys i can doo perrtty imagess"

    • @fly_goku
      @fly_goku 4 місяці тому

      rofl *_*

    • @Catdevzsh01
      @Catdevzsh01 4 місяці тому

      great now do it for flux

  • @BoyanOrion
    @BoyanOrion 6 місяців тому +3

    Thank you. I tried it and it works really well. Waiting on A1111 to do a patch update so we can also test the models there as well.

  • @JustArtsCreations
    @JustArtsCreations 6 місяців тому +5

    Running the medium version on my 1070 just fine here. Love it!
    By the way you were by far the first to upload a video about this so ty

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +1

      You're welcome! How's the speed?

    • @JustArtsCreations
      @JustArtsCreations 6 місяців тому +1

      @@sebastiankamph Its not actually all that bad the step count is 9 seconds on average for me so really not far off from SDXL

    • @azuki2919
      @azuki2919 6 місяців тому

      @@JustArtsCreations Running on 1070ti pretty much the same

    • @Eleganttf2
      @Eleganttf2 6 місяців тому

      ​@@JustArtsCreations what's the Vram usage while generating with SD3 2gb if i may ask can you check it while its generating in task manager ? Thanks!

    • @JustArtsCreations
      @JustArtsCreations 6 місяців тому

      @@Eleganttf2 Oh its maxed it out at 8 GB i should also say other than my GPU at 1070 its a beast of a machine so the rest of the system is doing alot of heavy lifting (i9 1490k, 64 gb ram) but VRAM 8 GB

  • @AdonhiramSD
    @AdonhiramSD 6 місяців тому +1

    I have been watching your videos since I discovered Stable Diffusion in january, many thanks for all the info you are bringing to the community. The first part of this video with the role-playing was awesome and so funny, well done :) On reddit and on the UD Discord, people seem quite mad at Stability AI though, regarding the performance of SD3 and its absurde license.

  • @TheSnakecarver
    @TheSnakecarver 6 місяців тому +40

    2B or not 2B?... that is the question.

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +3

      Best comment

    • @Elwaves2925
      @Elwaves2925 6 місяців тому

      Damn, I just posted that myself. I guess I should have checked first. 🙂

    • @SkullModder
      @SkullModder 6 місяців тому +5

      no 2b nier :( gotta stick to pony xl for that

  • @peterr6595
    @peterr6595 6 місяців тому +2

    Sebastian,
    SD3 is GREAT …for text interpretation. I use AI to build graphics/posters for my screenplays.
    I need to describe more than one person in my prompt. In SDXL clothing A would ‘bleed’ in to clothing B. Characters were always wearing the other’s clothing. I had to submit multiple prompts - I mean a lot - just to get close.
    SD3 fixes that. Last night I started resubmitting all my graphics for my show bible. Accurate characters every time.

  • @TheCynicalNihilist
    @TheCynicalNihilist 6 місяців тому +11

    wait, is it only for Comfy? is there no automatic1111 version yet? im running forge. how do i install??

    • @IceMetalPunk
      @IceMetalPunk 6 місяців тому

      There's a GitHub issue opened on the WebUI Forge repo for SD3 support. This is the last comment posted 8 hours ago as of the time I write this reply: "@huchenlei could help with this on the dev branch, when he'll have time." ~dan4ik94
      So people are working on it, but it might take some time.

    • @Eleganttf2
      @Eleganttf2 6 місяців тому

      Lol same using Forge here

    • @149315Nico
      @149315Nico 6 місяців тому

      Comfys almost always quickest to adapt changes and the devs working on sd3 are using it themselves to test their shit, support was out before sd dropped, probably lots of people working on automatic support rn, forge is always really slow in updating but will probably deliver a great efficient implementation if they do someday.
      if you can’t wait, you’ll have to deal with the spaghetti I guess

  • @nunads
    @nunads 6 місяців тому +47

    Kudos on the roleplaying the different models scene! That was great, definately left me laughing 😆

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +8

      Glad you enjoyed it! Wanted to try something different and I had fun doing it :D

    • @bankenichi
      @bankenichi 6 місяців тому

      The midjourney bit at the end cracked me up xd

  • @justinwhite2725
    @justinwhite2725 6 місяців тому +3

    Missing from the tests: actually trying the examples from the resesrch paper in SD3 to see if it can actually do those things.

  • @richardadonnell
    @richardadonnell 6 місяців тому +3

    🎯 Key points for quick navigation:
    00:00 *🆕 Introduction to Stable Diffusion 3*
    - Overview of Stable Diffusion 3 release,
    - Instructions for downloading and starting usage,
    - Comparison of the 2B model with the 8B model.
    02:00 *🛠️ Key Features and Enhancements*
    - Enhanced text prompt understanding and resolution capabilities,
    - Introduction of the 16-channel VAE for better detail retention,
    - Compatibility with various image sizes.
    04:00 *💻 Performance and Requirements*
    - Differences in resource requirements between 2B and 8B models,
    - Benefits of the 2B model for most users,
    - Explanation of diminishing returns with higher capacity models.
    06:00 *📊 Research Insights and Comparisons*
    - Summary of research findings on improved autoencoders,
    - Comparison of FID scores across different channel configurations,
    - Examination of perceptual similarity metrics.
    08:00 *🖼️ Image Quality and Generation Comparisons*
    - Visual comparisons between SDXL, MidJourney, and DALL-E models,
    - Discussion on text rendering and image detail differences,
    - Analysis of various prompts and their outcomes.
    12:00 *📥 Downloading and Using Stable Diffusion 3*
    - Steps for downloading models and setting up,
    - Overview of different download options and encoders,
    - Initial generation examples and settings configuration.
    15:00 *🎮 Final Thoughts and Next Steps*
    - Encouragement to start using Stable Diffusion 3,
    - Mention of future content and continued exploration,
    - Closing remarks and invitation for viewer feedback.
    Made with HARPA AI

  • @MikkoRantalainen
    @MikkoRantalainen 6 місяців тому +1

    I guess it depends on what kinds of prompts you're looking for. If you go with more accurate description such as "transparent acrylic pig statue, a small opaque pig statue inside the bigger arcylic statue" you'll get much better results with Dall-E.
    A very short description is more like "I'm feeling lucky".

  • @Airbender131090
    @Airbender131090 6 місяців тому +43

    3.0 is horrible. Its broken. Censored. No comercial License. Noone wil fine-tune it. This is 2.0 all over again. All they shown on twoter was 8b. Bot this one. So sad :(((

    • @eliparrish9145
      @eliparrish9145 6 місяців тому +5

      Spread the word about what is in the terms of agreement. They are insane.

    • @Airbender131090
      @Airbender131090 6 місяців тому

      @@eliparrish9145not obvious for now. We need to wait for their answer.

    • @thinghy3
      @thinghy3 6 місяців тому +6

      xl is the only way still.

    • @szach-i-mat
      @szach-i-mat 5 місяців тому

      ​@@thinghy3 Exacly 🎉

  • @phoenixfire6559
    @phoenixfire6559 6 місяців тому +2

    The problem with SD3 is you can't make LoRA's or checkpoints with it for commercial purposes, even with the paid Creator's licence - the ToC for enterprise is not visible.
    Here's the relevant part of the Creator's License:
    In Section 2(b)(ii), except as expressly permitted in the agreement, You cannot "modify or prepare any derivative work based upon the Stability Technology or any component thereof".
    Additionally, the definition of "Derivative Work(s)" in Section 1(d) includes "any modifications to a Core Model, and any other model created which is based on or derived from a Core Model or a Core Model's Output(s)."
    So based on these provisions, you are not allowed to train or refine Stability's existing models to create derivative works.
    So let's say someone puts a checkpoint or LoRA on CivitAi, every single one will be for non-commercial use only - even if you have a paid licence. Afaik, nobody uses the base models so unless something changes, I don't see much use for SD3 even if we get the 8b param model.

  • @thejbot
    @thejbot 6 місяців тому +1

    I appreciate the download information at the end of the video.

    • @thejbot
      @thejbot 6 місяців тому

      rather than up top.

  • @ScottLahteine
    @ScottLahteine 6 місяців тому +3

    A little off-topic, but when and where do we still find SD 2.0 and SD 2.1 useful? Are there specific use-cases where one of these is a better choice than 1.5 or SDXL? As mentioned, 1.5 can still accomplish a lot, has great speed using LCM, uses fewer resources, and has more complete tools and models. Seems like the most facile workflow today would use SD 1.5 for speedy exploration and near-realtime painting, then apply a newer larger model for image refinement. But maybe the 2.x models have specific talents that make them worth including…?

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +1

      Sadly not really. They're now dead. What they could do better than 1.5, sdxl now does (and now sd3).

  • @mistercapitale
    @mistercapitale 6 місяців тому +3

    Love the skit. Great work.

  • @Airbender131090
    @Airbender131090 6 місяців тому +2

    Have you seen cascade finetunes? No? Wanna know why? No comercial license. Pony Guy already said there will never be Pony 3.0 ( 2b) with this license.

  • @zephilde
    @zephilde 6 місяців тому +1

    I liked the fact you explained a bit the new tech behind the version number, but it could be interesting (in future though) to get info on the 3 clips, wtf is t5, model files specificity and more technical stuff than just how to install (there's READMEs).
    Hope you have to try it a bit before 😇

  • @GAMINGGEEKzzz
    @GAMINGGEEKzzz 6 місяців тому +1

    lovely cant wait to try it out thank you

  • @Maisonier
    @Maisonier 6 місяців тому +1

    Excellent video, what great news! Now I'm going to wait for you to release a video on how to make a LORA with SD3 so I can create my LinkedIn profile picture 😅

  • @neuraldee
    @neuraldee 6 місяців тому +2

    This new format is fire!

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +2

      Glad you like it! Do you think actor Seb should make a comeback for future videos?

    • @neuraldee
      @neuraldee 6 місяців тому

      @@sebastiankamph Definitely! 👌🏼

  • @matthallett4126
    @matthallett4126 6 місяців тому +1

    I just generated some images locally using the same prompt set I've used to test since 1.5 and they're better than even the SD3 API version. Beautiful lighting, better faces, but hands are still a problem. I just posted my results on the facebook SD page.

  • @peacefusion
    @peacefusion 6 місяців тому +2

    That SD3 license is a slap to the face to all those that supported SD for the open source community. Sd3 or Adobe? I cant tell them apart now. The license is a bill and a cuff combo

  • @udonpraguypanya2992
    @udonpraguypanya2992 6 місяців тому +1

    Can we use in normal A1111, I don't really comfortable with ComfiUi at all...

    • @Eleganttf2
      @Eleganttf2 6 місяців тому +1

      Yeah same, just wait for A1111 to patch it

  • @yu-gg
    @yu-gg 6 місяців тому

    Lets goooo I waited for your video!! thanks bro :D

  • @TheBagOfHolding
    @TheBagOfHolding 6 місяців тому +2

    Great video. I like the skit.

  • @maikonpagani1222
    @maikonpagani1222 6 місяців тому +1

    I'm getting "Error while deserializing header: HeaderTooSmall" with any of the models, do you know if I have to update something? (RTX 3080 10Gb). Thanks!

    • @AerisRG
      @AerisRG 6 місяців тому

      Try other UI for SD3, it called StableSwampUI or smt like that

  • @GAMINGGEEKzzz
    @GAMINGGEEKzzz 6 місяців тому +3

    will you make video for automatic 1111 as well in the future ?

    • @sebastiankamph
      @sebastiankamph  6 місяців тому

      Once a1111 updates you can just drop the file in the /models/Stable-diffusion folder

  • @swannschilling474
    @swannschilling474 6 місяців тому

    Thanks a lot!! Cannot wait to get my hands on it! 😊

  • @A.polon.i.a
    @A.polon.i.a 6 місяців тому +1

    Hey Sebastian, what's the best version to download for general use please.... normal, with clips, with clips & t5? I'm still fairly new so I'd appreciate your advice.

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +1

      Probably the normal ones and then use clips separately, and running CLIP only (no t5). So basically what I did in Swarm at the end.

    • @A.polon.i.a
      @A.polon.i.a 6 місяців тому

      @@sebastiankamph Thank you... so the 4gb model then?

  • @lifemarketing9876
    @lifemarketing9876 6 місяців тому +1

    Woah SD3 is here! Heck yes

  • @sirmeon1231
    @sirmeon1231 6 місяців тому +21

    We can all hear the cooling fans of GPUs in the whole world run wild!
    Happy generating, folks!

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +4

      My little office is already much warmer. Yours too?

    • @joechip4822
      @joechip4822 6 місяців тому +2

      SD 3 has now officially made all concerns about climate change obsolete - Chapeau :-)

    • @sirmeon1231
      @sirmeon1231 6 місяців тому

      @@sebastiankamph Absolutely! I am experimenting with it as much as I can without things like pcm, lcm, hyper, turbo etc. out yet...

  • @JustArtsCreations
    @JustArtsCreations 6 місяців тому

    Hey at 14:07 how did you connect the two clips without having to go back to the main node? like was there a hotkey? That looks handy

    • @sebastiankamph
      @sebastiankamph  6 місяців тому

      I think that was just an accidental cut in edit. I dragged it two times. But that feature would be fantastic, maybe it exists.

    • @JustArtsCreations
      @JustArtsCreations 6 місяців тому

      @@sebastiankamph oh okay got ya haha that makes sense what perfect timing ! Thanks though for the reply eh

  • @bataltsev
    @bataltsev 6 місяців тому +3

    Hi there. Is Stable Diffusion 3 free and with private generations? Can it be used for creating stock images?

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +4

      Hey, yes, this is correct! If you run it locally it's free

    • @bataltsev
      @bataltsev 6 місяців тому +1

      @@sebastiankamph great! All that's left to do is wait for step-by-step instructions from the good folks on how to install SD3 on Mac or Windows :)

    • @tomschuelke7955
      @tomschuelke7955 6 місяців тому

      @@sebastiankamph Not quite understanding this... if i install it localy on my company computer... use it for example for architectural images.. is it free? or is it only free if i use it uncomercialy

    • @PhotoshopArt
      @PhotoshopArt 6 місяців тому

      @@tomschuelke7955 It's free for non commercial use. Otherwise u need to buy a license.

    • @phoenixfire6559
      @phoenixfire6559 6 місяців тому

      @@tomschuelke7955 It's free for non-commercial use only. For commercial use, you need a licence, the cheapest of which limits you to 6000 images/ month and less than 1m revenue. If you want completely free, use SDXL or SD 1.5.

  • @Avenger222
    @Avenger222 6 місяців тому +10

    I'm so sad... They completely gutted anatomy because they repeated their mistake from 2.x with an overagressive filter. At least it can do landscapes...

    • @fabgeb667
      @fabgeb667 6 місяців тому +1

      vote for leftism, get this

    • @Avenger222
      @Avenger222 6 місяців тому +2

      @@fabgeb667 Bro... this is non-partisan around the entire world. This has nothing to do with "leftism". Both the left and right are in favor of regulating AI around the entire world. Who do you think nudity offends? 🤣

    • @beetwing
      @beetwing 5 місяців тому

      @@Avenger222 censorship means the end of the the free world, which has started since couple of years.

  • @divide0011
    @divide0011 6 місяців тому +1

    I've downloaded the file and put it in the models folder but its not working when im selecting the checkpoint

    • @sebastiankamph
      @sebastiankamph  6 місяців тому

      And the clips in the clips folder? (Unless you use swarm). What's happening?

  • @MilesBellas
    @MilesBellas 6 місяців тому

    Hooray !
    😊👍😁🙂
    Emad, Robin Rombach, Andreas Blattmann, Patrick Esser and Dominik Lorenz & team = Amazing!

  • @sub-jec-tiv
    @sub-jec-tiv 5 місяців тому +2

    Finally here, and horrendous corporate garbage license! Go away SD3!

  • @Yoshi92
    @Yoshi92 6 місяців тому +4

    I really liked the comedy part as a good explanation of the differences. Thank you for this video! :)

  • @happycollapse6348
    @happycollapse6348 6 місяців тому +2

    My English is bad I may have missed something but I did not understand the point of comparing the images of SDXL, Midjourney and Dall-E, knowing that the video talks about SD3. . Why not compare with SD3 ? I don't understand this video.

    • @happycollapse6348
      @happycollapse6348 6 місяців тому

      But thanks for the good news

    • @tomschuelke7955
      @tomschuelke7955 6 місяців тому

      he showed SD3 images before from the 3D3 webpages, and afterwords tried the same prompts for the other models

    • @happycollapse6348
      @happycollapse6348 6 місяців тому

      @@tomschuelke7955 Oh ok thanks !

  • @stefanb.8462
    @stefanb.8462 6 місяців тому +1

    I installed SwarmUI to test SD3, but I just get terrible results with the same prompts that worked in older models. I guess there are some aspects in SwarmUI (I used ForgeUI previously) or in SD3 that I overlooked.

    • @BeastLT
      @BeastLT 6 місяців тому

      No.. the model is just dumpster fire. Just look at the r/StableDiffusion

    • @McxgamecasterYT
      @McxgamecasterYT 6 місяців тому

      Fr

  • @SnoochyB
    @SnoochyB 6 місяців тому +2

    LOL, that was a good laugh, you should do more stuff like that. 😀

  • @killergod1419
    @killergod1419 6 місяців тому +1

    you are unbeliable good actor...

  • @cihiris2206
    @cihiris2206 6 місяців тому

    Really excited! So many even better models to come for the community. Now how do I run this? :D

  • @MadazzaMusik
    @MadazzaMusik 6 місяців тому

    i been using sdxl and generating 1080x1280 fine no bad hands or weird things excellent quality too

  • @SiCSpiT1
    @SiCSpiT1 6 місяців тому

    Incase you're too afraid to download the 8B and 16B encoders, they both have a low VRAM mode that I've ran on a 3070 (8GB). I've 32GB of system memory that gets maxed out when loading the 16B encoder, so if you've less RAM you may not want to bother. They both produce similar images, at least in low VRAM mode, so I'd sick to the 8B encoder.

  • @CamoPeng
    @CamoPeng 6 місяців тому

    I'm new to this, i didn't understand what you did in the interface and how you're supposed to configure ComfyUI.
    I managed to follow the docs but I get the following error when trying to "Queue Prompt"
    AttributeError: 'NoneType' object has no attribute 'tokenize'

  • @ilyapo
    @ilyapo 6 місяців тому

    What is the name of the extension for comfyui that adds a performance monitor to this panel on the right?

  • @tails_the_god
    @tails_the_god 6 місяців тому

    hey do you know which EXACT version of stable diffusion perchance ai art plugin uses?

  • @aykayorg9236
    @aykayorg9236 6 місяців тому

    umm... you didn't mention how to get the nodes for these example workflows. Installing missing nodes via the manager doesn't seem to work. What did I miss here?

    • @sebastiankamph
      @sebastiankamph  6 місяців тому

      They're in Comfy by default. Just make sure you have the latest version.

  • @adamjenkins3065
    @adamjenkins3065 6 місяців тому

    found out about the release from your vid. Thanks!

  • @ai.research
    @ai.research 6 місяців тому +4

    I would like to point out, that the license for the SD3 is completely useless at its current state. This need to be sorted out before people will invest ANY effort into SD3. This is very, very dissapointing. Just an example exerpt from this mess: Creator License $20 per month, the number of Images generated is limited to 6,000/month. I wonder what Stability wants to achieve with it. How they even plan to control this? Too many nasty questions.

    • @peacefusion
      @peacefusion 6 місяців тому

      all while using the open source community as thier personal honeybees.

  • @moonstrobe
    @moonstrobe 6 місяців тому +1

    You should have done more comparisons with actual SD3 :)

  • @shifureisaikyou2055
    @shifureisaikyou2055 5 місяців тому

    What is safe? more accuratly was was unsafe about previous versions?

  • @midjourneyman
    @midjourneyman 6 місяців тому

    I am excited to give this a try but I am very looking forward to AD model! :D

    • @goodie2shoes
      @goodie2shoes 6 місяців тому

      you the guy in the video?

  • @guns1inger
    @guns1inger 6 місяців тому +12

    Running the Medium version on a 3060ti. 1024x1024 in under 25 seconds at 28 steps. Still does weird things with limbs . . . . . . .

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +4

      What was your speed with SDXL?

    • @AgustinCaniglia1992
      @AgustinCaniglia1992 6 місяців тому

      Why are they still releasing a +20 steps models when we have 5 step models

    • @godlesschannel7730
      @godlesschannel7730 6 місяців тому +4

      @@AgustinCaniglia1992 faster= usually worse quality and loras rarely works with them so less variety,less unique art

    • @AgustinCaniglia1992
      @AgustinCaniglia1992 6 місяців тому

      @@godlesschannel7730 i have been using dreamshape or reavix and juggernaut lighting versions and quality is amazing. Loras work good in my experience as well..

  • @ESan-lh6dq
    @ESan-lh6dq 6 місяців тому

    In my SD Models folder there is no Clips folder. Where do the 4 clip files go?

  • @jonathanedward5288
    @jonathanedward5288 6 місяців тому

    Can it be used with FORGE UI? I'm not familiar with comfy UI.

    • @daantilburg8354
      @daantilburg8354 6 місяців тому +1

      Not yet, I tried.. Will probably be updated soon

  • @brianjanssens8020
    @brianjanssens8020 6 місяців тому

    How do i get this for forge UI or A1111? Which files should i download?

    • @sebastiankamph
      @sebastiankamph  6 місяців тому

      Not available yet, needs an update from ui devs.

  • @tex1297
    @tex1297 6 місяців тому

    Sir you are the winner

  • @hjups
    @hjups 6 місяців тому

    You will probably have to use T5+CLIP to get the prompt adherence from the paper.

  • @Typiakk
    @Typiakk 6 місяців тому

    SD3 is a model, not an actual update of the Stable diffusion "itself" I'm so confused please someone explain me

    • @phoenixfire6559
      @phoenixfire6559 6 місяців тому +3

      Stability AI - The parent company
      Diffusion - an AI image learning technique which converts noise (nonsense dots) into a coherent image after several passes through its model.
      Stable Diffusion - The name of Stability AI's diffusion model.
      Dall E - the name of OpenAI (Chat GPT owner) diffusion model
      There have been three main releases of Stable Diffusion (there are actually more, but the following three are the most relevant)
      SD 1.5 - the start of it becoming mainstream. Many derivative models are based on it (512x512 pixels, ~900M param model)
      SDXL - an improvement in text generation, image quality and resolution (1024x1024 pixels, ~3bn+ param model)
      SD 3 - there are multiple models, the one released today is SD3 medium because it has 2b params (the API uses the larger 8b param model and their research paper concentrates on the 8bn param model). The medium model is less than SDXL but it should generate similar quality images because of better training and a better architecture. The main improvement in the SD3 models is prompt adherence i.e. it will draw more complicated prompts accurately (because it has a better text encoder - these are the things which translate your prompt into something the model can understand) and it has the best text generation.

  • @spawnv2
    @spawnv2 6 місяців тому

    try comparing between models that they benchmark themselves in their paper. SD3, dalle3 and Ideogram, specially in typography ideogram 1 is king. (page 10)

  • @jopansmark
    @jopansmark 6 місяців тому

    StabilityAI is so back!

  • @ESFAndy011
    @ESFAndy011 6 місяців тому +1

    The Dall-E pig-inside-a-pig generations remind me of that jontron episode where he checks out cursed Frozen flash games (Disney bootlegs episode for anyone who's curious).

  • @Michael-gt4mv
    @Michael-gt4mv 6 місяців тому

    Should I move away from Automatic1111 and Forge? I noticed he didn't show either in his examples.

    • @Eleganttf2
      @Eleganttf2 6 місяців тому

      Just wait for the update for em

  • @LegendD112
    @LegendD112 6 місяців тому

    How can I have the modelsamplingS3D model, I don't have it 🥺

  • @hotlineoperator
    @hotlineoperator 6 місяців тому

    Was comparsion images made with SD3 or SDXL ?

  • @quantumangel
    @quantumangel 6 місяців тому

    Wooooo!, It's here!

  • @Pawel_Mrozek
    @Pawel_Mrozek 6 місяців тому

    I checked this four prompt examples in Ideogram and it smashed all four ot them. Moreover the frog and the pretzel it has done better and more accurate than Dalle or SD3.

  • @Rasukix
    @Rasukix 6 місяців тому

    what file should we be downloading for a1111?

    • @TheViktorofgilead
      @TheViktorofgilead 6 місяців тому +1

      Not sure as A1111 is not yet supported.

    • @PhotoshopArt
      @PhotoshopArt 6 місяців тому

      Its not supported in a1111 yet. Wait for a1111 update.

  • @CarloCamaso-lv1hp
    @CarloCamaso-lv1hp 6 місяців тому

    Can i use SD 1 on iPad 9,.?

    • @sebastiankamph
      @sebastiankamph  6 місяців тому

      Not as a local install. But you can use cloud solutions like ThinkDiffusion. Or install on a local pc and use that ip.

  • @corruptedsmurf260
    @corruptedsmurf260 6 місяців тому

    Does Sebastian mostly Comfy now? Haven't seen much Auto1111 lately.

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +1

      Not much happening with a1111

    • @149315Nico
      @149315Nico 6 місяців тому

      Comfy just allows so much more creative freedom in how you use sd
      Auto is basically doing the same few manual steps over and over again, maybe mixed up in a different order
      Comfy on the other hand is a source of unlimited content potential

  •  6 місяців тому

    Was expecting a tutorial on how to install and setup SD3.

    • @sebastiankamph
      @sebastiankamph  6 місяців тому

      See my "How to install comfy" in the description, then download the model from SD3 video and use the workflows provided.

  • @changtc8873
    @changtc8873 6 місяців тому

    After use SD3 should we delete SDXL?

    • @sebastiankamph
      @sebastiankamph  6 місяців тому

      No, you can keep it if you want. Up to you.

  • @streamtabulous
    @streamtabulous 6 місяців тому

    Request video onetrainer lora training for 8gig systems for settings for us potato users,
    include cmd torch install and versions of python nvidia drivers etc needed to run, i have tried lora is made but not working or traning correctly.

  • @zephilde
    @zephilde 6 місяців тому

    That MJ bleu/red light !! 🤣

  • @noobandfriends2420
    @noobandfriends2420 6 місяців тому +1

    10:50 All of them were wrong. A translucent pig inside another pig would just be a picture of a pig.

    • @govindb7837
      @govindb7837 6 місяців тому

      My man, read it again. The prompt is "Translucent pig. Inside is a smaller pig". None of them got it right but looks like all of them have better prompt adherence than you 😅, no offense of course ✌️

  • @xilix
    @xilix 6 місяців тому

    I can't wait to train this.

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +1

      What are you going to train it on first?

    • @goodie2shoes
      @goodie2shoes 6 місяців тому

      @@sebastiankamph bwoops?

    • @jasonhemphill8525
      @jasonhemphill8525 6 місяців тому

      @@goodie2shoesbobs and vagene

    • @xilix
      @xilix 6 місяців тому

      @@sebastiankamph People's faces and royalty free photography as well as my own. I do stable diffusion photoshoot packages and I have a feeling these extra layers are going to provide precisely the boost in flexibility I've been looking for.

    • @Airbender131090
      @Airbender131090 6 місяців тому

      @@xilixsory to disapoint. Ot sosnt work woth humans. They destroyed anatomy. Not sutavle for lora/dreambooth of humans

  • @dontrez8412
    @dontrez8412 6 місяців тому

    That kinda didn't age well.....but then again, in the near future, it may have..... Great video regardless. It still helped me understand the differences between the different models a little better.

  • @ngshighlites7507
    @ngshighlites7507 6 місяців тому

    Here’s a money idea! Just remember me whoever hits it big. Once AI can continuously generate hands it will be even more difficult to distinguish AI from non AI art. The idea make something that can analyze images in-depth and can easily tell the difference between the AI and real art.

  • @metanulski
    @metanulski 6 місяців тому

    Anyone know the Fooocus Version he is talking about?

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +2

      I know RuinedFooocus is working on it as we speak

  • @darklotusdx
    @darklotusdx 6 місяців тому

    Will this work in A1111?

  • @davidm8966
    @davidm8966 6 місяців тому +5

    SD3 is an absolute joke! :D
    even my standard pony models create better limbs.
    holy shit this release is so scuffed

  • @DisgustingJustinAD
    @DisgustingJustinAD 6 місяців тому +1

    I hear that it's free to download, but hearing many others saying it's going to cost $20 a month to use.

  • @Lv7-L30N
    @Lv7-L30N 6 місяців тому

    Thank you

  • @eriuz
    @eriuz 6 місяців тому

    works with zluda ?

  • @testales
    @testales 6 місяців тому

    Gave it a quick shot, worked out of the box to my suprise.But it has a hefty bias, not as extreme as the google one but there seems to be a strong preference for "people of color" . I also already got twisted limbs basically instantly, deformed hands and even people without face which my due to some censorship effect though. But we'll see what comes out of this. As long as there's no control net support its usefulness is limited at best anyway I'd say. Otherwise a 2B model also seems to be a good choice considering that's right in the middle between SD 1.5 with about 1B and SDXL with 3.5B.

  • @monamibob
    @monamibob 6 місяців тому +2

    a bit disapointing that the SD3 results miss the mark by so much and that you didn't include them in the side-by-side comparison. Also whats the point of snarky comments at MJ? it clearly had the best results in your own tests...

  • @CoconutPete
    @CoconutPete 6 місяців тому +1

    Taking a wild guess this won't work on A111

    • @sebastiankamph
      @sebastiankamph  6 місяців тому +2

      Sadly not yet as their last update was 2 weeks ago. But I'm sure it won't be too long.

    • @CoconutPete
      @CoconutPete 6 місяців тому +2

      @@sebastiankamph surprised I was able to get sd3 running on comfy with a GTX 1660 ti

  • @Bone-studio
    @Bone-studio 6 місяців тому

    Please, talk about the license and the limitations it suppose for any kind of use other than hobbyists, even for you as a youtuber.

  • @Skystunt123
    @Skystunt123 6 місяців тому

    10:55 you can see the Dall-E one also has a lof of diversity in the filter with the black wizard lol. Cesnorship and wokeness should also be taken into consideration

  • @lumasku
    @lumasku 6 місяців тому

    Do ppl still use fooocus?

  • @Sujal-ow7cj
    @Sujal-ow7cj 6 місяців тому

    Can't we run it on amd 😢

  • @TE-qu4jz
    @TE-qu4jz 6 місяців тому

    Hmm, how would you be able to see the translucent pig, if it were inside the smaller pig? ;)

  • @jamesclow108
    @jamesclow108 6 місяців тому

    Now......how do we train it?