A GPT-3 for Images? Dall-E is the most impressive AI ever created!

  • Published Jan 6, 2021
  • DALL·E / Dall-E is a model based on GPT-3 but for generating images. In the realm of machine learning or AI, this has to be one of the most impressive models ever released. OpenAI again pushes the boundaries of what's possible.
    Support me on Patreon: www.patreon.com/user?u=25285137
    ML-Agents Discord Channel: / discord
    Keep in touch: / sebastianschuc7
    Original Article: openai.com/blog/dall-e/
    Music by Lemmino: soundcloud.com/lemmino/encoun...
  • Science & Technology

COMMENTS • 118

  • @randyjordan1320
    @randyjordan1320 3 years ago +68

    2:42 a green... what?

    • @SebastianSchuchmannAI
      @SebastianSchuchmannAI 3 years ago +14

      Oh lol :D

    • @akuma7616
      @akuma7616 3 years ago +4

      They're memeing us

    • @adamklam1
      @adamklam1 3 years ago +5

      i read it with a pause after red gloves - sounds like DALL-E has some opinions about penguins.

    • @randyjordan1320
      @randyjordan1320 3 years ago +1

      @adamklam1 lmao

    • @NicksStuff
      @NicksStuff 3 years ago +2

      So...the neural network didn't offer any good answer

  • @zarthy4169
    @zarthy4169 3 years ago +14

    3:57 For the one on the top left, the AI just said, “No, S P H E R E.”

  • @jonatan01i
    @jonatan01i 3 years ago +17

    2:37
    That penguin wears some green shit!

  • @KlaudiusL
    @KlaudiusL 3 years ago +14

    Most predictions say AI will reach the singularity in the '40s. Looks like it will happen before the '30s.

    • @vtopia.1679
      @vtopia.1679 2 years ago

      @hitlagenjoyer imagine playing a hyper realistic game on an old smartphone

  • @qcdatabasevideos3282
    @qcdatabasevideos3282 3 years ago +1

    Just found your channel. Great stuff. Keep up the good work!

  • @McDonaldsCalifornia
    @McDonaldsCalifornia 3 years ago +2

    I love the concept of prompt engineering becoming a new kind of coding/computer job. I feel like it plays more to my personal strengths than writing code

  • @pathaleyguitar9763
    @pathaleyguitar9763 3 years ago +2

    Everyone noticed the "green shit" at 2:42, but I would also like to draw everyone's attention to the penguin at the bottom of the center column who seems to be a tad angry at us....

  • @zarthy4169
    @zarthy4169 3 years ago +6

    2:52 It looks like for some of those images, the AI took “in the shape of a square” literally.

    • @FarfettilLejl
      @FarfettilLejl 3 years ago

      How else could it have been interpreted?

    • @zarthy4169
      @zarthy4169 3 years ago

      @FarfettilLejl Well, for four of them, they literally just put a light bulb inside of a square.

    • @a-ragdoll
      @a-ragdoll 2 years ago

      i like how one of them is a sort of hexagon (top right)

    • @zarthy4169
      @zarthy4169 2 years ago

      @a-ragdoll Yeah.

  • @digitalspecter
    @digitalspecter 3 years ago +2

    We should use crowdsourcing for data and a distributed computing program to get the computing power from volunteers (it should work well with machine learning because the ratio of data to computation power is low), like the ones used for DNA analysis or the SETI program. It's dangerous to let only a select few big companies with the data and resources develop the next step in computer algorithms...

  • @whnvr
    @whnvr 3 years ago +6

    with the ‘openai storefront’ one i understand why they had to write it in that way though. when i’m working with gpt-3 i often have to use abstract, convoluted wording that makes more sense to the model than to me in order to coax the right results out. i often feel like i’m developing this completely unique, new skillset of ‘communicating w/ ai’ due to how many unforeseen interpretations it has of my instructions.
    kinda lends weight to the theory that ai will destroy us simply by misunderstanding the purpose we give it haha

    • @a-ragdoll
      @a-ragdoll 2 years ago

      how did u access gpt-3 tho

    • @whnvr
      @whnvr 2 years ago

      @a-ragdoll i applied and gave them a strong use-case for what i’m looking to achieve w/ it

    • @bestofthebest3812
      @bestofthebest3812 2 years ago

      @whnvr what are you doing? Just curious and intrigued

  • @genericusername1243
    @genericusername1243 3 years ago +9

    long live yt recommendations AI

  • @moved8575
    @moved8575 3 years ago +9

    2:43 look at the green shirt part

    • @shilohv
      @shilohv 3 years ago

      Lol. Pretty sure it’s a typo in the video. Now I’m curious what would happen if you actually ran that text. Would the penguin be standing on a green poop emoji?

    • @PS0DEK
      @PS0DEK 3 years ago +2

      @shilohv DALL-E tries to make sense of what's in the context, even if you add some noise (a.k.a. typos or out-of-context text).

    • @shilohv
      @shilohv 3 years ago

      @PS0DEK I was wondering about that. That makes sense. Kind of like Google’s "did you mean" filter. What would happen if you turned that off? Would it get confused, or come up with something completely different? For that matter, what if you typed "and now for something completely different"?

    • @PS0DEK
      @PS0DEK 3 years ago +2

      @shilohv We lack a proper paper to explain how exactly it works. But it may be impossible to turn this feature off anyway, since neural networks are non-differentiable: you cannot separate the functions into smaller blocks.

  • @markusbuchholz3518
    @markusbuchholz3518 3 years ago +1

    Great video, as is your whole YT channel. Your performance is outstanding and your effort impressive. Yes, GPT-3 is very cool and promising, state-of-the-art work, and many fascinating applications can be built on this knowledge. However, I prefer to capture your awesome ML agents ... first! You are an amazing man who has to be cloned so our planet will be even better. Fingers crossed for your great success!

  • @isd4154
    @isd4154 3 years ago +1

    I love the vintage one because you can put something that wasn't even invented during that time and see what it'll look like

  • @juanmanuelcirotorres6155
    @juanmanuelcirotorres6155 3 years ago +3

    The best channel that I found today

  • @adrianv.v.4445
    @adrianv.v.4445 3 years ago +4

    Nope, it can in fact generate reflections, it's just that the network was given a cut-out image of a mirror where it was virtually impossible for it to make something coherent

  • @Neceros
    @Neceros 3 years ago

    Copyright free product suggestions with a photo of the item. Heckn.

  • @laurasmith9135
    @laurasmith9135 3 years ago +1

    but how do you use this? do you have to install it on your computer first?

  • @OneArmDan
    @OneArmDan 3 years ago +34

    Okay, I can just feel the rise of this channel.

    • @HarrisonBorbarrison
      @HarrisonBorbarrison 3 years ago +2

      Okay, I can just feel the rise of this comment.

    • @jayknox339
      @jayknox339 3 years ago +1

      I agree. Give it some time. It'll happen.

  • @akuma7616
    @akuma7616 3 years ago +2

    Oof... I thought you had half a million subscribers but then I noticed a dot in the middle...
    And I'm like: man, really?
    This channel deserves much more.
    I can't help but say that it's sad seeing someone like you getting so few views and so little support for the amazing work that you're doing.
    I think you'll get to 50k subs this year.
    Best of luck.

  • @kenneth7239
    @kenneth7239 3 years ago +5

    I've noticed it's REALLY good with hands and lighting/texture.

  • @itsjustthatsimple628
    @itsjustthatsimple628 3 years ago +1

    That's sick!!

  • @Guytron95
    @Guytron95 3 years ago +1

    don't suppose you have a link to the clone of the network architecture?

  • @KlimovArtem1
    @KlimovArtem1 3 years ago +4

    5:05 - every mirror has the same shape for some reason)

    • @connormc4050
      @connormc4050 3 years ago

      It's because they all have the same seed image

    • @KlimovArtem1
      @KlimovArtem1 3 years ago

      @connormc4050 what do you mean by "seed image"? Is it not taking only the text string as an input?

  • @StagnantMizu
    @StagnantMizu 3 years ago +1

    How do I get access? I have a GPT-3 key.

  • @mykulpierce
    @mykulpierce 3 years ago +1

    I was just reading about this today. Are there any plans for this being a tool for developers or artists? I'd really love to give it a try

  • @phooogle
    @phooogle 3 years ago

    How do you use it? All the options seem locked.

  • @canaldoapolinario
    @canaldoapolinario 3 years ago +1

    It seems like there is room for yet another layer of abstraction between the natural language of the prompts and the actual model, maybe training a neural network to take natural language prompts and "translate" them to the weird English that the AI seems to work best on

  • @Desertpunk1986
    @Desertpunk1986 3 years ago

    Puppetmaster will come out of the primordial soup that is GPT-3.

  • @simonstrandgaard5503
    @simonstrandgaard5503 3 years ago

    Incredible

  • @MindSweptAway
    @MindSweptAway 3 years ago

    Wow!

  • @ConnoisseurOfExistence
    @ConnoisseurOfExistence 3 years ago

    Sharing here and there...

  • @sumdud2129
    @sumdud2129 3 years ago +1

    So if the API isn't actually open source I can't just download this and start making images myself?

    • @a-ragdoll
      @a-ragdoll 2 years ago

      its open source, but it doesnt have the thingy that lets u generate pictures

  • @TrueValience
    @TrueValience 3 years ago +2

    This video was great. You should experiment with videos like Two Minute Papers does

    • @TrueValience
      @TrueValience 3 years ago +1

      its kind of like this

    • @IconoclastX
      @IconoclastX 3 years ago

      i cant wait until 2mp does a vid on this and we get to have this model for ourselves c:

  • @ilzhukov-art-copy
    @ilzhukov-art-copy 3 years ago

    Hello! How can we use it on a personal PC? Where is the software?

  • @alan2here
    @alan2here 3 years ago +1

    A lizard is practising calligraphy.

  • @JACBoyJesse
    @JACBoyJesse 3 years ago

    I love the animal - food/objects hybrids.

  • @635574
    @635574 3 years ago

    Just wait for the video dal-EE

  • @serta5727
    @serta5727 3 years ago

    Subbed

  • @sgt391
    @sgt391 2 years ago

    Can't wait until 20 years from now, when phones will be able to run the training for this model in seconds

  • @findahuman6110
    @findahuman6110 3 years ago

    Thank you for the great video and content as always! The noise transitions were quite jarring though

  • @Hennesg
    @Hennesg 3 years ago +1

    If the public gains access to this, a few million people will lose their jobs over time. Stock image creators, illustrators, product designers

    • @a-ragdoll
      @a-ragdoll 2 years ago

      if it was released in its current state it would be easy to see that its made by ai

  • @megaheroes3611
    @megaheroes3611 3 years ago

    How can I use this?

  • @boknonoyski
    @boknonoyski 2 years ago

    2:38 they spelt shirt wrong

  • @godofthecripples1237
    @godofthecripples1237 3 years ago +1

    We all know what this is really going to be used for once it's stable and publicly available.

    • @a-ragdoll
      @a-ragdoll 2 years ago

      nightmare fuel?

    • @godofthecripples1237
      @godofthecripples1237 2 years ago

      @a-ragdoll I was thinking along the lines of something more NSFW, but yeah, plenty of nightmare fuel will be out there

    • @a-ragdoll
      @a-ragdoll 2 years ago

      @godofthecripples1237 if someone tries to generate nsfw stuff on this thing its still gonna be nightmare fuel, maybe in 10 years it will look better

  • @MindSweptAway
    @MindSweptAway 2 years ago

    I think the reason why this is a demo is because it’s still in beta, and if you use the model it would break.

  • @robo1540
    @robo1540 3 years ago

    yoo is that 1:57 the default minecraft grass top texture

    • @robo1540
      @robo1540 3 years ago

      holy shit it pixel-by-pixel is
      how much minecraft does one have to play to be able to tell that texture apart from all other random noise

    • @abrampainter3764
      @abrampainter3764 3 years ago

      @robo1540 Yeah just googled it. That's crazy

  • @VincentFischer
    @VincentFischer 3 years ago +1

    This is shockingly near AGI level isn't it? I mean the multi disciplinary understanding of all things baffles me.

  • @vitotonello261
    @vitotonello261 3 years ago +2

    bring Pokemon to the next level!

  • @robo1540
    @robo1540 3 years ago +1

    is that the mf cicada 3301 song by lemmino

    • @k8ieone
      @k8ieone 3 years ago

      Yup! I was looking for a comment like this.

  • @XetXetable
    @XetXetable 3 years ago +1

    Those mirror results look weird. Why is it the same mirror every time? There don't seem to be similar constants in the other examples.

    • @SebastianSchuchmannAI
      @SebastianSchuchmannAI 3 years ago

      A big part of the image was given in this case. Sorry, it isn't shown in the video.

    • @LaPapaya
      @LaPapaya 3 years ago

      That one had a prompt image like the old gpt-image

    • @zarthy4169
      @zarthy4169 3 years ago +2

      @LaPapaya Oh, so you can set a specific prompt image that will show up for each image.

    • @LaPapaya
      @LaPapaya 3 years ago

      @zarthy4169 Exactly, it will generate from that prompt image.

  • @mrquackface
    @mrquackface 2 years ago

    Our AI OVERLORDS ARE ALMOST COMING

  • @NicksStuff
    @NicksStuff 3 years ago

    Where's the snail?

  • @TheGeekosDen
    @TheGeekosDen 2 years ago

    „Green Shit” lol

  • @zarthy4169
    @zarthy4169 3 years ago +1

    2:41 The middle one looks perfect. It looks like an actual emoji.

  • @dadthelad
    @dadthelad 3 years ago +1

    A penguin wearing a green shit???

  • @zarthy4169
    @zarthy4169 3 years ago

    2:19 This looks like Minecraft.

  • @EricRogstad
    @EricRogstad 3 years ago

    You don't say the "p" in "OpenAI"? Sounds like "O'en AI"

  • @Michaelf122
    @Michaelf122 2 years ago

    So basically your AI is an incredible Google image search

  • @hward1973
    @hward1973 2 years ago

    would be great for confused girls trying to explain what they want in a tattoo shop

  • @workflowinmind
    @workflowinmind 3 years ago

    Mom? I'm scared

  • @cadenitadelnazareno6717
    @cadenitadelnazareno6717 2 years ago

    Human text=Green shit, I’m sure that should’ve looked different

  • @magicjuand
    @magicjuand 3 years ago

    it's just fancy search

  • @myuniquehandle
    @myuniquehandle 3 years ago

    Is it nothing more than an image search? Most of the images are designed by humans; it would be interesting to highlight which changes GPT-3 made (if any)...

  • @maxziebell4013
    @maxziebell4013 3 years ago +2

    It’s a decoy... these results are just attention bait while “Next” is ravaging through the world ;-)

  • @vsiegel
    @vsiegel 3 years ago

    Wait, what?
    That model is an order of magnitude smaller than GPT-3 and an order of magnitude more scary than GPT-3.
    I use "scary" here as a unit of AI performance.
    What irritates me is that my intuition tells me that generating images needs a very abstract understanding of objects and other concepts.
    Yes, there is the problem: I used the word "understanding".

    • @StagnantMizu
      @StagnantMizu 3 years ago

      GPT-3 is scary too, had some conversation in playground with it which made me almost doubt if it was sentient or not lmao

  • @JordanPriede
    @JordanPriede 3 years ago +1

    Excessive use of the digital noise transition, and the very slow excessive blur transition to bring in the example photos.
    It takes away from the rest of the video.
    Great content, though.

  • @maroon9138
    @maroon9138 3 years ago

    German English