The Narrated Transformer Language Model

  • Published May 15, 2024
  • Model improvement in AI/ML has accelerated rapidly over the last few years. The majority of the state-of-the-art models in the field are based on the Transformer architecture. Examples include BERT (which, when applied to Google Search, resulted in what Google calls "one of the biggest leaps forward in the history of Search") and OpenAI's GPT-2 and GPT-3 (which are able to generate coherent text and essays).
    This video by the author of the popular "Illustrated Transformer" guide will introduce the Transformer architecture and its various applications. This is a visual presentation accessible to people with various levels of ML experience.
    Intro (0:00)
    The Architecture of the Transformer (4:18)
    Model Training (7:11)
    Transformer LM Component 1: FFNN (10:01)
    Transformer LM Component 2: Self-Attention (12:27)
    Tokenization: Words to Token Ids (14:59)
    Embedding: Breathe meaning into tokens (19:42)
    Projecting the Output: Turning Computation into Language (24:11)
    Final Note: Visualizing Probabilities (25:51)
    The Illustrated Transformer:
    jalammar.github.io/illustrate...
    Simple transformer language model notebook:
    github.com/jalammar/jalammar....
    Philosophers On GPT-3 (updated with replies by GPT-3):
    dailynous.com/2020/07/30/phil...
    -----
    Twitter: / jayalammar
    Blog: jalammar.github.io/
    Mailing List: jayalammar.substack.com/
    More videos by Jay:
    Jay's Visual Intro to AI
    • Jay's Visual Intro to AI
    How GPT-3 Works - Easily Explained with Animations
    • How GPT3 Works - Easil...
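The chapter list above follows a token's path through a Transformer language model: tokenization, embedding, self-attention, the FFNN, and projection of the output into probabilities. As a rough illustration only, here is a minimal pure-Python sketch of that path. It is not the code from the video or the linked notebook: the vocabulary, the sizes, the random untrained weights, and all function names are assumptions made for the example, and it leaves out positional encodings, layer normalization, multiple heads, and stacked layers.

```python
import math
import random

random.seed(0)

# Toy vocabulary and width: assumptions for illustration, not from the video.
VOCAB = ["the", "shawshank", "redemption", "<unk>"]
D_MODEL = 4

def rand_matrix(rows, cols):
    # Random, untrained weights; a real model learns these during training.
    return [[random.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

def matvec(vec, mat):
    # Multiply a row vector (len == rows) by a rows x cols matrix.
    return [sum(v * row[j] for v, row in zip(vec, mat)) for j in range(len(mat[0]))]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

EMBED = rand_matrix(len(VOCAB), D_MODEL)
W_Q, W_K, W_V = (rand_matrix(D_MODEL, D_MODEL) for _ in range(3))
W_UP = rand_matrix(D_MODEL, 4 * D_MODEL)
W_DOWN = rand_matrix(4 * D_MODEL, D_MODEL)

def tokenize(text):
    # Tokenization: words to token ids (whitespace split, unknown-word fallback).
    unk = VOCAB.index("<unk>")
    return [VOCAB.index(w) if w in VOCAB else unk for w in text.lower().split()]

def self_attention(xs):
    # Single-head causal self-attention: position i mixes in positions 0..i.
    qs = [matvec(x, W_Q) for x in xs]
    ks = [matvec(x, W_K) for x in xs]
    vs = [matvec(x, W_V) for x in xs]
    out = []
    for i, q in enumerate(qs):
        scores = [dot(q, ks[j]) / math.sqrt(D_MODEL) for j in range(i + 1)]
        weights = softmax(scores)
        mixed = [sum(w * vs[j][d] for j, w in enumerate(weights)) for d in range(D_MODEL)]
        out.append([a + b for a, b in zip(xs[i], mixed)])  # residual connection
    return out

def ffnn(x):
    # Position-wise feed-forward network with ReLU and a residual connection.
    hidden = [max(0.0, h) for h in matvec(x, W_UP)]
    return [a + b for a, b in zip(x, matvec(hidden, W_DOWN))]

def next_token_probs(text):
    ids = tokenize(text)
    xs = [EMBED[i] for i in ids]                # embedding lookup
    xs = [ffnn(x) for x in self_attention(xs)]  # one simplified block
    logits = [dot(xs[-1], e) for e in EMBED]    # project back to vocabulary (tied weights, an assumption)
    return softmax(logits)                      # scores to probabilities

probs = next_token_probs("The Shawshank")
for word, p in zip(VOCAB, probs):
    print(f"{word:12s} {p:.3f}")
```

Because the weights are random, the printed probabilities are meaningless; training is what would move probability mass toward a sensible next token. The sketch only shows where each component from the chapter list sits in the computation.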

COMMENTS • 224

  • @parthchokhra948
    @parthchokhra948 3 years ago +224

    Your blog on the Illustrated Transformer was my intro to Deep Learning with NLP. Thanks for the amazing contributions to the community.

    • @jc_777
      @jc_777 3 years ago +6

      Yeah it is being referenced in my DL class too. Truly great content for new learners!

  • @ans1975
    @ans1975 3 years ago +26

    The Illustrated Transformer blog is a masterpiece!

  • @andresjvazquez
    @andresjvazquez 2 years ago +28

    Dear Teacher Alammar, thanks to this video I was able to get accepted into a BYU lab as an external researcher (even though I didn’t finish college) and have been invited by my professor to participate with the lab in CASP15. You really changed the course of my life by demystifying such complex topics for non-traditional learners like me. I’m eternally in your debt.

  • @Roshan-xd5tl
    @Roshan-xd5tl 2 years ago +17

    Your ability to explain and break down complex topics into simpler and intuitive sections is legendary. Thank you for your contribution!

  • @ultraviolenc3
    @ultraviolenc3 2 years ago

    I’ve just read your “The illustrated transformer” article and I wanted to say that you made very smart and simple visual representations. It seems you put a lot of thought into that.

  • @ayush612
    @ayush612 2 years ago +4

    I remember seeing your Transformer blog post, Jay. It was legendary!! It was referred to by other YouTubers as well. And thanks a lot for the wonderful explanation as well!

  • @curiouspie1264
    @curiouspie1264 1 year ago

    One of the most comprehensive video and blog overviews of Transformers I've seen. Thank you. 🙏

  • @quietkael7349
    @quietkael7349 3 years ago +9

    Thank you so much for all the tireless work you do for us visual learners out there! I’m looking forward to videos where you get into your excellent visualizations of the underlying matrix operations. Your visual abstractions both at the flow chart level and matrix/vector level have really shaped my mental model for what I think about when I’m engineering models. I’m so grateful and so excited to see what you come out with next (this library you hint at looks wonderful!)

    • @arp_ai
      @arp_ai  3 years ago +1

      Thanks Jack!

  • @goelnikhils
    @goelnikhils 9 months ago

    I haven't seen such a clear explanation of Transformers and Decoder LM models. Amazing work, Jay.

  • @nileshkikle8112
    @nileshkikle8112 4 months ago

    Outstanding job demystifying the inner working details of the Transformer model architecture! All the illustrations and animations for the inference working are awesome. Thank you for taking all the time and sharing your understanding with all of us. Kudos! 👍

  • @pypypy4228
    @pypypy4228 1 month ago

    A huge thank you for this explanation!

  • @jpmarinhomartins
    @jpmarinhomartins 3 years ago

    Dude I freakin love your blog, keep up with the good work! Thanks for everything!

  • @thecutestcat897
    @thecutestcat897 1 year ago

    Thanks, your Blog is so clear!

  • @kazimafzal
    @kazimafzal 1 year ago +1

    You, sir, are an amazing teacher! I'm absolutely flabbergasted by how well you've explained it; to think it's all mathematics at the end of the day! Thank you for taking the time to put together such a concise yet complete guide to transformers!

  • @ianborregoobrador
    @ianborregoobrador 28 days ago

    2024, still a great reference to Transformers. Million thanks for the amazing work!

  • @bighit7596
    @bighit7596 3 years ago +2

    You have a gift for explaining complex material... many other technical talks assume the audience is very knowledgeable and is attending the session just for networking.

  • @KlimovArtem1
    @KlimovArtem1 3 years ago +2

    27:56 - this explains a lot, thank you so much!

  • @tehseenzia3135
    @tehseenzia3135 3 years ago

    Amazing illustration. Keep going Jay.

  • @ishandindorkar2846
    @ishandindorkar2846 5 months ago

    Jay, many thanks for your work. These videos help me a lot to understand key concepts in NLP domain through visualization.

  • @yoonyamm
    @yoonyamm 6 months ago

    Thank you for sharing wonderful insight!

  • @sharkeyryan
    @sharkeyryan 2 years ago

    Thanks for creating this content. Your explanation is quite easy to follow, especially for someone like me who is just beginning to explore these areas of AI/ML.

  • @itall9025
    @itall9025 3 years ago

    Great explanation! Please keep doing this format.

  • @exxzxxe
    @exxzxxe 4 months ago

    Maybe the best video on this subject.

  • @tachyon7777
    @tachyon7777 2 years ago +7

    It would be nice to have a step-by-step walkthrough of the training process, and of why each of those steps makes sense intuitively.

  • @1Kapachow1
    @1Kapachow1 3 years ago

    Really enjoyed your blog post and video, super clear - thank you very much for this amazing resource :)

  • @tusharkhustule3316
    @tusharkhustule3316 7 months ago

    1 minute into the video and I already subscribed.

  •  3 years ago +10

    Just a personal comment on the format of the videos: I, personally, find that constant change of scene (like in "The architecture of the transformer" section) where the camera changes constantly showing you and then showing the computer screen and then back to you, is extremely annoying.
    The content of the video itself was informative.

  • @zongmianli9072
    @zongmianli9072 11 months ago

    Thanks for the very clear and concise explanation, Jay!

  • @maxbeber
    @maxbeber 3 years ago

    Thank you so much for the clear and concise explanation. Keep up the great work.

  • @studmatze958
    @studmatze958 1 year ago

    Thank you so much for your work on attention and transformers. Your posts and videos are the best I have encountered so far in terms of visualization and explanation. And you did it way better than my professor. Again, thank you :)

  • @omarsultan827
    @omarsultan827 2 years ago

    Thank you for this awesome introduction!

  • @josephsueke
    @josephsueke 2 months ago

    Really clear. amazing job!

  • @drtariqahmadphd3372
    @drtariqahmadphd3372 3 years ago +1

    Never been more excited by a YouTuber channel than when I saw this guy had a channel.

  • @jacakopl
    @jacakopl 3 years ago +1

    This is the best video I have seen by far in this domain. You strike a perfect balance in assuming the level of understanding of audience :)

    • @arp_ai
      @arp_ai  3 years ago

      Awesome! Glad you found it useful!

  • @ygorgallina2691
    @ygorgallina2691 2 years ago

    Thank you so much for your work! The illustrations help to clearly understand these models!!

  • @stephenngumbikiilu3988
    @stephenngumbikiilu3988 1 year ago

    Your blog was referred to me by my lecturer, Julia Kreutzer of Google Translate; it's just an amazing piece of work. It has really helped me in my understanding of these concepts. Thanks.

  • @damonandrews1887
    @damonandrews1887 3 years ago

    I found this very helpful visual explainer, thanks so much for your time, and thanks for chopping it up into sections for easy revision 🤓!

  • @maruthiprasad8184
    @maruthiprasad8184 5 months ago

    Amazing explanation. My search to understand transformers ended here. You have done a wonderful job; thank you so much for the simplest explanation I have ever seen.

  • @hasanb2312
    @hasanb2312 3 years ago

    Great video Jay, thank you so much!

  • @kalinda619
    @kalinda619 3 years ago +2

    A phenomenal extension of your blog post. Commenting for that bump in the recommendation algorithm!

    • @arp_ai
      @arp_ai  3 years ago

      Thank you! Much appreciated!

  • @Udayanverma
    @Udayanverma 7 months ago

    loved it. thanks. got some new neurons in my head created by this video.

  • @JimBob-lq1db
    @JimBob-lq1db 4 months ago

    Thank you for this great explanation. Visualize, visualize, visualize: the best way to understand how it works.

  • @yudiguzman8926
    @yudiguzman8926 3 years ago

    I really appreciate your explanation about this topic. One more time, I check that DL is my new passion. Thanks a lot.

  • @HelenTueni
    @HelenTueni 1 year ago

    Amazing video. Thank you very much for making this topic accessible.

  • @javierechevarria1548
    @javierechevarria1548 3 years ago

    You are really good (excellent) at explaining a complex topic in a simple way. Congratulations!!!!

  • @AdityPai
    @AdityPai 3 years ago +1

    Thank you for writing the blog. It has helped me.

  • @sudzam
    @sudzam 11 months ago

    Wow! One of THE best explanations of Transformers. Thanks @Jay!!

  • @RK-fr4qf
    @RK-fr4qf 1 year ago

    Impressive. Thank you.

  • @user-yr5vj7xp8d
    @user-yr5vj7xp8d 1 month ago

    Jay, I recently took an AI course, but you presented NLP very well, in a didactic way... I learned a lot from you.
    Thank you. Keep being this wonderful guy.

  • @jesuslopez3306
    @jesuslopez3306 1 year ago

    Definitely it is easier to understand in a vertical way. Thanks for everything!

  • @evertonlimaaleixo1084
    @evertonlimaaleixo1084 2 years ago

    Amazing!
    Thank you for sharing!

  • @a.e.5054
    @a.e.5054 3 years ago

    The best explanation of the Transformer and GPT model !!

  • @user-fz5vr7yo2l
    @user-fz5vr7yo2l 1 year ago

    Amazinnnng illustration of language model transformers

  • @IyadKhuder
    @IyadKhuder 1 year ago

    I ended up here to familiarize myself with NLP transformers. Your video was the optimal choice for me, as it explains the concept in an understandable, scientific manner. Thanks.

  • @peterkahenya
    @peterkahenya 1 year ago

    Wow! 🎉 Awesome intro.

  • @TusharKale9
    @TusharKale9 2 years ago

    Great masterpiece explanation of NLP in a real-life scenario. Thank you

  • @diogo.magalhaes
    @diogo.magalhaes 3 years ago +9

    Jay, as a PhD student, I'm a fan of your ability to explain complex topics in a very simple, illustrated and didactic way! I always recommend your 'illustrated' posts to my colleagues. Thanks again for this great video, keep up the good work!

  • @niundisponible
    @niundisponible 1 year ago

    I see Miles Davis vinyl, kind of blue. Awesome album, and thanks for the video!

  • @Opinionman2
    @Opinionman2 2 years ago

    Awesome stuff. your blog really helped clarify my deep learning class.

  • @hailongle
    @hailongle 3 years ago

    Fantastic teacher. Thanks Jay!

  • @MsFearco
    @MsFearco 2 years ago

    I just found this now. it's super. thanks

  • @jackdavidweber
    @jackdavidweber 3 years ago

    This is really great! Highly recommend!

  • @andreysguitarmusic2661
    @andreysguitarmusic2661 1 year ago

    Great explanations!

  • @romulodrumond3526
    @romulodrumond3526 3 years ago

    One of the best videos on the subject

  • @mrityunjayupadhyay7332
    @mrityunjayupadhyay7332 1 year ago

    Great explanation

  • @o_felipe_reis
    @o_felipe_reis 3 years ago +1

    Great video! Best regards from Brazil!

  • @utsavshukla7516
    @utsavshukla7516 3 years ago

    great explanation! also love all the pop culture references in your room :p

  • @mikemihay
    @mikemihay 3 years ago

    Awesome content! thank you!

  • @akshaya9577
    @akshaya9577 3 years ago

    absolutely amazing video

  • @atheeralbarqi3540
    @atheeralbarqi3540 2 years ago +2

    I appreciate your detailed explanation, Mr. Jay. My first reaction after watching the video was to read your article (The Illustrated Transformer). My question relates to the Transformer architecture, which consists of six encoder layers and six decoder layers, all of which seem to be very similar. What is the purpose of the six layers, given that Self-Attention already checks every word of a sentence for relevant information in the first layer? In addition, Attention is used to boost training speed, so will these six layers slow it down?

  • @junlinguo77
    @junlinguo77 2 years ago

    I like the way you are teaching!!!

  • @NarkeEmpire
    @NarkeEmpire 6 months ago

    You are a great teacher!!! If you check the EQ settings and lower the music at the beginning, the video is perfect!!! Thanks a lot for sharing your knowledge in this very understandable way

  • @tshepisosoetsane4857
    @tshepisosoetsane4857 1 year ago

    Amazing work indeed thanks for simplifying things for everyone to understand this AI great work

  • @rsilveira79
    @rsilveira79 3 years ago +4

    Nice collection of albums, man! Miles Davis, Radiohead, John Coltrane, very classy! 👏👏👏

    • @kumarvikas_134
      @kumarvikas_134 3 years ago +1

      Spot on observation, kind of ironic to be listening to Ok Computer and teaching about artificial intelligence :D

  • @sachinr3823
    @sachinr3823 3 years ago

    Omg, thanks a lot for these amazing videos. Your lectures and blogs are so easy to understand.

    • @sachinr3823
      @sachinr3823 3 years ago

      Small request, please pin the BGM you used in the video

  • @rupakgoyal1611
    @rupakgoyal1611 3 years ago

    loved the music behind ..

  • @KlimovArtem1
    @KlimovArtem1 3 years ago +5

    14:15 - so, the Self-Attention layer is actually the thing that’s trying to understand the meaning of the whole sequence? How does it work and how can it be trained? How long a sequence can it analyze?

  • @nisalbandara
    @nisalbandara 3 years ago

    I'm doing a Twitter sentiment analysis and I couldn't wrap my head around BERT, and then I came across this video. Perfectly explained. Thanks a lot

  • @SerJon5
    @SerJon5 3 years ago +2

    Omg, man. I've found several posters of really good music in your room! I definitely need your TOP-10 albums list!!))

    • @arp_ai
      @arp_ai  3 years ago

      Haha! I'm mainly displaying some of my favorites!

  • @nmstoker
    @nmstoker 3 years ago +2

    Watching it now, thanks so much! It's really helpful to go through these kinds of things with clear examples and explanations.
    My only preference would've been to reduce the volume of the background music in the intro. So many podcasts do this and it's an annoying trend!

    • @arp_ai
      @arp_ai  3 years ago +1

      Thanks Neil! Noted on the audio!

  • @Nereus22
    @Nereus22 2 years ago

    Great video, thank you!

  • @tiborsaas
    @tiborsaas 4 months ago

    This video really aged well. It came out just after GPT3 and before ChatGPT. I love it how it gives massive insights to how current generative AI works behind the scenes (but obviously in a simplified way).

  • @akshikaakalanka
    @akshikaakalanka 1 year ago

    Thank you very much! this is awesome and easy to understand.

  • @vincentyang8393
    @vincentyang8393 1 year ago

    Thanks, I learn a lot!

  • @sapnilpatel1645
    @sapnilpatel1645 11 months ago

    Great video!

  • @parmarsuraj99
    @parmarsuraj99 3 years ago +5

    ❤️ That library!!!!

    • @arp_ai
      @arp_ai  3 years ago +6

      It's been my entire focus the last few months. Stay tuned!

  • @user-oe8lh2gw1u
    @user-oe8lh2gw1u 11 months ago

    Great work 👍👍👍

  • @mahdiamrollahi8456
    @mahdiamrollahi8456 2 years ago

    Thanks, very intuitive…

  • @ahmedb2559
    @ahmedb2559 1 year ago

    Thank you !

  • @raminbakhtiyari5429
    @raminbakhtiyari5429 3 years ago

    I don't know how to say thank you; I can just say please continue uploading your amazing videos. I live in a constrained country and this video is my only hope for learning like other people. Yours sincerely,
    Ramin Bakhtiyari.

  • @yuchenyang4394
    @yuchenyang4394 3 years ago

    Great content! can't wait for more.

    • @arp_ai
      @arp_ai  3 years ago

      Thank you Yuchen!

  • @armingh9283
    @armingh9283 3 years ago

    Thanks for the explanation. Good music taste at the background by the way👍

    • @arp_ai
      @arp_ai  3 years ago

      Thank you!

  • @tsadigov1
    @tsadigov1 8 months ago

    I am trying to understand how the transformer works, and you explain it in a very accessible way. One small thing: I wish the video had fewer transitions between the two cameras.

  • @datascileakos1936
    @datascileakos1936 2 years ago

    Thank you.

  • @rajeshnalla4497
    @rajeshnalla4497 3 years ago

    Thank you

  • @amirhosseinfereidooni1798
    @amirhosseinfereidooni1798 2 years ago

    Thanks for the great explanation. MLP (at 11:35) stands for multilayer perceptron :)

  • @bharath5666
    @bharath5666 3 years ago

    Amazing.

  • @kobic8
    @kobic8 1 year ago

    hey Jay! love the blog on illustrated transformer, do you also have a reference to your blog on vision transformers?

  • @hunorszegi4007
    @hunorszegi4007 1 year ago

    Thank you for your videos and blog posts. These were my inspiration to create a Java GPT-2 implementation for learning purposes. I can't use a link here, but as huplay I uploaded it to the biggest hosting site, and it is called gpt2-demo.

  • @muhammadjaved9169
    @muhammadjaved9169 11 months ago

    great dear