BERT Explained!

  • Published Nov 14, 2024

COMMENTS • 47

  • @connor-shorten
    @connor-shorten  4 years ago +13

    1:39 Bidirectional Language Modeling
    2:45 Masking Strategy
    3:38 BERT input
    4:55 The Illustrated Transformer
    5:50 Tensor Dimensions in BERT
    7:20 BERT Model Architecture
    7:58 BERT Base vs. Large
    9:13 Datasets for Training BERT
    9:40 Transfer Learning with BERT
    10:03 SQuAD and BERT
    12:00 Ablations

    • @7justfun
      @7justfun 4 years ago

      Thank you. A few quick clarifying questions:
      Is the dimension of the Query matrix the same as the input, L x De?
      How does it factorize the input into the Q, K, V matrices? I don't think it's a simple SVD.
      Is the dimension of K Dk x De, so that K^T is De x Dk and can be multiplied with Q? Is this correct?
      Is the dimension of V Dv x De, with Dv = Dk, so that the final output Z can be L x De? Is this understanding correct?
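
On the shapes asked about above: in standard scaled dot-product attention, Q, K, and V are not a factorization of the input. Each one is the input multiplied by its own learned weight matrix. With input X of shape L x De, the weights W_Q and W_K are De x Dk and W_V is De x Dv, so Q and K come out L x Dk and V comes out L x Dv. Then Q K^T is L x L, Z is L x Dv, and an output projection maps back to L x De. Below is a minimal pure-Python sketch of that, assuming a single head and random stand-in weights. The helper names and the toy sizes (L=4, De=8, Dk=Dv=2) are made up for illustration and are not taken from the video.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def softmax_rows(m):
    """Apply a numerically stable softmax to every row."""
    out = []
    for row in m:
        mx = max(row)
        exps = [math.exp(v - mx) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def shape(m):
    return (len(m), len(m[0]))


def random_matrix(rows, cols, rng):
    """Random stand-in for a learned weight matrix (an assumption, not trained)."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def single_head_attention(x, w_q, w_k, w_v, w_o):
    q = matmul(x, w_q)            # L x Dk
    k = matmul(x, w_k)            # L x Dk
    v = matmul(x, w_v)            # L x Dv
    d_k = len(w_q[0])
    scores = matmul(q, transpose(k))                      # L x L
    scores = [[s / math.sqrt(d_k) for s in row] for row in scores]
    weights = softmax_rows(scores)                        # each row sums to 1
    z = matmul(weights, v)                                # L x Dv
    return q, k, v, weights, matmul(z, w_o)               # output: L x De


rng = random.Random(0)
L, De, Dk, Dv = 4, 8, 2, 2        # toy sizes chosen for illustration
x = random_matrix(L, De, rng)     # stand-in for the token embeddings
w_q = random_matrix(De, Dk, rng)
w_k = random_matrix(De, Dk, rng)
w_v = random_matrix(De, Dv, rng)
w_o = random_matrix(Dv, De, rng)  # single head, so W_O is Dv x De

q, k, v, weights, out = single_head_attention(x, w_q, w_k, w_v, w_o)
print(shape(q), shape(k), shape(v), shape(weights), shape(out))
# (4, 2) (4, 2) (4, 2) (4, 4) (4, 8)
```

With several heads, the per-head Z matrices are concatenated along the feature axis before the output projection, so W_O has (number of heads x Dv) rows instead of Dv.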

  • @pankajverma29007
    @pankajverma29007 4 years ago +39

    Thanks! But please slow down :)

  • @petersorbo3277
    @petersorbo3277 3 years ago +4

    Loved the video Henry! Your fast-paced style works great for gaining a general understanding of the model and how it fits into a use case. Each slide also serves as a good index for further learning. Surprised at all the negative comments, although you might've done better calling it 'BERT Overview'.

  • @kooshi1333
    @kooshi1333 3 years ago +2

    my understanding of transformers somehow went down by watching this video

  • @MarketingLeap
    @MarketingLeap 4 years ago +6

    Well explained, but yes, slow down a bit! 👍👍

  • @BosakMaw
    @BosakMaw 4 years ago +10

    Hi, great work! Can you make a video about the first Transformer paper, "Attention Is All You Need"?
    I haven't caught up on those things, and I think others will appreciate it too.

    • @connor-shorten
      @connor-shorten  4 years ago +1

      Thank you for the suggestion! I recommend watching "Attention Is All You Need" from Yannic Kilcher on YouTube in the meantime! That video and the blog post "The Illustrated Transformer" helped a lot with my understanding of it!

    • @bioinfolucas5606
      @bioinfolucas5606 4 years ago +1

      Yes! I would like to suggest the same thing! I watched the Yannic Kilcher one before, but I would really like to see a focus on attention per se. Thank you!

  • @pranavpattarkine7760
    @pranavpattarkine7760 4 years ago +8

    Just breathe while speaking!

  • @darshanbari2439
    @darshanbari2439 4 years ago +41

    When a rapper starts learning NLP and Machine Learning

  • @vigneshwarachinnadurai9636
    @vigneshwarachinnadurai9636 3 years ago

    Neat explanation. After going through the paper, this video is best for a quick run-through.

  • @TechVizTheDataScienceGuy
    @TechVizTheDataScienceGuy 4 years ago

    Thanks for the time stamps. Nice explanation overall.

  • @TechVizTheDataScienceGuy
    @TechVizTheDataScienceGuy 4 years ago

    Nicely explained

  • @traindiesel7005
    @traindiesel7005 3 years ago

    if you play this video at double speed you can smell your brain cooking a little

  • @gjeraq
    @gjeraq 4 years ago

    I don't know why people are complaining. I am not a native speaker and for me your rate of speaking is just fine.

  • @simonbody7632
    @simonbody7632 4 years ago +20

    Hi. Nice work, but you are talking waaaaay too fast. Slow down

  • @youngcolt5305
    @youngcolt5305 4 years ago +43

    Problems with your video: You speak too fast relative to the changing slides and the text on them. This is ineffective when creating tutorials. You assume the viewers already know too much, so you throw around words like "auto-regressive" without bothering to explain what they mean. Perhaps you should make videos about a focused sub-topic, because otherwise this type of video isn't of much use to people.

  • @nikhithasagarreddy
    @nikhithasagarreddy 4 years ago

    Can a student apply BERT for his project work?

  • @sheikhakbar2067
    @sheikhakbar2067 4 years ago

    Why the rush?

  • @ayushdwivedi3769
    @ayushdwivedi3769 4 years ago

    Liked the video a lot... have subscribed to your channel... please upload more videos

  • @alassanndiallo
    @alassanndiallo 3 years ago

    Good work. Please slow down next time!

  • @RaiNBoOoOoWw
    @RaiNBoOoOoWw 2 years ago

    who is chasing you? super fast!

  • @nesmaabdelaziz7268
    @nesmaabdelaziz7268 4 years ago

    I have a question, if anyone can help: if I give BERT or any transformer a paragraph that contains the name of a disease or gene, for example, how can it detect that this is a disease? And does it replace it with a tag, for example?
    Second question: is there a way to add those identified tags to a matrix, for example, so that I could focus on them while applying attention?

  • @monart4210
    @monart4210 4 years ago

    Could I extract word embeddings from BERT and use them for unsupervised learning, e.g. topic modeling? :)

    • @ericmacedo_
      @ericmacedo_ 4 years ago

      I have seen a few approaches where they run BERT and LDA separately, concatenate the vector representations (BERT + LDA), and finally train an autoencoder to learn a lower-dimensional latent-space representation.
      blog.insightdatascience.com/contextual-topic-identification-4291d256a032
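
The concatenation step mentioned above can be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `concat_features` and the `gamma` weight on the LDA part are hypothetical, the vectors are toy placeholder values rather than real model outputs, and the autoencoder stage is left out.

```python
def concat_features(lda_vecs, bert_vecs, gamma=1.0):
    """Join each document's LDA topic vector (scaled by gamma) with its BERT vector.

    gamma is a hypothetical knob for balancing the two feature blocks.
    """
    if len(lda_vecs) != len(bert_vecs):
        raise ValueError("need one LDA vector and one BERT vector per document")
    return [
        [gamma * t for t in lda] + list(bert)
        for lda, bert in zip(lda_vecs, bert_vecs)
    ]


# Toy placeholder values for two documents (not real LDA or BERT outputs).
lda_vecs = [[0.7, 0.2, 0.1], [0.1, 0.1, 0.8]]                 # 3 topics
bert_vecs = [[0.5, -1.0, 0.25, 2.0], [1.5, 0.0, -0.5, 1.0]]   # 4 dims instead of 768

combined = concat_features(lda_vecs, bert_vecs, gamma=2.0)
print(len(combined), len(combined[0]))  # 2 7
```

Each combined row would then be the input to whatever dimensionality-reduction model is trained next, such as the autoencoder in the approach described above.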

  • @saad-europak-stories
    @saad-europak-stories 3 years ago

    @Henry AI Labs I have a question... Is BERT good enough for malware detection?

  • @Dunkeyhote
    @Dunkeyhote 4 years ago

    amazing video thanks!

  • @pranavwankhedkar7435
    @pranavwankhedkar7435 4 years ago

    Are you Brandon Butch?

  • @LeQNam
    @LeQNam 4 years ago

    Turn the speed to 2x, it's really easy to rock.

  • @mjafar
    @mjafar 4 years ago

    Thank you!

    • @mjafar
      @mjafar 4 years ago

      Btw you're not talking too fast. If you were slower it'd become boring. There are captions and slow-downs for people who can't follow.

  • @williambonvini5806
    @williambonvini5806 4 years ago +7

    Too fast, sorry, but I can't keep up.

  • @naevan1
    @naevan1 2 years ago

    I'm just now learning text mining and nlp. Holy shit I don't understand anything

  • @gemanucul
    @gemanucul 2 years ago

    hallo bert!!!!!!!!! hi!!!!!!!!!!!!!!!!!!!!!

  • @esra_erimez
    @esra_erimez 4 years ago +5

    I'm watching at 1.5 speed and can understand it perfectly fine.

  • @henryCcc8614
    @henryCcc8614 2 years ago

    Too fast but great.

  • @yeeter269
    @yeeter269 2 years ago

    153 dislikes woa

  • @a_22_romitbhaumik89
    @a_22_romitbhaumik89 8 months ago

    Slow down please