1:39 Bidirectional Language Modeling
2:45 Masking Strategy
3:38 BERT input
4:55 The Illustrated Transformer
5:50 Tensor Dimensions in BERT
7:20 BERT Model Architecture
7:58 BERT Base vs. Large
9:13 Datasets for Training BERT
9:40 Transfer Learning with BERT
10:03 SQuAD and BERT
12:00 Ablations
Thank you! A quick clarification question:
Is the dimension of the Query matrix the same as the input, L x De?
How does it factorize the input into the Q, K, V matrices? I don't think it's a simple SVD.
Is the dimension of K Dk x De, so that K^T is De x Dk and can be multiplied with Q? Is this correct?
Is the dimension of V Dv x De, with Dv = Dk, so that the final output Z can be L x De? Is this understanding correct?
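In case it helps: in the standard transformer there is no factorization of the input (no SVD). Q, K, and V are each produced by their own learned linear projection of the input, so Q and K are L x Dk and V is L x Dv, and a final output projection maps back to L x De. A minimal NumPy sketch of single-head scaled dot-product attention (the dimension values are illustrative assumptions, not from the video):

import numpy as np

L, De, Dk, Dv = 10, 768, 64, 64       # sequence length, model dim, query/key dim, value dim
X = np.random.randn(L, De)            # input token embeddings, shape (L, De)

# Three separately learned projection matrices (random here just for illustration)
W_Q = np.random.randn(De, Dk)
W_K = np.random.randn(De, Dk)
W_V = np.random.randn(De, Dv)

Q = X @ W_Q                           # (L, Dk)
K = X @ W_K                           # (L, Dk)
V = X @ W_V                           # (L, Dv)

scores = (Q @ K.T) / np.sqrt(Dk)      # (L, L) attention scores
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax

Z = weights @ V                       # (L, Dv)

So Q is L x Dk rather than L x De, Dk = Dv is a common convention but not a requirement, and it is the output projection (Dv x De), not Z itself, that restores the L x De shape.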
Thanks! But please slow down :)
0.75 speed :)
Loved the video, Henry! Your fast-paced style works great for gaining a general understanding of the model and how it fits into a use case. Each slide also serves as a good index for further learning. Surprised at all the negative comments... although you might've done better calling it 'BERT Overview'.
My understanding of transformers somehow went down after watching this video.
Well explained, but yes, slow down a bit! 👍👍
Hi, great work! Can you make a video about the first Transformer paper, "Attention Is All You Need"?
I haven't caught up on those things, and I think others will appreciate it too.
Thank you for the suggestion! I recommend watching Yannic Kilcher's "Attention Is All You Need" video on YouTube in the meantime! That video and the blog post "The Illustrated Transformer" helped a lot with my understanding of it!
Yes! I would like to suggest the same thing! I watched the Yannic Kilcher one before, but I would really like to see a video focused on attention per se. Thank you!
Just breathe while speaking!
When a rapper starts learning NLP and Machine Learning
😂😂
😂😂😂
can't stop laughing😂😂😂
Neat explanation. After going through the paper, this video is great for a quick run-through.
Thanks for the timestamps. Nice explanation overall.
Nicely explained
if you play this video at double speed you can smell your brain cooking a little
I don't know why people are complaining. I am not a native speaker and for me your rate of speaking is just fine.
Hi. Nice work, but you are talking waaaaay too fast. Slow down
Problems with your video: you speak too fast relative to the changing slides and the text on them, which is ineffective when creating tutorials. You assume the viewers already know too much, so you throw around words like "auto-regressive" without bothering to explain what they mean. Perhaps you should make videos about a focused sub-topic, because otherwise this type of video isn't of much utility to people.
agree
Can a student apply BERT to their project work?
Why the rush?
Liked the video a lot... I have subscribed to your channel. Please upload more videos!
Good work. Please slow down next time!
Who is chasing you? Super fast!
I have a question, if anyone can help: if I input into BERT (or any transformer) a paragraph that contains the name of a disease or a gene, for example, how can it detect that this is a disease? And does it replace it with a tag, for example?
Second question: is there a way to add those identified tags into a matrix, for example, so I could focus on them while applying attention?
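Not from the video, but for context: detecting disease or gene names is usually framed as named-entity recognition (NER), i.e., BERT fine-tuned with a token-classification head on labeled biomedical text (BioBERT-style models). The model labels entity spans; replacing them with tags is a post-processing step you do yourself. A minimal sketch with the Hugging Face transformers pipeline; the checkpoint name is an assumption, any biomedical token-classification model would do:

from transformers import pipeline

# Assumed biomedical NER checkpoint; swap in any token-classification model.
ner = pipeline("token-classification",
               model="d4data/biomedical-ner-all",
               aggregation_strategy="simple")   # merges word pieces into whole entities

text = "Mutations in the BRCA1 gene increase the risk of breast cancer."
for entity in ner(text):
    print(entity["word"], entity["entity_group"], round(float(entity["score"]), 3))

For the second question, one possible approach is to use the predicted entity spans to build an extra attention mask or feature matrix so that downstream layers attend more strongly to the tagged positions, though that requires modifying the model rather than using it off the shelf.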
Could I extract word embeddings from BERT and use them for unsupervised learning, e.g. topic modeling? :)
I have seen a few approaches where they run BERT and LDA separately, concatenate the vector representations (BERT + LDA), and finally train an autoencoder to learn a lower-dimensional latent-space representation.
blog.insightdatascience.com/contextual-topic-identification-4291d256a032
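To make the embedding-extraction step concrete, here is a minimal sketch with Hugging Face transformers that mean-pools BERT's last hidden states into one vector per document (the pooling strategy and model name are my assumptions, not from the post above):

import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

docs = ["Topic modeling with BERT embeddings.",
        "LDA is a classic probabilistic topic model."]

inputs = tokenizer(docs, padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state    # (batch, seq_len, 768)

# Mean-pool over real tokens only, using the attention mask to ignore padding.
mask = inputs["attention_mask"].unsqueeze(-1)     # (batch, seq_len, 1)
embeddings = (hidden * mask).sum(dim=1) / mask.sum(dim=1)   # (batch, 768)

print(embeddings.shape)   # these vectors can be concatenated with LDA vectors or clustered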
@Henry AI Labs I have a question... Is BERT good enough for malware detection?
Amazing video, thanks!
Are you Brandon Butch?
Turn the speed to 2x; it's really easy to rock.
Thank you!
Btw you're not talking too fast. If you were slower it'd become boring. There are captions and slow-downs for people who can't follow.
Sorry, but it's too fast; I can't follow along.
I'm just now learning text mining and NLP. Holy shit, I don't understand anything.
Hello BERT!!!!!!!!! Hi!!!!!!!!!!!!!!!!!!!!!
I'm watching at 1.5 speed and can understand it perfectly fine.
Too fast but great.
153 dislikes, whoa.
Slow down please