Confused about which Transformer Architecture to Use? BERT, GPT-3, T5, ChatGPT? Encoder-Decoder Explained
- Published 3 Jun 2024
- This video explains all the major Transformer architectures and differentiates between the various important Transformer models.
Which Transformer architecture to use for a particular problem in Natural Language Understanding (NLU) and Natural Language Generation (NLG) is explained in a simplified manner.
Over the past 6 years, Transformers, a neural network architecture, have completely transformed state-of-the-art natural language processing and the way we approach problems in NLU and NLG.
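To make the three branches concrete, here is a minimal sketch (not from the video) that loads one representative model from each branch with the Hugging Face transformers library; the checkpoint names are common public releases and only illustrative:

```python
from transformers import pipeline

# Encoder branch (BERT-style): NLU tasks such as classification.
classifier = pipeline("sentiment-analysis",
                      model="distilbert-base-uncased-finetuned-sst-2-english")
print(classifier("Transformers completely changed NLP."))

# Decoder branch (GPT-style): NLG tasks such as free-form generation.
generator = pipeline("text-generation", model="gpt2")
print(generator("Transformers are", max_new_tokens=20))

# Encoder-decoder branch (T5-style): sequence-to-sequence tasks like summarization.
summarizer = pipeline("summarization", model="t5-small")
print(summarizer("Over the past six years, Transformers have completely "
                 "transformed state-of-the-art natural language processing.",
                 max_length=20, min_length=5))
```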
Chapters:
0:00 Introduction
1:21 Encoder Branch
1:57 BERT
2:37 DistilBERT
3:19 RoBERTa
3:59 XLM
4:50 XLM-RoBERTa
5:32 ALBERT
6:40 ELECTRA
7:19 DeBERTa
8:13 Decoder Branch
8:50 GPT
9:13 CTRL
9:54 GPT-2
10:31 GPT-3
11:30 GPT-Neo/GPT-J-6B
11:50 Encoder-Decoder Branch
12:00 T5
13:05 BART
13:46 M2M-100
14:22 BigBird
#datascience #neuralnetwork #machinelearning #naturallanguageprocessing
In this video, I tried to explain all the major Transformer architectures, along with the differences between them and the training objective of each one. If you feel this video adds value, please like, share, and comment, and subscribe to this channel. If you have any suggestions or feedback, please drop them in the comment box.
It would have been awesome if all the models had their release year mentioned alongside them as well. That helps to get a bird's-eye view of the timeline.
Hello. Yes, I am making a separate video on a similar topic. It will be uploaded soon. Stay tuned, my friend.
Great summary!!
thanks for the excellent, well-explained summary!
Thank you Kevin
Thanks for sharing. It's very informative. Keep up with this work.
Thank you, Santosh, for watching the video.
Very nicely explained ❤👍
Very nice and to the point video, thank you !!!
Hey thanks a lot Ajit 😃 🙏
Amazing. Great work👍
Thanks Milind
Great explanation. Thank you very much
Glad it was helpful for you Sagar...
I just found this video and it's very good. I'm currently trying to understand when to use which type of model. Looking at Hugging Face is just overwhelming. That's where this video jumps in and provides an excellent overview of the major models. I wish there were a similar video explaining the various pretraining objectives.
Hello. I will definitely make a video on the same. Thanks a lot. 😀
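In the meantime, here is a rough sketch (not from the video) of two common pretraining objectives, assuming the Hugging Face transformers library and standard public checkpoints:

```python
from transformers import pipeline

# Masked language modeling (encoder models like BERT):
# hide random tokens and train the model to predict them.
fill_mask = pipeline("fill-mask", model="bert-base-uncased")
for pred in fill_mask("Transformers are a neural network [MASK]."):
    print(pred["token_str"], round(pred["score"], 3))

# Causal language modeling (decoder models like GPT):
# train the model to predict the next token, left to right.
generator = pipeline("text-generation", model="gpt2")
print(generator("Transformers are a neural network", max_new_tokens=5))
```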
Informative content
Thanks for sharing this
Glad you liked it!
This is good. Keep up the good work. 🙂
Thank you Saket, I will
This is a really nice explanation!!!
Thanks a lot Ganesh 😃 🙏
Thanks for sharing
My pleasure
Well done!
Thanks David.
thank you sir ! Fantastic method of explanation
Hey buddy. Thanks a lot. 😀
Informative 👍
Glad it was helpful and informative for you Aditya. Please do share it with your friends. More interesting videos will be uploaded soon
Great video!
Thanks a lot. Please do share it with your friends 😁
Excellent
Thanks a lot Suhail.
Excellent video, and I joined as a sub. I like this style of going through the various architectures and their use cases. Maybe you can also update it with GPT-4 since it's new out there.
Thanks a lot for this amazing comment. I have uploaded a new video using the ChatGPT model: ua-cam.com/video/MKHEaxdoqxA/v-deo.html
Please go through it and feel free to comment
Superb 🎉
Hey thanks William
Can you create a tutorial on Longformer and the concepts/code used to adapt an LLM for larger token sizes?
Hello David. I haven't made one yet, but I will definitely make a video on Longformer and similar models, which take a whopping 4096 tokens as input. Thanks for your feedback.
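For anyone curious in the meantime, here is a minimal sketch of loading Longformer with the Hugging Face transformers library, assuming the public allenai/longformer-base-4096 checkpoint (illustrative only, not from the video):

```python
from transformers import AutoModel, AutoTokenizer

# Longformer replaces full self-attention with sparse (windowed + global)
# attention, which is what lets it scale to 4096 tokens.
tokenizer = AutoTokenizer.from_pretrained("allenai/longformer-base-4096")
model = AutoModel.from_pretrained("allenai/longformer-base-4096")

long_text = "This could be a full document. " * 500  # far past BERT's 512-token limit
inputs = tokenizer(long_text, truncation=True, max_length=4096, return_tensors="pt")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (1, sequence_length, hidden_size)
```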
thanks a lot❤
You are most welcome 😃 Do check other videos too on AI on this channel.
Hello, how do I contact/ connect with you, with regards to a project?
Hello, please contact us via our email. datafuseanalytics@gmail.com
Kudos🎉
Thank you 😃
Great summary- would be good if you did an update
Sure. I will make an updated video covering all the possible model architectures.
Thx
Most welcome 😃 😊
There are some new important ones like the newer GPT-Neo models, Alpaca, LLaMA, cereus, Vicuna.
Hello Ian. Yes, at the time of this session, these models weren't available. Thank you for your feedback. I will definitely make a part-2 video that covers these models in a simpler fashion.
It seems it does not cover BERT in computer vision.
Yes you are right Chen Peter
This sounds like it was copy-pasted from online articles and just read out, without any extra info at all.
Hey Ko-Jap. I referred to multiple books and then wrote the content in my own words. I did not refer to any online blogs or articles; only books were the reference. But thank you for your valuable feedback. I will improve so that it doesn't sound as if I am reading. 🙏😀
for the algo
Thank you
Nice overview
Hey Thanks a lot 😃