Thanks for the video! Great overview & refresher 🤗
I appreciate the calm, slow and clear voice.
Thanks for the comment. That’s very encouraging to hear!
Extremely helpful thank you very much
Thanks Tom for your nice words, that’s very encouraging!
Great video! Some points about English to help you improve:
You should say "how it looks" instead of "how does it look like". If you want to include the "like", you can say "what it looks like" instead.
Thanks, that’s a good point! I’ll try to remember that for my next video.
Great video. I would recommend it to plus-two level students, who can see how basic calculus is used in AI at a later stage. Besides, these would serve as good exercises.
Thank you for this video ❤
I want to practice all the optimizers with different activation functions, with some maths problems and in Python. Could you please suggest a good book?
ReLU, Leaky ReLU, and Swish seem to be an evolution.
The issue with ReLU was that it leaves dead weights. Then the issue with Leaky ReLU was its discontinuity. And Swish finally fixed all of them.
Are ReLU and Leaky ReLU still useful for anything?
Also, why was GELU used for language models? Why does GELU work better there than other activation functions?
Thank you for your question. Indeed, ReLU, LeakyReLU and Swish are an evolution. And it is true that ReLU suffers from dead neurons, but still, ReLU and its variants such as LeakyReLU are used in ANNs, especially in computer vision tasks like image segmentation. One advantage of ReLU is its simple and efficient computation.
As for GELU, some of its properties make it suitable for more complex tasks like NLP. For example, its non-monotonic behavior allows the network to capture more complex patterns in text data.
But having said that, the choice of activation function heavily depends on the data and the task, and one should experiment with different activation functions to find the best one for a given task.
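To make this concrete, here is a minimal NumPy sketch of the functions mentioned above, with GELU written via its common tanh approximation (just an illustration, not the exact code from the video):

```python
import numpy as np

def relu(x):
    # Zero for x < 0, which is where "dead" units can get stuck
    return np.maximum(0.0, x)

def leaky_relu(x, slope=0.01):
    # A small negative slope keeps a gradient flowing for x < 0
    return np.where(x > 0, x, slope * x)

def swish(x, beta=1.0):
    # Swish / SiLU: x * sigmoid(beta * x), smooth and non-monotonic near 0
    return x / (1.0 + np.exp(-beta * x))

def gelu(x):
    # GELU via the tanh approximation commonly used in transformer models
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))

xs = np.linspace(-4.0, 4.0, 9)
for name, f in [("ReLU", relu), ("LeakyReLU", leaky_relu), ("Swish", swish), ("GELU", gelu)]:
    print(f"{name:>9}: {np.round(f(xs), 3)}")
```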
Animations are cool! Have you used ManimCE or ManimGL?
Thanks for the comment 😊 I have used ManimCE.
So far I haven't played with ManimGL, but I will check it out and see if it's worth switching.
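For anyone curious about the workflow, a minimal ManimCE scene that plots an activation curve could look roughly like this (an illustrative sketch assuming a recent ManimCE release, not the actual code behind the video):

```python
# Render with: manim -pql relu_scene.py ReLUScene
import numpy as np
from manim import BLUE, Axes, Create, Scene

class ReLUScene(Scene):
    def construct(self):
        # Axes around the origin
        axes = Axes(x_range=[-3, 3, 1], y_range=[-1, 3, 1])
        # Plot ReLU(x) = max(0, x) on those axes
        graph = axes.plot(lambda x: np.maximum(0.0, x), color=BLUE)
        self.play(Create(axes), Create(graph))
        self.wait()
```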
May I know how you make these videos?
I have my own activation function that I use; it's a Softplus-like function.
It's the integral of (1+tanh(x))/2, which looks like Sigmoid except it's faster in training.
That integral is this equation, which I call "Rectified Integral Tangent Hyperbolic", RITH for short.
It's mostly linear for x ≥ 1, which makes it fast in training:
(x+ln(cosh(x)))/2. I added the term 1/e to center it between 0 and positive infinity.
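A quick NumPy sketch of RITH as described above (assuming the 1/e mentioned is simply a constant offset added to the antiderivative):

```python
import numpy as np

def rith(x):
    # Antiderivative of (1 + tanh(x)) / 2, plus the 1/e offset from the comment
    return (x + np.log(np.cosh(x))) / 2.0 + 1.0 / np.e

xs = np.linspace(-3.0, 3.0, 7)
print(np.round(rith(xs), 3))

# Sanity check: the numerical derivative of RITH matches (1 + tanh(x)) / 2
eps = 1e-5
numeric = (rith(xs + eps) - rith(xs - eps)) / (2.0 * eps)
print(np.allclose(numeric, (1.0 + np.tanh(xs)) / 2.0))
```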
Love from India
nope
Cool
❤
Wrong!
The derivative of the ELU is a perfectly continuous function everywhere, even at 0.
ua-cam.com/video/56ZxEmGRt2k/v-deo.html
Thanks for the comment, but that depends on the value of alpha.
As I mentioned in the video, if alpha = 1, the derivative of ELU is continuous (see the plotted curve, which corresponds to alpha = 1).
But if alpha != 1, the derivative will be a discontinuous function.
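To make that concrete: for x > 0 the derivative of ELU is 1, and for x < 0 it is alpha * exp(x), which tends to alpha as x approaches 0, so the two one-sided limits only agree when alpha = 1. A quick numerical check (just a sketch, not code from the video):

```python
import math

def elu_derivative(x, alpha):
    # d/dx ELU(x): 1 for x > 0, alpha * exp(x) for x <= 0
    return 1.0 if x > 0 else alpha * math.exp(x)

eps = 1e-8
for alpha in (1.0, 0.5):
    left = elu_derivative(-eps, alpha)   # limit from the left  -> alpha
    right = elu_derivative(eps, alpha)   # limit from the right -> 1
    print(f"alpha={alpha}: left={left:.6f}, right={right:.6f}, "
          f"continuous at 0: {math.isclose(left, right, rel_tol=1e-6)}")
```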