wow, one of the best highlights of activation functions on the internet. Thank you for doing this video
Awesome as always. Some points to ponder; correct me if I am wrong:
1. ReLU is not just an activation but can also be thought of as a self-regularizer, since it switches off all the neurons whose values are negative, so it's just a kind of automatic dropout.
2. A neural net with just an input and output layer, with softmax at the output layer, is logistic regression. When we add hidden layers to this network with no hidden activations, it looks more powerful than vanilla logistic regression since it is now taking linear combinations of linear combinations with different weight settings, but it still results in linear boundaries (see the sketch below).
Lastly, your contributions to the community are very valuable; they clear up a lot of nitty-gritty details in a short time. Keep going like this :)
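A minimal numpy sketch of point 2 (the layer sizes and weights here are made up): two stacked linear layers with no activation in between collapse into a single linear map, which is why the boundary stays linear.

```python
import numpy as np

# Two stacked linear layers with no activation in between
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(3, 2)), rng.normal(size=3)   # 2 inputs -> 3 hidden units
W2, b2 = rng.normal(size=(2, 3)), rng.normal(size=2)   # 3 hidden units -> 2 outputs

x = rng.normal(size=2)
two_layer = W2 @ (W1 @ x + b1) + b2

# The same mapping collapses to a single linear layer o = U x + V
U = W2 @ W1
V = W2 @ b1 + b2
one_layer = U @ x + V

print(np.allclose(two_layer, one_layer))   # True: still a linear model, linear boundary
```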
No, dropout is different. Random sets of neurons are turned off in order to cause the neurons to form redundancies, which can make the model more robust. In the case of dying ReLU, the same neurons are always dead, making them useless. Dropout is desirable and deliberate; dying ReLU is not.
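A rough numpy sketch of the difference (the sizes and the "dead" bias value are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
pre_act = rng.normal(size=5)

# Dropout: a fresh random subset of neurons is zeroed on every training pass
keep_prob = 0.8
mask = rng.random(5) < keep_prob
dropout_out = np.maximum(0, pre_act) * mask / keep_prob   # a different mask next pass

# Dying ReLU: a neuron whose pre-activation is always negative (e.g. after a very
# negative bias) outputs 0 for every input; its gradient is 0, so it never recovers
dead_bias = -100.0
dead_out = np.maximum(0, pre_act + dead_bias)             # all zeros, every time
print(dropout_out, dead_out)
```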
The screeching noise is irritating... else, nice tutorial.
I agree
agreed. cringe and irritating
I don't agree
I hate you for making those noises. I want to learn; comedy is something I would pass on.
7:48 "once it hits zero the neuron becomes useless and there is no learning" this explains so much, thank you!
Straight to the point. Nice and super clean explanation for non-linear activation functions. Thanks!
One of the best explanations ive come across
best explanation of activation functions I ever seen
I'm learning deep learning rn and using the deep learning book published by MIT Press for the same. That book is kinda complicated for me to understand, especially these parts, because I'm still an undergrad and have zero previous experience with this. Thank you for explaining this so well.
Anytime :)
Amazing! Finally I am able to visualise vanishing gradients and dying ReLU.
Glad!
Wow... the perfect and easiest way to explain it.
Everyone talks about what activations do, but nobody shows how it actually looks behind the algorithms.
And you explain things in the easiest way, which makes them so easy to understand and remember.
So a big like for all your videos.
Could you make more and more on DL? 😄
Thank you. I'm always thinking of more content :)
Better than most professors; thanks for the great video.
Thanks!!
Great explanation of activation functions; I like it so much.
Thanks so much for commenting
The thing I love most about your videos is the fun you add... learning becomes a bit easier.
With ReLU, f(x)=x is "connect" and f(x)=0 is "disconnect". A ReLU net is a switched system of dot products, if that means anything to you.
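One way to see this with numpy (made-up sizes, biases omitted): for a given input, the hidden layer's on/off pattern selects which dot product the net computes.

```python
import numpy as np

rng = np.random.default_rng(0)
W1, W2 = rng.normal(size=(4, 3)), rng.normal(size=(1, 4))   # biases omitted for brevity
x = rng.normal(size=3)

# Forward pass with ReLU
out = W2 @ np.maximum(0, W1 @ x)

# The on/off pattern of the hidden layer acts as a switch: within this region of
# input space, the whole net is just one dot product with "effective" weights
switch = (W1 @ x > 0).astype(float)
effective = (W2 * switch) @ W1
print(np.allclose(out, effective @ x))   # True
```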
Great video! And what's even better are the useful references you add in the description. (For me, (1)+(7) answered the questions I asked myself at the end of your video, so it was on point!) Thank you!
Haha. Glad the references are useful! :)
Great explanation! Had to switch to earphones though :P
Great video!
The disappointed gestures were a bit too much x'D
A question I did have as a beginner:
What does it mean for a sigmoid gradient to "squeeze" values, as in they become smaller and smaller as they backpropagate?
It means that the sigmoid function will always output a value between 0 and 1 regardless of any real number input. Notice the mathematical formula and graph of a sigmoid function for better clarity: any real number will be converted to a number between 0 and 1. Hence sigmoid is said to "squeeze" values. And because the sigmoid saturates, its gradient is at most 0.25, so every sigmoid layer shrinks the gradient flowing backwards, which is why the values get smaller and smaller during backprop.
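A quick numpy illustration of both halves of the question (the 10-layer depth is just an example):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z = np.linspace(-10, 10, 1001)
s = sigmoid(z)
print(s.min(), s.max())          # every output lands inside (0, 1)

# The local gradient sigma'(z) = sigma(z) * (1 - sigma(z)) never exceeds 0.25
print((s * (1 - s)).max())       # ~0.25, reached at z = 0

# Backprop multiplies one such factor per sigmoid layer, so across e.g. 10 layers
# the signal can shrink by up to 0.25**10
print(0.25 ** 10)                # ~1e-6
```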
Lovely intro! I am learning at the age of 58!
Wooo, did I just notice the complex explained simply? Thanks! Looking forward to more videos.
Great explanation! And the animations with the maths formulas and the visualisations are awesome!! Many thanks!
Amazing presentation, easy and captivating to grasp.
Glad you liked it! Thank you!
Thank you for sharing! This video cleared my doubts and gave me a good introduction to learn further.
Super glad :)
Beautiful explanation!
Excellent job. There is way too much "mysticism" around neural networks. This shows clearly that for a classification problem, all the neural net is doing is creating a boundary function. Of course it gets complicated in multiple dimensions, but your explanations and use of graphs are excellent.
Excellent explanation. Thank you.
Another great video
🎉🎉🎉!
Thanks so much!
great .. so many thanks ... need more explanation
Amazing explanation and also funny 😅👏👏👏
This was really helpful! Thanks!
Thanks for watching :)
What are the axes on these graphs? Are they the inputs, or input*weights + bias for the linear case?
Did you get an answer for it?
thank you very much, this is really helpful
Thanks:)
Can you cover tanh activation? (Thanks for making this one so good!)
I wonder if there is enough support that warrants a video on just tanh. Will look into it though! And thanks for the compliments :)
I discovered your page just yesterday and might I say, YOU'RE AWESOME! Thanks for such good content bro.
Thanks homie! Will dish out more soon!
Thanks for the tutorial. I found the noises very cringe.
wonderful explanation!!!
this guy is genius
Can you please explain this "No gradient means no learning"?
Really awesome video!
This was an amazing video!!! Keep up the good work!
Thanks so much!
good explanation for a beginner
What do x and y represent in the graph you use to show the cat and dog points?
THIS HELPED SO MUCH! THANK YOU!
excellent explanation
That's a great explanation
Thanks so much for watching !
Thank you very much for the great, and smooth explanation. This was really perfect.
Much appreciated Malek! Thanks for watching!
Oh man, amazing explanation. Thanks
Then why isn't leaky ReLU / ELU used everywhere in LSTMs, GRUs, Transformers...? Why is ReLU used everywhere?
wow, that was really helpful, thanks a ton!!!!
Glad to hear that. Thanks for watching!
Quality video!
Awesome vid! Small suggestion: I might check the volume levels; during the screaming at :56 it was a bit painful to my ear and possibly sounded like audio clipping.
Nicely explained
Thanks for watching this too
But we cannot use ReLU for the regression of functions where high-order derivatives matter!
In that case, we should still go with infinitely differentiable activation functions like "Tanh", right?
1:16 I couldn't see that there were different colors, so I was confused.
Also, I found the voicing of the training neural net annoying. But some people may like what others dislike, so it's up to you whether to keep voicing them.
The dude is making these videos alone; if you don't like his voice, that's on you, but he can't just change his voice.
So, we should always use leaky ReLU?
awesome video
With the graphing calculator, your explanation is insanely clear!! Thank you!!
Thanks so much for the kind comment! Glad the strategy of explaining is useful :)
So how do we know when to use ReLU or leaky ReLU? Do we just use leaky ReLU in all cases?
really helpful..thanks
I've read at least three books on ANNs so far, but it's only now, after watching this video, that I have the intuition of what exactly is going on and how activation functions break linearity!
good work bro keep it up
Will do homie
What decides the shape of the boundary?
nice video ♥
Awesome video man !
can we use linear activation with hinge loss for Linear svm for binary classification.
Great video, keep going !
@6:24 How does passing what is a straight line into the softmax function also give us a straight line? Isn't the output, and consequently the decision boundary, a sigmoid?
Or is it the output before passing it into the activation function that counts as the decision boundary?
6:45 - The line corresponds to those points in the feature space (the 2 feature values) where the sigmoid's height is 0.5.
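A tiny numpy check of this (the weights are made up): the sigmoid equals 0.5 exactly where the pre-activation w·x + b is 0, and that set of points is a straight line in the two features, even though the sigmoid itself is curved.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# made-up weights for the two features
w, b = np.array([1.5, -2.0]), 0.5

# Points on the line w1*x1 + w2*x2 + b = 0 (solve for x2)
x1 = np.linspace(-3, 3, 5)
x2 = -(w[0] * x1 + b) / w[1]
points = np.stack([x1, x2], axis=1)

# On exactly these points the sigmoid's height is 0.5, so the decision
# boundary is the straight line w.x + b = 0
print(sigmoid(points @ w + b))   # all 0.5
```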
9:03 what do you mean "most neurons are off during the forward step"?
Great video. Could you explain what U and V are equal to in this equation: o = Ux + V? And how did you come up with the decision boundary equation, and how did you determine the values of w1 and w2?
Thanks in advance
Great explanation. Just add more contrast to your color selection.
My palette is rather bland, I admit.
yes! I liked it. Keep it up.
Benefited a lot
Awesome! Glad!
What I was looking for. Thanks!
thanks my man.
You are oh so welcome
How is softmax a linear function here? Shouldn't it be non linear?
Amazing!!!!!!!!!!!!!!!!!
Thanks!!!!!!!!!!
Amazing!
it should be possible to let (part of) the net optimize its own activation function no?
Great video and great page :) Which software do you use to make these videos?
Thanks! I use Camtasia Studio for the editing; Photoshop and draw.io for the images.
Thanks mate
gold, gold, gold.
Good explanation, but the noises are a little bit annoying. Thank you, bro.
Thanks man)
good job thank you
Very welcome!
I've been trying to make a convolutional autoencoder for MNIST, and at first I used sigmoid activation on the convolutional part and it couldn't make anything better than just a black screen on the output, but when I removed all activation functions it worked well. Does anyone have any idea why that happened?
Are the outputs properly scaled back to pixel values after being squeezed by sigmoid?
@@fatgnome Yes. Otherwise the output wouldn't match with images. Also I checked model.summary() every time I made changes to the model.
this is helpful, thanks :)
awesome
Swish: activation function. Swift: programming language. More homework, fewer sound effects 😀
Nice catch. I misspoke :)
What's the +1 node on each layer?
The bias term
thanks brah
learn Activation Functions with Dora
but honestly, it is good
I still don't understand what an activation function is
Perfect explanation!... Thanks
Much appreciated!
Plot twist: it's not that the boundary no longer changes; the vanishing gradient causes the gradient to be so small that we can assume it is negligible.
Danana nanana nanana nana
thx. subscribed
followeeeed
Replieeeeed. Thanks!
gr8 exp
Thank you. Really appreciate it
I don't understand..
Bro, the video is probably good, but a lot of people won't click just from seeing your pic on the thumbnail. I'm here just to let you know this: avoid putting your face on the thumbnail or in the video, as no one is interested in seeing the educator while watching technical videos.
You clicked. That's all i care about ;)
Wtf what's the sound of pictures...