How did he manage to explain something that a 1-hr lecture couldn't! Thanks mate
Dude... I lost count of the videos I watched to understand this, but finally, after seeing your video, the struggle ended. Thank you so much!
'Clearly Explained' - and it actually was. Thanks man
:D :D
One of the best explanations I've ever seen!
Thanks mate! Keep supporting...
This was a very clear explanation indeed. Thank you!
You're very welcome!
Been struggling to grasp this topic but I finally hit that Eureka moment with this video. Thank you so much!
HUGE thanks for perfectly delivering the whole concept in one video bro!!
9:37 You made the conclusion based on P(X=[0,2] | Y); I think the correct way is to calculate P(Y | X=[0,2]). If P(Y=1) is very small, the answer can be Y=0.
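A minimal sketch of this point (the numbers here are invented, not from the video): the class with the larger likelihood P(X|Y) can still lose once a small prior P(Y=1) is multiplied in.

```python
# Hypothetical numbers: the likelihood favors Y=1, but Y=1 is rare.
p_x_given_y = {0: 0.10, 1: 0.60}  # assumed P(X=[0,2] | Y=y)
p_y = {0: 0.99, 1: 0.01}          # assumed prior P(Y=y)

# Unnormalized posterior: P(Y=y | X) is proportional to P(X | Y=y) * P(Y=y)
scores = {y: p_x_given_y[y] * p_y[y] for y in (0, 1)}
evidence = sum(scores.values())
posteriors = {y: s / evidence for y, s in scores.items()}

print(posteriors)                           # {0: ~0.943, 1: ~0.057}
print(max(posteriors, key=posteriors.get))  # 0, despite P(X|Y=1) being larger
```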
In the last part of the video you said we can fit a known distribution to a continuous set of data. However, you then wrote that the probabilities can be calculated by taking the product of the pdf evaluated at different values of the feature and label. The pdf does not provide probabilities, however; it needs to be integrated to give the probability of an event. This part of the video seems imprecise.
However, the video in general was great. Thanks.
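For what it's worth, comparing the density products across classes is still valid for classification: the classifier only needs the argmax, and any common interval width cancels out. A minimal sketch, assuming Gaussian fits with invented parameters:

```python
from scipy.stats import norm

# Assumed per-class Gaussian fits for one continuous feature (illustrative only).
params = {0: (2.0, 1.0), 1: (5.0, 1.5)}  # class -> (mean, std)
priors = {0: 0.5, 1: 0.5}
x = 4.2

# The pdf value is a density, not a probability, but pdf(x) * prior is a
# valid relative score for picking the most probable class.
scores = {y: norm.pdf(x, mu, sd) * priors[y] for y, (mu, sd) in params.items()}
print(max(scores, key=scores.get))
```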
One of the best explanations of this topic. Thanks man!
Thank you very much for the video. Clearly explained indeed; the only part I couldn't get completely was the discretization.
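In case it helps, discretization just means cutting a continuous feature into bins and then counting bin frequencies per class, exactly like a discrete feature. A minimal sketch with made-up data (the bin edges are arbitrary):

```python
import numpy as np

heights = np.array([150.0, 162.5, 171.0, 168.3, 185.2, 177.8])  # made-up values
edges = np.array([160.0, 170.0, 180.0])  # arbitrary bin edges

# np.digitize maps each value to a bin index: 0 -> (<160), 1 -> [160, 170), ...
binned = np.digitize(heights, edges)
print(binned)  # [0 1 2 1 3 2]
# From here, P(bin | class) is estimated by simple counting, as in the
# discrete-feature case.
```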
The explanation is so cool! But it would be even cooler if you added some examples with continuous features and fitting a distribution; this part wasn't so clear...
I just love this explanation
Good work with the visuals!!
Very clearly explained, thank you!
Very nice explanation and perfect illustrations!!
You did good my friend. I'm glad I came across this video
In the last part, at minute 11: what is the function f used to fit a known distribution? Thank you for answering!
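I can't speak for the video, but the standard choice is a Gaussian: f is the normal pdf, with the mean and variance estimated per class (and per feature) from the training data. A sketch with invented values:

```python
import numpy as np
from scipy.stats import norm

# Invented values of one continuous feature, restricted to one class.
x_class0 = np.array([1.8, 2.1, 2.6, 1.9, 2.4])

mu, sigma = x_class0.mean(), x_class0.std()  # maximum-likelihood Gaussian fit
f = lambda x: norm.pdf(x, mu, sigma)         # the fitted "known distribution"

print(f(2.0))  # density (not probability) of x = 2.0 under this class's fit
```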
If I search for any ML algorithm, I first check your channel to see if you have created a video on it... You are my first preference for ML/DL algorithm explanations. Just a request: please make videos on deep learning algorithms too, like CNN, RNN & LSTM, "from scratch". It will really help people who want to become practitioners in AI, like me.
Thank you so much ❤
Writing CNNs and RNNs from scratch is pretty hectic... maybe some day I'll try.
@@NormalizedNerd Waiting... you are our only hope who can teach us the mathematics of ML with cool animations; that's why I requested you! Thanks.
LOVED IT!!!
Awesome Explanation! Can't thank you enough...
Nicely explained!
Great explanation :)
bro, best explanation I could find
Thanks bro :)
It was clearly explained, as mentioned in the title. Thanks a bunch!!!
Best Explanation
Very clear explanation!
Very nice explanation, thank you so much
great explanation
Really good work, congrats
Thanks man!
Nice video! Thank you.
This is really well explained.
Thank you so much man!!
Sir, please make more lectures.
I am watching your lectures again after two days.
Please make some advanced NLP, CV, or AI lectures. Thanks!
I will try my best to upload more frequently.
That was great! I'm really glad that I found your channel. Thanks a lot 👍👍
It was like a revision for class 12 probability 😁😁
Yeah, simple yet effective concept.
I appreciate your work
Thanks a lot!
Yes, this was actually well explained. Thank you :)
Thank you very much for your work! Nice explanation!
You are welcome!
Maybe I'm wrong, but I think the hypothesis is not that X1 and X2 are independent but that X1 and X2 are conditionally independent. It was very clear otherwise, thank you!
In naive Bayes every feature is treated as an independent feature; that's why it's called naive.
I think the hypothesis is that you assume each feature to be (w.r.t. other features)
1) globally independent (in the global sample space)
2) conditionally independent w.r.t. the occurrence of each class label (under the subset sample space where the particular class event has occurred)
If these assumptions are not met, then it does not seem possible to build the mathematics, because as far as I can see,
if events A and B are independent, that does not naturally imply conditional independence between events (A|C) and (B|C).
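A quick numeric check of that last claim, using the standard XOR counterexample (not something from the video): A and B are independent fair coins and C = A XOR B, so A and B are marginally independent but completely dependent given C.

```python
from itertools import product

# Joint distribution of two independent fair coins A, B and C = A XOR B.
joint = {(a, b, a ^ b): 0.25 for a, b in product((0, 1), repeat=2)}

def p(cond):
    """Probability of the event described by cond(a, b, c)."""
    return sum(v for k, v in joint.items() if cond(*k))

# Marginal independence: P(A=1, B=1) equals P(A=1) * P(B=1).
print(p(lambda a, b, c: a == 1 and b == 1))                   # 0.25
print(p(lambda a, b, c: a == 1) * p(lambda a, b, c: b == 1))  # 0.25

# But given C = 0, knowing A pins down B exactly:
pc0 = p(lambda a, b, c: c == 0)
print(p(lambda a, b, c: a == b == 1 and c == 0) / pc0)        # 0.5
print((p(lambda a, b, c: a == 1 and c == 0) / pc0)
      * (p(lambda a, b, c: b == 1 and c == 0) / pc0))         # 0.25, not 0.5
```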
10:51 How does this work? Wouldn't the probability that Xi = xi be zero, given we're using a continuous distribution? Because of the "=" sign.
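That's a fair concern; the usual resolution (standard reasoning for Gaussian naive Bayes, not a quote from the video) is that the density stands in for a vanishing interval probability, and the interval width cancels when comparing classes:

```latex
P(x \le X_i \le x + \epsilon \mid Y = y) \approx f(x \mid y)\,\epsilon
\;\Rightarrow\;
\arg\max_y P(y) \prod_i f(x_i \mid y)\,\epsilon^n
= \arg\max_y P(y) \prod_i f(x_i \mid y)
```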
Great explanation.
Glad it was helpful!
Amazing video. thanks.
very helpful🥺🥺
hey, thanks man, very clear explanation.😀😀
Thank you!
Well explained, a quick revision of Naive Bayes. I forgot why it was called naive until I watched this video 😂😂
Thanks! Haha.
Sir, is it like this:
The Bayesian classifier deals with conditional probability?
The naive Bayes classifier deals with joint probability?
Thanks in advance...
Yeah!
Awesome! Thank you.
Well done👊👊
Great video man, great!
Here is a sub.
Love this
I love this explanation 😍🥰😘
Thanks a lot ❤
love this!
I wish you were my professor
well done
Thank u it was great.
HELPFUL!!!!
you saved me
truly amazing
Thanks!
so goood
thumbs up
thanks
And how about Gaussian NB?
You areee amazing. I love your Indian Bengali accent (just a guess hehe, make me a voice analyzer if I am right XD)
3b1b's bro is here
Haha :3
Nice ⭐⭐⭐⭐⭐
10:37
liked that
Hello people from the future! :D
independent moment
just quit confusing people
Ok, I've given up on the video after 45 secs. You said "stated clearly"; if you hadn't, I'd have kept watching.
You point to an array of features called X. What are they? Are they features of the array itself (its size / rank / dimension?), are they features of the thing the array is describing (measurements in a house?), or a list of possible attributes (the ingredients on a pizza?)? Then you introduce a label. So what, is this like a Python dictionary?
Plus, I've no idea what sort of issue we're supposed to be tackling. Is it probability? Is it rationality with limited knowledge? I only guess that because I've heard of Bayes before.
Instead you launch into calculations when I don't have the first idea what you're calculating. Why would I listen to that?
Tell you what, I'll give it another 30 secs. If there's no illustrative example / clear explanation of what the hell you're covering, I'm gone.
Nope, 30 secs later and it's absolute horseshit.
Great explanation