EM Algorithm : Data Science Concepts

  • Published 24 Dec 2024

COMMENTS • 147

  • @aaroojyashin3790
    @aaroojyashin3790 11 months ago +38

    This could be by far the best explanation I have seen for the EM algorithm. The way you have connected the intuition to the mathematical explanation is so, so commendable!!!! Thank you so much for your efforts

    • @ritvikmath
      @ritvikmath  11 months ago +1

      Glad it was helpful!

    • @chainetravail2439
      @chainetravail2439 4 months ago

      @@ritvikmath I confirm, thank you for helping lost students

  • @nnyabavictoria6332
    @nnyabavictoria6332 2 years ago +5

    Thank you Ritvik for simplifying the EM algorithm like this. This is the best video I have seen so far.

  • @anuraganand9675
    @anuraganand9675 2 years ago +72

    The world needs to see this. Thanks Ritvik, I honestly have utmost respect and love for the amount of hard work you put in your videos. Cheers :)

    • @e555t66
      @e555t66 1 year ago +2

      It would take a lot of time to develop these intuitions on your own.

  • @nawfalguefrachi2071
    @nawfalguefrachi2071 2 years ago +2

    Thank you ritvik, the best videos are on this channel.
    Very interesting way of teaching, thank you from TUNISIA

  • @rachelhobbs6189
    @rachelhobbs6189 2 years ago +16

    Your channel and way of teaching is so amazing!! Very inviting, inclusive, and friendly. Thank you so much for such good vibes 💕

  • @alizarean5080
    @alizarean5080 11 months ago +2

    I have an exam tomorrow and this video was the thing I needed. I can't thank you enough dude.

  • @rishi71095
    @rishi71095 1 year ago +2

    It would take me two more lives to be able to explain it this well to someone, kudos! Great job buddy!

  • @vantongerent
    @vantongerent 2 years ago +6

    YES! I have a quiz on this NEXT WEEK!

  • @andrashorvath2411
    @andrashorvath2411 1 year ago +2

    Awesome explanation. I'd like to extend yours with my intuition regarding the E-Step: the first term p(x|m0) shows the probability of x happening for the chosen m0, and the second term, the log-likelihood, shows the probability of x happening for the computed m, and we want to maximize both, because we want a choice with high probability from every aspect. That's why we multiply them together: the multiplication weights between them. If one of them is small, the result will be small; it can be high only if both are high. (A worked form of this quantity is sketched after this thread.)

    • @ritvikmath
      @ritvikmath  1 year ago +1

      thanks for the additional inputs!
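
An editor's note on the thread above: a worked form of the quantity being described, in the notation the comments use (μ₀ the current guess, observed data 1 and 2 with one missing value x; these symbols are assumptions reconstructed from the comments, not a transcript of the video), is

    Q(\mu \mid \mu_0) = \int p(x \mid \mu_0)\, \log L(1, 2, x \mid \mu)\, dx,
    \qquad \mu_{t+1} = \arg\max_{\mu} Q(\mu \mid \mu_t).

The density p(x | μ₀) weights each candidate x by how plausible it is under the current guess, and the log-likelihood scores the full data under the new μ, which matches the "it can be high only if both are high" reading above.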

  • @albertonieto2557
    @albertonieto2557 2 years ago +1

    Brilliant explanation. I especially appreciate you first providing the intuition of the method in the verbal explanation of the E and M steps. I struggled with seeing the math first in other lectures until seeing your video. Thanks for posting this.

  • @adrian-mu3jr
    @adrian-mu3jr 2 years ago +5

    That's a really great way to look at EM. I'm an engineering graduate but new to ML, and the workup explanation before dropping into the maths is excellent. Thanks

  • @aniketsakpal4969
    @aniketsakpal4969 1 year ago +2

    Incredible explanation! Was trying to understand the intuition behind EM for a long time! Thanks for the video! Keep Going!!

  • @navyadewan8680
    @navyadewan8680 8 months ago

    your understanding and explanation of such a complicated concept is impeccable

  • @aidanheffernan652
    @aidanheffernan652 2 years ago +1

    It's 4am and I saw this video and had to watch....... really great explanation bro.....you're a natural teacher.....thanks for this......subscribed

  • @peterhopkinson3040
    @peterhopkinson3040 2 years ago +1

    Your videos are unreal, simple explanations of complex problems; it's insane.

  • @michalistsatsaronis8728
    @michalistsatsaronis8728 2 years ago +10

    Thanks! You explained such a complicated subject so clearly!!!!

  • @simeonvince8013
    @simeonvince8013 1 year ago +5

    Thank you for the high-quality content that you have produced over the past few years. Most of the time, it really did help me get the intuition and understanding of what was going on with the theoretical concepts I was seeing in my courses.
    Once again, thank you !

  • @louighi91
    @louighi91 2 years ago +1

    Holy, I can't believe how good this video was :) thank you so much

  • @H1K8T95
    @H1K8T95 2 years ago +2

    Ngl my favorite rapper-turned-algorithm

  • @sakshamsingh2005
    @sakshamsingh2005 7 months ago +1

    By far the best explanation, amazing.

  • @abhishekganapure6456
    @abhishekganapure6456 7 months ago

    The only explanation you need for understanding the EM algorithm, proper chad explanation!

  • @gergerger53
    @gergerger53 2 years ago +1

    I had a jolt of excitement when I saw you had decided to do a video on this topic. It's something I've had to revisit time and time again, always understanding the intuition, but always getting lost in the formulas. Your post did a great job at helping to explain the intuition. I did struggle a bit with your non-conventional likelihood notation, though. That did throw me off a little bit but I understand why you had to have it that way and quickly adjusted. The care you took in explaining why there is mu and mu0 just shows why you are a fantastic teacher.

  • @chaurasiaansh6059
    @chaurasiaansh6059 4 months ago

    this is the best lecture for em algo

  • @scottzgoldin2
    @scottzgoldin2 1 month ago

    Wow, thank you for your work here. I finally feel confident in a subject in my master's classes and that means the world

  • @sharmilakarumuri6050
    @sharmilakarumuri6050 2 years ago +3

    Very well explained

  • @fotiskamanis8592
    @fotiskamanis8592 1 year ago

    Thank you! By far the best channel for providing clear explanations to fairly complex problems.

  • @tecnom7133
    @tecnom7133 8 months ago +2

    The best. Thanks man

  • @TanmayGhonge
    @TanmayGhonge 1 year ago +1

    Broke down the most complicated algorithm in the simplest terms. Wow!

  • @pouce902
    @pouce902 1 year ago

    You are just amazing! What would be super useful would be an EM video based on your "Maximum likelihood" one.

  • @MegaNightdude
    @MegaNightdude 2 years ago +3

    Great video!

  • @paramjeetsingh3444
    @paramjeetsingh3444 2 years ago

    Can't express how happy I am to see your videos. Thanks a lot!

  • @bassoonatic777
    @bassoonatic777 2 years ago +1

    Much better explanation than what I normally see. I would also be interested in seeing you go through the proof.

  • @fabianomenezes5892
    @fabianomenezes5892 2 years ago

    What an amazing channel, honestly

  • @sasakevin3263
    @sasakevin3263 1 year ago

    Awesome! Best explanation of the EM algorithm for a beginner!

  • @mohammed2250
    @mohammed2250 2 years ago

    Yes, please go on with the proof, that will be an interesting topic. I went over Andrew Ng's video a couple of times, but I couldn't understand it better than here!! You're a rock star in simplifying complex concepts!!

  • @ericwilson8665
    @ericwilson8665 1 year ago

    Absolutely fantastic. I agree w/ other comments... The DS world needs to see this. Thank you.

  • @MN-zs8lc
    @MN-zs8lc 7 months ago

    Although there is more to fully understand, I was able to grasp the concept because of your video!

  • @christophb9616
    @christophb9616 2 years ago +5

    Thanks for the very clear explanation! A follow up video on how the EM algorithm can be used in gaussian mixture models or bayesian networks would be awesome!

  • @commentor93
    @commentor93 2 years ago

    Thanks for your explanation.
    I think my main mental knot was wondering why you alter N instead of looking for the best guess for x to pin down the unknown value. Realizing that x doesn't change, and that the power of the algorithm lies in finding the optimal solution without caring about the actual value of x, was what I needed for it all to make sense.

  • @evgenyivanko9680
    @evgenyivanko9680 11 months ago

    Ritvik, you are doing a great job, thanks

  • @nuamaaniqbal6373
    @nuamaaniqbal6373 2 years ago

    I thank GOD I found your channel. A big thanks to YouTube and to you!!

  • @hangsu5724
    @hangsu5724 17 days ago

    A real master can explain the most complex problem in an understandable way.

  • @sgklee2664
    @sgklee2664 2 years ago

    This explanation is amazing for getting the concept

  • @thetrainoflife8327
    @thetrainoflife8327 1 month ago

    Genius man, genius. Wonderful explanation!!

  • @juliachu4685
    @juliachu4685 2 years ago

    THANK YOU. You're literally saving my ML undergraduate course

  • @sajedehaghababaei5300
    @sajedehaghababaei5300 10 months ago

    Thanks a lot! That is a great explanation!!! I was struggling with EM for a long time!! :))
    I'd be grateful if you also talked about the proof of convergence!

  • @sahilagarwal1871
    @sahilagarwal1871 2 years ago +2

    Would love to see a proof video! Keep up the great work!

  • @awangsuryawan7320
    @awangsuryawan7320 2 years ago +3

    Can't wait to see the proof

  • @timchang5676
    @timchang5676 1 month ago +1

    yeah this guy is the fucking goat

  • @biswajit-k
    @biswajit-k 1 year ago

    Got this crystal clear. Thanks a lot!

  • @tb1131
    @tb1131 1 year ago

    Your explanations are soooo clear! really appreciate the effort you put into your videos. Thank youu!!

  • @mihirsalot
    @mihirsalot 1 month ago

    Loved it. Thanks for the efforts.

  • @Tyokok
    @Tyokok 2 years ago

    Thanks for the great lecture. One question if I may: at 2:20, why do you put the best guess 1 here instead of a random draw from your known distribution?

  • @srinivasbringu67
    @srinivasbringu67 2 years ago

    Lovely, that's very intuitive. Thank you so much.

  • @hannahnelson4569
    @hannahnelson4569 4 months ago

    Very cool! Thank you for teaching!

  • @nikosp3903
    @nikosp3903 7 months ago

    Amazing explanation!

  • @sairaamvenkatraman5998
    @sairaamvenkatraman5998 11 months ago

    Could you please do the derivation or intuition for EM for clustering? I observe that it is described in many textbooks, but not in such a cool way. 😅

  • @yaochung-chen
    @yaochung-chen 1 year ago

    Really nice explanation! Thank you!

  • @eceserin
    @eceserin 2 years ago +2

    Example with Python coming anytime?

  • @ehsanmon
    @ehsanmon 2 years ago +4

    Thanks so much for this great explanation! I would definitely be interested in the proof. It would be great if you could do a video on Gaussian mixture models as well and how they are solved using the EM algorithm.

  • @yulinliu850
    @yulinliu850 2 years ago +3

    Excellent. Thank you so much! 👍

  • @QuocHuyPham-s4n
    @QuocHuyPham-s4n 10 months ago

    Excellent explanation!!

  • @bbcailiang7
    @bbcailiang7 11 months ago

    Excellent explanations!

  • @vijayrajan5792
    @vijayrajan5792 2 years ago

    Very compelling ... Brilliant

  • @Martyr022
    @Martyr022 2 years ago +5

    I'm interested in the proof!

  • @JanSchmid-d4k
    @JanSchmid-d4k 1 year ago

    Thank you for all the work you put in your videos to make lives like mine easier. Cheers man!

  • @pallavibothra9671
    @pallavibothra9671 2 years ago +3

    Hi Ritvik, thank you very much for the awesome videos. Could you please make some videos on SQL?

    • @ritvikmath
      @ritvikmath  2 years ago +2

      thanks! and please check out my full SQL playlist here:
      ua-cam.com/play/PLvcbYUQ5t0UFAZGthysGOAtZqLl3otZ-k.html

    • @pallavibothra9671
      @pallavibothra9671 2 years ago

      @@ritvikmath Awesome! Thanks a lot.. Could you please add SQL with window functions to the playlist, if possible?

  • @kulbhushansingh1101
    @kulbhushansingh1101 1 year ago

    great man, ultra great

  • @taxtr4535
    @taxtr4535 4 months ago

    You are a gem

  • @bradleyadjileye1202
    @bradleyadjileye1202 9 months ago +1

    Amazing, thank you for that !

  • @dadimanoj9051
    @dadimanoj9051 1 year ago

    Thank you for explaining very clearly

  • @sharmilakarumuri6050
    @sharmilakarumuri6050 2 years ago +4

    Can you do the proof too please

  • @dannyzilberg3101
    @dannyzilberg3101 9 months ago

    Thank you so much for these videos!
    One question: how do you estimate and maximize the integral in practice? That was the elephant in the room for me...

  • @moravskyvrabec
    @moravskyvrabec 1 year ago

    Wonderful. Definitely helps my understanding. When I find time I want to see what you're doing w/ the stock predictions. If I remember lectures from business school, you should not be able to "generate alpha" unless you possess information the market does not. In this case you could say you've found some new idea that has real predictive value, but either they will a) already have found this and put much more compute + their proximity to the actual place where the trades happen towards getting the answer first and beating you to the trades, or b) didn't know it before but will immediately steal it, and then see a) haha. But hey, I'll still watch to see what you've got going on.

  • @tungbme
    @tungbme 2 years ago

    Thank you so much for your explanation, helps me a lot

  • @davidforman4080
    @davidforman4080 1 year ago

    So clear -- wow!

  • @Darkev77
    @Darkev77 2 years ago +6

    I'm very confused as to why we don't just maximize the log-likelihood of all the current observed data given some mu?

    • @michaurbanski5961
      @michaurbanski5961 2 years ago

      I think it applies not only to estimation of mu, but to any arbitrary parameter. Then it would not be as simple as taking the average of all observed data. I could be wrong though :P

    • @ritvikmath
      @ritvikmath  2 years ago +5

      Love this question, thanks for asking. Indeed with this toy example the EM algorithm is overkill and it was mostly meant for instructive purposes. Of course, when we use things for instructive purposes we can miss the more interesting applications. Thinking about this, an interesting variation would be what if you have the data [1,2,x,y] drawn from a N(0,sigma) where now it's the standard deviation sigma as well as two missing values you want to predict. This is interesting because it's important to consider the values of the non-missing data *and* the potential values of the missing data which are consistent with some estimate for sigma (since standard deviation is inherently a measure between data points). A minimal numerical sketch of this variation follows this thread.
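
An editor's sketch of the variation described in the reply above, under the assumption that the mean is known to be 0, so the E-step expectation of each missing squared value is just the current variance estimate. This is illustrative Python derived from that description, not code from the video:

    # Data [1, 2, x, y] drawn from N(0, sigma); x and y missing, sigma unknown.
    observed = [1.0, 2.0]
    n_missing = 2
    n_total = len(observed) + n_missing

    sigma2 = 1.0  # initial guess for the variance
    for _ in range(30):
        # E-step: with mean 0, E[x^2 | current sigma2] = sigma2 per missing point
        expected_sum_sq = sum(v ** 2 for v in observed) + n_missing * sigma2
        # M-step: maximizer of the expected complete-data log-likelihood
        sigma2 = expected_sum_sq / n_total

    print(sigma2)  # approaches 2.5, the fixed point of sigma2 = (5 + 2*sigma2)/4

Each iteration moves the variance toward the value at which the observed points and the imputed contribution of the missing points agree, which is exactly the interplay the reply highlights.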

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w 2 years ago

    This is explained so well

  • @nitishyamsukha
    @nitishyamsukha 3 months ago

    very good explanation!

  • @kevinshao9148
    @kevinshao9148 1 year ago

    Thanks for the great video! One question: if you have (1+2+x)/3 = x, then you have a closed-form solution, so why do you still need a numerical approach? (Worked out in the note below.)
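
For readers following along, the closed form this comment refers to does exist in the toy example (an editor's note; the fixed-point equation is taken from the comment itself):

    (1 + 2 + x) / 3 = x  ⟹  3 + x = 3x  ⟹  x = 3/2.

So here the fixed point can be solved directly; the iterative E/M loop earns its keep in problems where the M-step argmax has no such closed form.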

  • @deepshikhaagarwal4125
    @deepshikhaagarwal4125 1 year ago

    Thanks so much Ritvik! Your videos are amazing... do you have a playlist for machine learning to connect the dots in ML concepts? I see a playlist for data science but not for machine learning.
    Thanks

  • @michaelrainer7487
    @michaelrainer7487 2 years ago

    On step 2, what does the dx do at the end of that equation?

  • @isciaberger5017
    @isciaberger5017 1 year ago

    this helped so much, thank you a lot!!

  • @nipa161991
    @nipa161991 1 year ago

    Thanks for the video! What was not clear to me is whether we calculate E(LL|M) for all Ms so that we can then take the argmax in step 3?

  • @sneggo
    @sneggo 2 years ago

    Great video! Thank you so much

  • @Sushanta-Das
    @Sushanta-Das 8 months ago

    Sir, I know that in the E-step we estimate the unknown x, but you are calculating the likelihood. How are these connected?

  • @grjesus9979
    @grjesus9979 1 year ago

    What if x is high dimensional? How would the integral change?

  • @skanoun
    @skanoun 6 months ago

    Great teacher❤

  • @Andres186000
    @Andres186000 2 years ago +3

    Nice to see the theorem guaranteeing convergence for sequences that are increasing and bounded being used to prove this. I do have a more pragmatic question which is how somebody would go about finding the argmax in the M step. Would gradient descent be used on the expectation of log-likelihood function (I would imagine in this case the expectation of log-likelihood would have to be convex for this to work) to find the argmax?

    • @Michael-vs1mw
      @Michael-vs1mw 2 years ago +2

      Yep, you can use any optimization method. For Gaussian mixture models there are explicit formulas for the M step which are obtained in the usual way by setting the gradient of the expected log-likelihood to zero. (The one-dimensional versions of those formulas are sketched after this thread.)
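
A sketch of the explicit one-dimensional Gaussian-mixture M-step formulas the reply above mentions, stated here by the editor as standard results rather than formulas taken from the video, with r_{ik} = p(z_i = k | x_i) the E-step responsibilities:

    \pi_k = \frac{1}{N} \sum_{i=1}^{N} r_{ik}, \qquad
    \mu_k = \frac{\sum_i r_{ik} x_i}{\sum_i r_{ik}}, \qquad
    \sigma_k^2 = \frac{\sum_i r_{ik} (x_i - \mu_k)^2}{\sum_i r_{ik}}.

Each follows from setting the gradient of the expected log-likelihood to zero, so no numerical optimizer is needed for this model.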

  • @Thefare1234
    @Thefare1234 1 year ago

    Great explanation. However, the way you have written it, there is no difference between the likelihood function and the probability function. I think for clarity you should swap x,1,2 and \mu. Also, you should use ; instead of | so that the likelihood function is not confused with a conditional probability.

  • @EW-mb1ih
    @EW-mb1ih 2 years ago

    Is the EM algorithm the best algorithm to use in some specific problem (compared for example to the gradient descent algorithm)?

  • @xiaogangcho6261
    @xiaogangcho6261 2 years ago

    Thanks for the easy-to-understand intuition of the EM algorithm. Could you explain the coin-flip example along with your formulation steps ② and ③?

  • @shivambindal4809
    @shivambindal4809 2 years ago

    The expression for the expectation seems similar to Bayesian theory, where we have a prior belief (P(x|u)) and a likelihood, and we multiply both to get the posterior. Is this the same concept?

  • @neelabhchoudhary2063
    @neelabhchoudhary2063 9 months ago

    oh my god. this was so helpful

  • @apresunreve
    @apresunreve 2 years ago

    amazing, thanks for such a clear explanation :)

  • @akshiwakoti7851
    @akshiwakoti7851 2 years ago

    Great videos. Got it in one go! Could you do Gaussian Mixture Models? Thanks.

  • @ehsanmon
    @ehsanmon 2 years ago

    How would the problem change if we didn't know the variance either?

  • @RahulSingh-be2mg
    @RahulSingh-be2mg 18 days ago

    Incredible

  • @dougcree6486
    @dougcree6486 2 years ago

    A worked example of the final process would be invaluable.

  • @rachidwatcher5860
    @rachidwatcher5860 2 years ago

    Thank you Ritvik for your explanation! Does it work only for the normal distribution, or can we apply it to other kinds of distributions?