Expected Values for Continuous Variables!!!

Поділитися
Вставка
  • Опубліковано 26 гру 2024

КОМЕНТАРІ • 191

  • @statquest
    @statquest  2 роки тому +4

    Support StatQuest by buying my book The StatQuest Illustrated Guide to Machine Learning or a Study Guide or Merch!!! statquest.org/statquest-store/

  • @iroikoarakide8990
    @iroikoarakide8990 11 місяців тому +10

    Why do I have to attend college classes when you explain these concepts so thoroughly and in a third of the time? Great video as always man you are a Godsend

  • @theburtmacklin9615
    @theburtmacklin9615 3 роки тому +47

    Amazing videos; if you only knew the way in which your channel had changed things for me… what started as a “hey this video looks cool” has largely been responsible for my wife enrolling into a data science postgrad program in the University of Waterloo.

    • @statquest
      @statquest  3 роки тому +10

      That's amazing! Wish your wife good luck for me and I hope it goes well. It's very exciting! BAM! :)

    • @Thedavidk
      @Thedavidk 6 місяців тому

      @@statquest I did but the integral stuff was a bridge to far for me 🙂

  • @raulsena3917
    @raulsena3917 3 роки тому +51

    Amazing videos, just wanted to thank you for all you have taught me. Now I am doing a Master's Degree in Artificial Intelligence and planned to do a Ph.D. and all thanks to your videos.

    • @statquest
      @statquest  3 роки тому +7

      Wow! That is awesome! Good luck!

    • @bogdankhamelyuk3857
      @bogdankhamelyuk3857 2 роки тому +2

      same bro. Also doing MS in AI and reminiscing some ML basics XD

  • @norukamo
    @norukamo 10 місяців тому +7

    Just discovered this channel an hour ago and now I'm binging your videos. They're soo good and easy to understand. BAM!

    • @statquest
      @statquest  10 місяців тому +1

      Thank you very much! :)

  • @ahmadatta66
    @ahmadatta66 3 роки тому +4

    I'm in the library and I did not realize my laptop speaker was on, and the video started, as always, with a song. :D
    I'm not even embarrassed. Everybody should hear your songs, they are amazing haha

  • @durgeshkshirsagar116
    @durgeshkshirsagar116 3 роки тому +6

    You sound like kids' rhymes but when you teach, teach like a pro.

  • @otavioaugustodeassuncao9858
    @otavioaugustodeassuncao9858 2 роки тому +3

    Currently i'm studying Statistics at Federal University of São Carlos (UFScar) and just wanted to thank you for all the helpful and fun content that you've been posting... Not only had helped me to understand but also has made me like Statistics even more

    • @statquest
      @statquest  2 роки тому +1

      Thank you very much! :)

  • @josherickson5446
    @josherickson5446 3 роки тому +20

    Triple bam bro! Now just need to refresh some calculus :)

  • @astar2128
    @astar2128 2 місяці тому +2

    Most of my knowledge on stats and data analysis is self taught and this channel is a godsent (infinite bam!). Now i have to learn some calculus😅. Math has always intimidated me but I want to understand the concepts.

  • @zalavadiaridham
    @zalavadiaridham 2 роки тому +6

    Amazing video josh!!! I'm waiting for the video on why we divide by n-1 when we compute the sample variance. Thank you for the very informative content that you put out.

    • @statquest
      @statquest  2 роки тому

      Thank you!

    • @hrig
      @hrig 2 роки тому +2

      yes! n-1please!!!!!!!!

    • @jaspermuller9851
      @jaspermuller9851 11 місяців тому +2

      Is there any update on the n-1?

    • @dicksang2
      @dicksang2 9 місяців тому

      I am following through and want your version of n-1 as denominator to estimate the variance by sample variance.
      Thanks!!

  • @sparshasherke250
    @sparshasherke250 Рік тому +1

    this explanation is magical :cries:

  • @MrDanituga
    @MrDanituga 3 роки тому +2

    One of the best explanations I have ever had! Congratulations on the video and on your content in general. Watch every video :D

  • @fedos
    @fedos 3 роки тому +3

    I think I've had too much to drink, because I read the notification as "Star Quest with Josh Starmer", and I was confused about your sudden shift to astronomy. I'm going to have to watch this tomorrow.

    • @Erosis
      @Erosis 3 роки тому +2

      Star Quest would make a great April Fools joke. Have any astronomy friends, Josh?

    • @statquest
      @statquest  3 роки тому +7

      Once StatQuest makes me a billionaire, I'll make spaceship called Star Quest!!! You guys can come along for the ride.

  • @stef1790
    @stef1790 3 місяці тому +1

    Thank you! Really helped with refreshing my statistics!

  • @FortranCastle
    @FortranCastle 22 дні тому

    All this integration and derivation and by the end, the expected value (quite intuitively) was the reciprocal of the rate. I mean it makes a lot of sense without needing to prove it. If you expect to see 5 people per minute, you can expect to wait 1/5 minutes between encounters on average.

    • @statquest
      @statquest  22 дні тому

      Yep! The idea wasn't to blow your mind, but to show you how a process works using an example that you can easily verify with other means.

  • @awaisjabbar8304
    @awaisjabbar8304 6 місяців тому +1

    Liked the video, before it even starts, awesome channel, why I have not heard of you before.

  • @newsupdates3622
    @newsupdates3622 3 роки тому +2

    Double bam! Here to watch another great statquest video for a refresher

  • @selva279
    @selva279 3 роки тому +4

    Eagerly waiting for your book...

  • @jerrycheung8158
    @jerrycheung8158 3 роки тому +8

    Great video! May I ask where is the next video to go to for solving the n - 1 puzzle?

    • @statquest
      @statquest  3 роки тому +4

      Unfortunately I haven't made it yet. However, this is my favorite webpage on the topic: online.stat.psu.edu/stat415/lesson/1/1.3

    • @trad_sikh
      @trad_sikh 2 місяці тому

      @@statquest
      I have a question which is bothering me.
      We know sampling distribution’s standard deviation (standard error) times square root of sample size is the population’s standard deviation.
      Why are we even getting into this mess of dividing by n-1?
      Is it because the first case is for a sampling distribution (distribution of means of samples) while the later is a simple random sample ? Am I mixing Central limit theorem where it doesn’t belong?

    • @statquest
      @statquest  2 місяці тому

      ​@@trad_sikh To see why dividing by 'n' underestimates the population variance, see: ua-cam.com/video/sHRBg6BhKjI/v-deo.html

  • @egeilkay8882
    @egeilkay8882 19 днів тому +1

    when the statSquatch said "TRIPLE BAM" it just hit different

  • @eduquest3169
    @eduquest3169 3 роки тому +1

    Thanks a lot dear, may god bless you Josh.

  • @Letsbetog
    @Letsbetog 3 роки тому +3

    You are an awesome singer As well as A great Teacher.🙃🙃🙃

  • @homunculide8567
    @homunculide8567 2 роки тому +1

    Thanks for that and another videos with all that useful explanations... I'll keep binge watching. Regarding your explanation at 10:06... The second term and all following appear wrong to me, because the intervals are 10s each and not 20s, 30s, 40s... this the area of the rectangle should be likelihood x 10(s) for all terms (discrete rectangle) You are the expert but I can't wrap my head around that.

    • @statquest
      @statquest  2 роки тому +1

      I'm not sure I fully understand your equation. The area of each rectangle (the probability) is 10 x likelihood (obviously the values are approximated and rounded in the figure). The time, 's', should only be used to determine the likelihood at a specific point.

  • @AG-cx1ug
    @AG-cx1ug 9 місяців тому

    Did we finally get the final equation as at 7:40? Both are just area under the curve? Width wasn't being used before 11:00 - we were just using the specific outcome x, how did both width and specific outcome come in the final equation?

    • @statquest
      @statquest  9 місяців тому

      The equation at 7:40 is just the equation for the area under a curve defined by an exponential distribution, not the equation for the expected value of an exponential distribution. The equation for the expected value of an exponential distribution doesn't show up until 12:39. The big difference is that we multiply the formula for the exponential distribution by 'x', a specific outcome. Width isn't part of this equation because we take the limit earlier as the number of rectangles goes to infinity and their widths go to 0.

  • @sharan9993
    @sharan9993 3 роки тому +7

    When can i know why we divide sd by n-1?
    I hav seen ur videos on sd and know we need to subtract something but why 1?
    U said we need to know about expected values first. So when can i learn?
    Love ur videos. Thanks a lot

    • @statquest
      @statquest  3 роки тому +5

      Believe it or not, we still have a long way to go. Sorry this process is so painful!

    • @statquest
      @statquest  3 роки тому +6

      That said, if you want to skip to the head of the class, see: online.stat.psu.edu/stat415/lesson/1/1.3

    • @sharan9993
      @sharan9993 3 роки тому +2

      @@statquest ohhh i will wait bcoz i am curious yet lazy😂

    • @sharan9993
      @sharan9993 3 роки тому +1

      @@statquest i checked it out and it went over my head. Could u tell me the path that leads there? I will search and read them. Thanks a lot

    • @statquest
      @statquest  3 роки тому +3

      @@sharan9993 If I could, you wouldn't be waiting for my videos.

  • @tauqeeralitapya6447
    @tauqeeralitapya6447 Рік тому

    at 9:39, why are we multiplying 20 with 0.2 and not 10 as the interval is still 10 between 10 and 20?

    • @statquest
      @statquest  Рік тому

      20 represents the specific, discretized, outcome. In other words, we have discrete time points, 10 seconds, 20 seconds, 30 seconds, 40 seocnds, etc. that we have to wait before we meet someone in StatLand. Since we have a histogram, everyone we meet between 0 and 10 seconds is categorized as "someone we met in 10 seconds". Likewise, everyone we meet between 10 and 20 seconds is categorized as "someone we met in 20 seconds". etc.

    • @priyankjain9970
      @priyankjain9970 Рік тому

      @@statquest then it should be like = (10*0.4) + (20*(0.4+0.2)) + (30*(0.4+0.2+0.1)) and so on.. and that's because for x=10 , probability will be area under curve till 10 which is 0.4, and for x=20 probability will be area under curve till 20 which is 0.63(roughly equal to 0.6) ... Please correct me if I am wrong

  • @alexandersmith6140
    @alexandersmith6140 Рік тому +1

    Hi Josh. What video(s) after this should I watch to get closer to understanding why we divide by (n - 1) when finding the sample variance and sample covariance? An intuitive explanation for this seems nowhere to be found across the entirety of the internet, and the StatQuest channel has thus far been a divine gift of comprehension.

    • @statquest
      @statquest  Рік тому

      Have you seen this one: ua-cam.com/video/sHRBg6BhKjI/v-deo.html Other than that, the best I can do is refer you here: online.stat.psu.edu/stat415/lesson/1/1.3 One day I turn that page into StatQuest, but not for a while.

    • @atg6174
      @atg6174 11 місяців тому +1

      @@statquest why not now? i think its time to do so ...😢

    • @quanghoang3801
      @quanghoang3801 8 місяців тому

      @@statquest i have waited for 4 years 😢

  • @AG-cx1ug
    @AG-cx1ug 9 місяців тому

    At 11:13 why is there still an x value at x * L(X=x) * width? Wasn't the x the width?

    • @statquest
      @statquest  9 місяців тому

      'x' is the value on the x-axis. It refers to a specific outcome or event happening at a specific time.

  • @samieweee7468
    @samieweee7468 11 місяців тому +1

    I am a bit confused is the value in y axis the likelihood of meeting a person after xth seconds or the number of people we meet after x th seconds cause during initial explanation the dots at each x were number of people we met after xth time.

    • @statquest
      @statquest  11 місяців тому

      What time point, minutes and seconds, are you asking about?

    • @samieweee7468
      @samieweee7468 11 місяців тому

      ​@@statquest 2:37

    • @statquest
      @statquest  11 місяців тому +1

      @@samieweee7468 At 2:37, each dot represents an individual person. In other words, StatSquatch is creating a histogram. However, histograms have problems - gaps in the data and the data are not continuous (they have to be put in bins). Thus, we use an exponential distribution to approximate the histogram. The exponential distribution doesn't have gaps and is continuous. And, from there on out, we use the distribution, which gives us likelihoods on the y-axis, rather than the histogram, which gives us the number of people on the y-axis.

  • @adampotter760
    @adampotter760 Рік тому +1

    What the next video to watch to find out the elusive reason for dividing by n-1!? Please reply!

    • @statquest
      @statquest  Рік тому +1

      Unfortunately I don't have the video yet. In the mean time, check out: online.stat.psu.edu/stat415/lesson/1/1.3

  • @DjamelEddineStudentAchour
    @DjamelEddineStudentAchour 2 місяці тому +1

    amazing, thanks for the video.

  • @kakashisensei8146
    @kakashisensei8146 3 роки тому +2

    Josh, I'm really thankful to all your vdos, it's enlightening, lol.
    Please can you make some vdos on distribution functions (e.g., normal distribution)?

    • @statquest
      @statquest  3 роки тому +2

      I've got a basic video on the normal distribution here: ua-cam.com/video/rzFX5NWojp0/v-deo.html and a video on maximum likelihood estimation with the normal distribution here: ua-cam.com/video/Dn6b9fCIUpM/v-deo.html All of my videos can be found here: statquest.org/video-index/

    • @kakashisensei8146
      @kakashisensei8146 3 роки тому +2

      @@statquest you're the best..
      Thanks a lot!

  • @НиколайТолстов-м3и
    @НиколайТолстов-м3и 2 роки тому +1

    1) Why cant we just take average of all waiting times to get how much on average we would wait?
    2) How often in real life we can use some sort of formula like exponential to describe a distribution? I think in most of the cases we can use NN to fit the distribution

    • @statquest
      @statquest  2 роки тому +3

      Expected values are helpful for doing Statistics and getting a sense of how likely future events will be. This is why we go through the trouble to do all this math instead of just taking an average or fitting an NN to the data.

  • @taotaotan5671
    @taotaotan5671 3 роки тому +6

    probability is fun : )

    • @statquest
      @statquest  3 роки тому

      BAM! :)

    • @taotaotan5671
      @taotaotan5671 3 роки тому

      ​@@statquest I guess the next video will talk the variance of a random variable :)

  • @SunSan1989
    @SunSan1989 Рік тому +1

    I watched all the videos carefully, but why can't I find it? Why choose n-1 instead of N minus 2 or n-0.5?

    • @statquest
      @statquest  Рік тому

      Unfortunately I haven't made that one yet. In the mean time, check out: online.stat.psu.edu/stat415/lesson/1/1.3

  • @drelijahmikail3916
    @drelijahmikail3916 3 місяці тому

    shouldn't it be a Riemann sum where the interval is the same? Why is the multiplier: 10, 20, 30, etc, instead of constant interval of 5 or 10?

    • @statquest
      @statquest  3 місяці тому

      Because the x-axis value (10, 20, 30) represents time and we want to know the probability of observing something during that block of time.

  • @AG-cx1ug
    @AG-cx1ug 11 місяців тому

    At 14:30 isn't the anti derivative just the integral?

  • @isurumahakumara
    @isurumahakumara 3 роки тому

    I couldn't find any quests on Time Series, If there aren't any would love to see one in the future!!!

  • @mohammedk.h.f3016
    @mohammedk.h.f3016 3 роки тому +2

    Thank you very much

  • @waisyousofi9139
    @waisyousofi9139 3 місяці тому +1

    AS USUAL, BEST!

  • @AG-cx1ug
    @AG-cx1ug 9 місяців тому

    For each rectangle shouldn't we be multiplying the likelihood value by 10 and not 20,30? Since the area is just between 10-20 or 20-30?

    • @statquest
      @statquest  9 місяців тому

      At 9:00 we calculate the y-axis values (the likelihoods) for each rectangle by 10 because it is the distance between each tick-mark on the x-axis. This gives us the probabilities of each event. However, the events, or outcomes, themselves, are the x-axis values. So, starting at 9:10, we multiply the probabilities by the outcomes (the x-axis coordinates). When we add these products up, we get a weighted average of the outcomes (weighted by their probability of occurring). Is that what you are asking about?

  • @Altamira_
    @Altamira_ Рік тому

    Thanks for all of your videos they are great. I'm waching the full playlist "statistics fundamentals" and have to admit that since I understood all of the previous videos thanks to your great explanations I was lost on this one when you used integrals :/ I think it's the video that requires the more calculcus knowledge.

    • @statquest
      @statquest  Рік тому +1

      As long as you understand the discrete case, you should be good to go.

    • @Altamira_
      @Altamira_ Рік тому +1

      @@statquest Thanks :)

  • @mamadi-uk7kv
    @mamadi-uk7kv 2 роки тому +1

    Perfect man,Great

  • @mandaltech6249
    @mandaltech6249 2 роки тому

    Waiting for your book. You area real teacher.

    • @statquest
      @statquest  2 роки тому

      Thanks! I hope it is out in May!

  • @aysan7513
    @aysan7513 2 роки тому +1

    omg you saved my life

  • @AG-cx1ug
    @AG-cx1ug 9 місяців тому

    What if the data is erratic and doesn't fit with any of the lambda values to make the curve?

    • @statquest
      @statquest  9 місяців тому

      Then you can either use a different distribution, or you can try to approximate things with a histogram.

    • @AG-cx1ug
      @AG-cx1ug 9 місяців тому +1

      @@statquest Thank you!

  • @গোলামমোস্তফা-শ৮থ

    Probability =( height*weight). Of rectangle
    How did you find that?
    Can you please explain how it indicates the probability?
    ..
    and also explain me that what is the value of "x" in continous random variables "mean"?

    • @statquest
      @statquest  2 роки тому +1

      The area under the curve between two points represents the probability of something happening between those two points (see: 4:48). This is simply how probability distributions are defined. We can solve for that probability exactly using calculus (see: 4:49), or we can approximate it using rectangles (height * weight).

  • @maxyen9892
    @maxyen9892 2 роки тому

    Could you comment on how we would find expected values for a normal probability distribution, given a mean and standard deviation?

    • @statquest
      @statquest  2 роки тому +3

      The expected value for a normal distribution is its mean.

  • @ANGGAKAHFI-kr4yy
    @ANGGAKAHFI-kr4yy 7 місяців тому

    Sorry sir i dont get it when u use 0.05 for the lambda,

    • @statquest
      @statquest  7 місяців тому

      What time point, minutes and seconds, are you asking about?

  • @6872elpado
    @6872elpado 3 роки тому +1

    I didn't know you could call the integral of g'(x) the anti-derivative. Why didn't you just integrate g'(x) to get g(x)?

    • @statquest
      @statquest  3 роки тому +2

      "Antiderivative" and "Indefinite Integral" are synonyms. So we can say it either way. en.wikipedia.org/wiki/Antiderivative

  • @amirhosseinshafieian3951
    @amirhosseinshafieian3951 3 роки тому

    THANK YOU for your great videos I just have a question, I think( I am not sure) that this data in this example follows Poisson distribution, not exponential!!!!! , am I right???

    • @amirhosseinshafieian3951
      @amirhosseinshafieian3951 3 роки тому

      I think I got it my self

    • @statquest
      @statquest  3 роки тому

      Poisson is discrete and models something very differently from what we are modeling here with an exponential distribution. For details, see: en.wikipedia.org/wiki/Poisson_distribution

  • @davonraymond3274
    @davonraymond3274 Рік тому +1

    appreciate you

  • @alikhalili9961
    @alikhalili9961 2 роки тому

    Where is the n-1??????????? I am outraged. Lol. By the way, great video. Looking forward for more.

    • @statquest
      @statquest  2 роки тому +1

      Unfortunately this is just the first of many steps. :(

  • @tedchirvasiu
    @tedchirvasiu 3 роки тому +4

    5 years later: if you watched this video hoping to learn exactly why we divide by n-1, you are one step closer to understanding this mystery, but not quite there yet.

    • @statquest
      @statquest  3 роки тому +4

      Yep. Sorry it is taking so long.

  • @pigritor
    @pigritor 3 роки тому

    Dear Sir, could you do video on Linear Mixed models and GEE? Maybe you could create a separate donation target with threshold which you need to achieve to make one? It would be extremely useful. It is long waited and hard topic with a lot of contradictory info.

    • @statquest
      @statquest  3 роки тому +3

      I'll keep that in mind.

  • @oraciopozzo8694
    @oraciopozzo8694 3 роки тому

    When you approximate the expected value it is confusing that you add areas whose base is not the constant 10 interval. The areas of the rectangles should be in your example variable height given by the formula times 10 and not 10, 20, … Otherwise the rectangles are meaningless.

    • @statquest
      @statquest  3 роки тому +1

      I'm sorry that was confusing to you. However, each rectangle has the same width (as you can see in the illustration). However, each rectangle represents a different specific outcome. So 10, and 20 are not different widths, but different outcomes. 10 is the outcome represented by the first rectangle (with width = 10) and 20 is the outcome represented by the second rectangle (also with width = 10).

    • @priyankjain9970
      @priyankjain9970 Рік тому

      ​@@statquest then it should be like = (10*0.4) + (20*(0.4+0.2)) + (30*(0.4+0.2+0.1)) and so on.. and that's because for x=10 , probability will be area under curve till 10 which is 0.4, and for x=20 probability will be area under curve till 20 which is 0.63(roughly equal to 0.6) ... Please correct me if I am wrong

    • @statquest
      @statquest  Рік тому

      @@priyankjain9970 No, it's (10*0.4) + (10 * 0.2) + (10 * 0.1) + (10 * 0.09) + ... etc. To approximate the area under the curve, we add up the area of each rectangle. The area of each rectangle is the width (10) times the height (0.4 for the first, 0.2 for the second, 0.1 for the third, etc.).

    • @priyankjain9970
      @priyankjain9970 Рік тому

      @@statquest Thanks for reply.
      Actually the height is 0.04 for first, 0.02 for second, 0.01 for third and so on ( as explained by you @8.29 in video). Therefore area of rectangle will be 0.4 for first, 0.2 for second, 0.1 for third and so on.
      My concern is following
      As you stated E(X) = Σ x * p(X=x) .. This means E(X) = 10*(probability till 10) + 20 * (probability till 20) + 30 * (probability till 30) and so on.
      Now
      probability till 10 means area under curve till 10 which is = 0.4
      probability till 20 means area under curve till 20 which is = 0.6 (approx)
      probability till 30 means area under curve till 30 which is = 0.7 (approx)
      Therefore E(X) should be (as per my understanding) = 10*0.4 + 20*0.6 + 30*0.7 + ....
      Please help me to understand this

    • @statquest
      @statquest  Рік тому +1

      @@priyankjain9970 Sorry about the typos with the area vs height. That said, the probability of observing an event between 10 and 20 seconds is not the cumulative probability of observing an event between 0 and 10 or between 10 and 20. Your equations use the cumulative probabilities, which is not correct in this situation.
      To clarify, the expected value is "the probability of observing an event between 0 and 10 seconds times the outcome, 10 (this is just the label for the any event that occurs between 0 and 10 seconds) + the probability of observing an event between 10 and 20 times the outcome, 20 (again this is just the label for any event that occurs between 10 and 20) + the probability of observing an event between 20 and 30 seconds time the outcome 30 + ....

  • @megasun
    @megasun Рік тому

    Number is not right around 8:25, the rectangle area should be approximate 0.3894. I was very surprised that the given rectangle area of 0.4 is bigger than the given integrated result of 0.39, because the rectangle area looks slightly smaller than the area under the curve ... still I love these videos. And of course, for the purpose of this lecture, having more digits here is distracting and not helpful.

    • @statquest
      @statquest  Рік тому +2

      Sorry if my inconsistent rounding through you off.

  • @Thedavidk
    @Thedavidk 7 місяців тому

    Wow, that got complicated really quickly

    • @statquest
      @statquest  7 місяців тому

      Did you first watch the expected values for discrete variables: ua-cam.com/video/KLs_7b7SKi4/v-deo.html

  • @auzaluis
    @auzaluis 3 роки тому +1

    BAM!

  • @b1ueberrycheesecake
    @b1ueberrycheesecake Рік тому +1

    BAM!!

  • @ahmedabuali6768
    @ahmedabuali6768 3 роки тому

    Dear Professor, I would like to ask you if you have a very good lecture note for this book, introduction to mathematical statistics Robert v. hogg, I am ready to pay for that, I like your methodology in presenting.

    • @statquest
      @statquest  3 роки тому +1

      Unfortunately I've never read that book.

    • @ahmedabuali6768
      @ahmedabuali6768 3 роки тому

      @@statquest Ok, I would like to take this chance again to ask you if possible to add topics about Bayesian statistics. Many thanks and I am still following you :)

    • @statquest
      @statquest  3 роки тому +2

      @@ahmedabuali6768 I'm working on those videos right now. They should be out soon.

    • @ahmedabuali6768
      @ahmedabuali6768 3 роки тому

      @@statquest wow, very good, attached also the pdf presentation please, I am ready to buy it. Go ahead, dear Prof.:)

  • @tim40gabby25
    @tim40gabby25 3 роки тому

    09:50 Eyeballing, 0.4 no way is twice the height of 0.2

    • @statquest
      @statquest  3 роки тому

      Rounding errors.

    • @tim40gabby25
      @tim40gabby25 3 роки тому +1

      @@statquest Appreciate the response, tho you went on a bit. Old UK duffer here, enjoying the ride :)

  • @RoyalYoutube_PRO
    @RoyalYoutube_PRO 7 місяців тому

    Has the mystery of 'n-1' been resolved yet?

    • @statquest
      @statquest  7 місяців тому

      Not yet. The best I can do is give you a link: online.stat.psu.edu/stat415/lesson/1/1.3

  • @VladLanz
    @VladLanz 6 місяців тому

    Looks like the math is wrong for the approximation of expected value of distribution:
    1) for each bin we compute: f(x) = lambda * e ^ (-lambda * x), where f(x) is the value of probability density function (PDF) for the middle point
    2) we compute probability of each bin: p(x) = f(x) * delta, where delta is the width of the bin, in our case = 10
    3) This is a step with mistake: we calculate contribution for each bin and sum everything up: E(x) = sum ( p(x) * x ), where x is the middle/average point for each bin, but in your video you took upper bound instead of an average value for each bin.
    If you do calculation this way, you get 19.23 which is closer to true value

    • @statquest
      @statquest  6 місяців тому

      I wouldn't say the math is wrong because the purpose is only to illustrate a concept, rather than how the math is actually done. In practice, we don't do a summation, we take the integral.

    • @VladLanz
      @VladLanz 6 місяців тому +1

      @@statquest sure, in practice we take the integral. But for approximation it makes more sense to take average value for each bin rather than it's top value. [5, 15, 25, ... 95] instead of [10, 20, 30, ... 100]. In case that's you and not some hired assistant who's answering comments here: thank you for your work, you're amazing🤗 You're the main reason i managed to remember everything i learned in university more than 10 years ago, started to master ml and deep learning and began working as data analyst

    • @statquest
      @statquest  6 місяців тому

      @@VladLanz That's me! Thanks! :) (and I still wouldn't say taking the edge is 'wrong' - different, and maybe it doesn't make as much sense for the sake of getting the best approximation, but not wrong).

  • @pibruks
    @pibruks 3 роки тому +1

    $$ #BAM!!!

  • @satyamraj2779
    @satyamraj2779 10 місяців тому +2

    you are too cute with your bams!

  • @MrAzrai99
    @MrAzrai99 3 роки тому +1

    Just realizing the significance of the calculus I've learnt long time ago ":"(

    • @statquest
      @statquest  3 роки тому +1

      bam!

    • @MrAzrai99
      @MrAzrai99 3 роки тому

      @@statquest Please make a video on multivariate normal distribution next🙇‍♂️

    • @statquest
      @statquest  3 роки тому

      @@MrAzrai99 I'll keep that in mind.

  • @Sarah-hu4wt
    @Sarah-hu4wt 2 місяці тому +1

    I like how I am smarter than StatSquatch

  • @neptunesbounty1786
    @neptunesbounty1786 3 роки тому +1

    1 downvote by StatSquatch

  • @TJ-hs1qm
    @TJ-hs1qm 2 роки тому

    ahh... how come that Likelyhood and Probability actually mean different things ? 🤷‍♂️

    • @statquest
      @statquest  2 роки тому

      See: ua-cam.com/video/pYxNSUDSFH4/v-deo.html

  • @nir199820
    @nir199820 3 роки тому

    I am beginning to think that the all "dividing by n-1" thing is a hoax :)

  • @bockarieb
    @bockarieb 5 місяців тому

    Probability density function tutorials Ghana university

  • @timehealthfit1891
    @timehealthfit1891 3 роки тому +1

    This is a great video, want to be youtube friends?

  • @ruiyucong3125
    @ruiyucong3125 11 місяців тому

    这集学不明白了,已晕

    • @statquest
      @statquest  11 місяців тому

      Sorry. :( Did you watch this one first: ua-cam.com/video/KLs_7b7SKi4/v-deo.html