Sample Size and Effective Sample Size, Clearly Explained!!!

Поділитися
Вставка
  • Опубліковано 26 лип 2024
  • The sample size for an experiment depends on what you want to say and what kind of replicates you have. This StatQuest shows examples of how biological and technical replicates are counted differently. It also shows what to do when your samples are correlated. Oh, and just in case you're interested, the twins are monozygotic.
    For a complete index of all the StatQuest videos, check out:
    statquest.org/video-index/
    If you'd like to support StatQuest, please consider...
    Buying The StatQuest Illustrated Guide to Machine Learning!!!
    PDF - statquest.gumroad.com/l/wvtmc
    Paperback - www.amazon.com/dp/B09ZCKR4H6
    Kindle eBook - www.amazon.com/dp/B09ZG79HXC
    Patreon: / statquest
    ...or...
    UA-cam Membership: / @statquest
    ...a cool StatQuest t-shirt or sweatshirt:
    shop.spreadshirt.com/statques...
    ...buying one or two of my songs (or go large and get a whole album!)
    joshuastarmer.bandcamp.com/
    ...or just donating to StatQuest!
    www.paypal.me/statquest
    Lastly, if you want to keep up with me as I research and create new StatQuests, follow me on twitter:
    / joshuastarmer
    #statquest #statistics

КОМЕНТАРІ • 80

  • @statquest
    @statquest  2 роки тому +2

    Support StatQuest by buying my book The StatQuest Illustrated Guide to Machine Learning or a Study Guide or Merch!!! statquest.org/statquest-store/

    • @Also_sprach_Zarathustra.
      @Also_sprach_Zarathustra. 6 місяців тому

      Hello ! Thank you a lot for you videos !
      But I have a question: since everyone is correlated to some extent, shouldn't we use this formulation to calculate the 'effective size' for each sample (M.yellow, etc.)?

    • @Also_sprach_Zarathustra.
      @Also_sprach_Zarathustra. 6 місяців тому

      More clearly: why this limitation, this threshold, with twins? Isn't it a big bias ?

    • @Also_sprach_Zarathustra.
      @Also_sprach_Zarathustra. 6 місяців тому +1

      And yes everyone should buy your book !!! It's an amazing book, and the clearest statistics courses I have seen!!!
      !! Please don't stop teaching like you do, we need you so much !!

    • @statquest
      @statquest  6 місяців тому

      @@Also_sprach_Zarathustra. What time point in the video, minutes and seconds, are you asking about?

  • @mastermike890
    @mastermike890 6 років тому +2

    Such a great series! Really helps make statistics approachable to us! I can't wait for a Central limit therm, delta method, strong law of large numbers or perhaps slutsky's theorem. Those are all topics which I found really challenging when I first started my studies.

  • @nanazeethiopia2892
    @nanazeethiopia2892 4 роки тому +1

    BAM!! I Really like your lectures . They are super professional. Can you do on LSTM , Long range dependence, and other types of distributions like Pareto Distributions ....??. Thanks.

  • @pamaa7
    @pamaa7 4 місяці тому +1

    Thank you so much for explaining a student who got low marks in Statistics I'm feeling happy that my concepts are clear now. Can't thank you enough.

  • @qwerty11111122
    @qwerty11111122 2 роки тому +1

    1:10 interesting, tree blood. This was a nice and sweet statquest.

  • @tracyxiang7692
    @tracyxiang7692 6 років тому

    love the series!!

  • @joaocarneroguedes
    @joaocarneroguedes 4 роки тому +1

    Very interesting! Thanks for this!

  • @parklee3646
    @parklee3646 4 роки тому +1

    Can you please make a video about effect size or randome effect model?

  • @joshtwigg5419
    @joshtwigg5419 2 роки тому +1

    These videos are awesome! Is this the same concept as ICC for cluster-RCT analysis? Thanks :)

    • @statquest
      @statquest  2 роки тому +2

      Maybe. Unfortunately, I'm not familiar with those terms.

  • @ehsansalehabadi1500
    @ehsansalehabadi1500 Рік тому +2

    Thanks for these awesome sets of lectures, sir.
    I had a question. does the formula work only for positive correlations? I mean what if the correlation between twins is negative so that they zeroed the denominator, or for example, make the effective size negative?

    • @statquest
      @statquest  Рік тому

      To be honest, all I know about the actual formula is that it's a little more complicated than what I presented (this video was simply to present the main ideas of the concepts) and presumably can handle negative correlations correctly.

    • @ehsansalehabadi1500
      @ehsansalehabadi1500 Рік тому +1

      @@statquest Cool! Thank you for the clarification, sir :-)

  • @fatemehkarimi6752
    @fatemehkarimi6752 2 роки тому +2

    dear Josh, I will be thankful if you make a video about the effect size and cohen's d. these topics are intangible and I didn't find a good video for them on the net. thanks in advance.

    • @statquest
      @statquest  2 роки тому

      I believe I talk about these things in my video on power: ua-cam.com/video/VX_M3tIyiYk/v-deo.html

  • @chaduvulachilakala7981
    @chaduvulachilakala7981 Рік тому +1

    great explanation sir thank you

  • @joshstat8114
    @joshstat8114 5 місяців тому

    Clearly explained. I have something to request. Would you like to create a video about law of large numbers and order statistics?

    • @statquest
      @statquest  5 місяців тому

      I'll keep that in mind.

  • @AmitKumar-uj6ed
    @AmitKumar-uj6ed 3 місяці тому +2

    Well! Everything is clearly explained 😅

  • @ciherrera
    @ciherrera 4 роки тому

    I have a couple questions about this video. Is there an explanation for the effective sample size equation you show? Also, why would the effective sample size equation not be linear with correlation? If you have a sample size of 2, and a correlation of 0.5, why would the effective sample size be 1.33 and not 1.5?

    • @statquest
      @statquest  4 роки тому

      For more details on the formula for effective sample size, check out the wikipedia article: en.wikipedia.org/wiki/Effective_sample_size

  • @reytns1
    @reytns1 6 років тому

    Hello, Joshua, I have an experiment on an F1 mapping population, I work with table grape, so I take 36 individuals of my population to measurement firmness 18 have a good firmness and the other not, actually are the bad ones.... so as you mentioned in this video do I have to apply an effective sample size? (Is it possible that you give your email?) thanks

  • @ZiauddinAzimi
    @ZiauddinAzimi 4 роки тому

    Nice explanation. I have a question. I want to find the effects of Vitamin A on the methylation status of some specific genes in children. Now how to calculate the sample size?

    • @statquest
      @statquest  4 роки тому +3

      Good question. You need an estimate of the effect size and the variation in the data. Then you can do a power analysis. Perhaps there are similar studies that you can look at for ideas.

  • @pjgdba306
    @pjgdba306 3 роки тому +2

    Cool, I have no reason to know this info, but it is really interesting.

  • @karthica5251
    @karthica5251 7 місяців тому

    Amazing as always, but here when you mention correlation are you referring to pearson correlation coeff? Is it possible for them to be non-linearly correlated (I cannot think of a situation)?

    • @statquest
      @statquest  7 місяців тому +1

      I believe people usually use pearson's correlation coefficient. I can't imagine the correlation being non-linear, but you can always check for that.

  • @afzalsiddique7165
    @afzalsiddique7165 4 роки тому +3

    If you're a mouse geneticist, you can think of blue dudes as a specific strain of mouse. DOUBLE BAMM!!!

  • @user-bz8nm6eb6g
    @user-bz8nm6eb6g 4 роки тому +2

    what a clear explanation

  • @sauravpcbl
    @sauravpcbl 4 роки тому +1

    Thanks a ton Sir

  • @csmatyi
    @csmatyi 5 років тому +5

    how do we calculate correlation between let's say, 5 blue dudes?

    • @statquest
      @statquest  5 років тому +1

      Great question! I'm not an expert on this one, but here's a guess: One way you could do it is measure expression from a lot of genes (perhaps all of them) and then calculate correlations based on that. Since there is going to be a fair amount of correlation between everyone (blue, orange and green - since they all are dudes and do dude things) you could scale the correlation based on how correlated blue is to orange and green.

    • @brunoraviolo
      @brunoraviolo 4 роки тому

      @@statquest This sounds very important for any research and yet seems to be a methodological detail, i'm confused. Could you give us some examples? Thanks

  • @EdDubs_
    @EdDubs_ Рік тому +1

    Damn that intro goes hard

  • @kevindeng3576
    @kevindeng3576 5 років тому +1

    Could u please do one on latent variable?

  • @salsabillashafaadzra8109
    @salsabillashafaadzra8109 3 роки тому

    Hi sir, can we use margin error 7% and is there any journal that refers it? Thank you

    • @statquest
      @statquest  3 роки тому

      I'm not sure what you mean by your question. Can you elaborate on it or give more context?

    • @salsabillashafaadzra8109
      @salsabillashafaadzra8109 3 роки тому

      @@statquest like other formula sample size, there is rule that margin error about 5 % - 10 % but not in 7%. For example like slovin formula is have margin error 5% and 10%. So that, i wanna ask can we use margin error 7% and is there any references that i can refers it? Thank you, sir

    • @statquest
      @statquest  3 роки тому +1

      @@salsabillashafaadzra8109 I see. Unfortunately I don't know of any references. That doesn't mean they don't exist, it just means I don't know about them. bummer! :(

  • @abbygillette568
    @abbygillette568 4 роки тому +2

    I love one man and that man is Josh Starmer

  • @PGDM_SHASHANKJAIN
    @PGDM_SHASHANKJAIN 4 роки тому

    Hi, Here you calculated the effective sample size for persons belonging to the same set of population( blue ones), However there must be some correlation between the orange and blue ones or blue and green ones or orange and green ones or among all three of them .Aren't we supposed to calculate effective sample size for them too?

    • @statquest
      @statquest  4 роки тому +1

      If all samples have the same amount of correlation, then it all washes out and we don't have to worry about things. However, if some samples are more correlated than others, then we need to take that into account, so that's what's going on here. In your own study, you need to figure out if there is a uniform amount of correlation among your samples or not. If not, then you need to adjust for it.

    • @yclong8848
      @yclong8848 4 роки тому

      @@statquest Hi, how do you calculate this correlation? Using genetic correlation? Like for the identical twins, the genetic correlation = 1, then sample size = effective sample size. Then that means it won't raise the power by adding an identical twin dude into the sample, right? thanks.

  • @thegamingannex5752
    @thegamingannex5752 2 роки тому +1

    If you're a quality engineer, you can think of technical replicates as repeatability and reproducibility samples. TRIPLE BAMMM!

  • @TomerBenDavid
    @TomerBenDavid 3 роки тому

    Which mic do you use?

    • @statquest
      @statquest  3 роки тому

      I think I just used to the built in mic on my laptop for this one. However, if you want details on my setup, see: ua-cam.com/video/crLXJG-EAhk/v-deo.html

  • @zijieyuan2255
    @zijieyuan2255 5 років тому +8

    You must know how to do magics. Otherwise how could you make things so easy!!

  • @monzurmorshed.
    @monzurmorshed. 2 роки тому +1

    Thank you!

  • @xanyula2738
    @xanyula2738 2 роки тому

    Is effect size and effective sample size the same thing ?

    • @statquest
      @statquest  2 роки тому

      No. Effect size is related to how different two groups are.

  • @a_son8549
    @a_son8549 3 роки тому +1

    Neat!

  • @guilhermesantucci
    @guilhermesantucci 3 роки тому

    im high watching this on freewill and not algorithm recommendation. i forgot why

  • @judelaroscain6948
    @judelaroscain6948 6 років тому

    Duh hahaha .

  • @daisyumutoni8957
    @daisyumutoni8957 3 роки тому

    how would you add a fraction of a dude to a sample though..?

    • @statquest
      @statquest  3 роки тому +1

      Ha! I guess you need to round up! :)