Choosing a Statistical Test for Your IB Biology IA

Поділитися
Вставка
  • Опубліковано 21 січ 2025

КОМЕНТАРІ • 436

  • @danielm9463
    @danielm9463  5 років тому +88

    One small caveat: in broader mathematics, "number of bacterial colonies" would be treated as a *discrete variable*, which means the variable is numeric but it's restricted to certain values (and between those allowable values are gaps that the variable can't take on). But if you're plugging that variable into a regression or t-test/ANOVA model, then you're treating it as continuous. To quote minitab, which has great articles on statistics:
    "If you have a discrete variable and you want to include it in a Regression or ANOVA model, you can decide whether to treat it as a continuous predictor (covariate) or categorical predictor (factor). If the discrete variable has many levels, then it may be best to treat it as a continuous variable. Treating a predictor as a continuous variable implies that a simple linear or polynomial function can adequately describe the relationship between the response and the predictor. When you treat a predictor as a categorical variable, a distinct response value is fit to each level of the variable without regard to the order of the predictor levels. Use this information, in addition to the purpose of your analysis to decide what is best for your situation."
    support.minitab.com/en-us/minitab-express/1/help-and-how-to/modeling-statistics/regression/supporting-topics/basics/what-are-categorical-discrete-and-continuous-variables/

    • @madipsychomadi3144
      @madipsychomadi3144 4 роки тому +3

      thank you very much for your generous response.

    • @felixokatch4587
      @felixokatch4587 4 роки тому +1

      Good

    • @danielm9463
      @danielm9463  3 роки тому

      ​@Raggul Kanakasabapathy, it sounds like it could go either way. I'm not an expert by any means, but I think you would decide whether to treat the independent variable as categorical vs continuous by asking questions like "do I expect a linear relationship between the input and output, such that more of the independent variable consistently leads to more/less of the dependent variable?" I think you can also ask questions like "for the purposes of my study, do I care about the *relationship* that the independent variable has to the dependent variable, or do I care about finding e.g. a single dose that is therapeutically significant?"
      Do you have reasons to expect that salt has a linear relationship (positive or negative) with plant growth? Are you interested in the specific nature of that linear relationship (e.g., is your goal is to say "for every 1 g of salt, the leaves grow X cm longer")? If so, then it might make more sense to treat mass as continuous.
      Alternatively, are you looking for whether a particular mass of salt exists which provides a statistically significant growth effect? Are there reasons why too little salt could inhibit growth, and too much salt could inhibit growth, but maybe there's a salt mass in the middle that could balance multiple biological factors and yield the most growth? If so, then it might make more sense to treat mass as categorical. The fact that you described 0 g of salt as the "control group" already suggests to me that you might be looking to compare salt vs. no salt, and the reason you have different amounts of salt is to help answer the question "do plants grow more effectively with salt than without salt?" In other words, having a control group suggests that you're not looking for the specific nature of a hypothesized linear relationship between amount of salt and amount of growth.
      I've never taught IB Bio, but I did teach IB Physics. If you're writing an IB Biology IA, I would think it's worthwhile to explain how you're analyzing your data and justify why you chose to treat the independent variable as categorical vs. continuous. If you choose to treat mass as categorical, then you've got a categorical independent variable and a continuous dependent variable--what test would you use in that circumstance? If you treat mass as numerical, then both the IV and DV are continuous--what test would you use in that circumstance?
      (By the way, you can still do a regression with a categorical independent variable, but that's not really done in IB Biology, and it sounds like this might not make sense in your case. Here's what it would entail. For each dose, you'd essentially have a different dummy variable that takes on a value of 0 or 1. So the variable m_1g would be 0 or 1, and the coefficient would tell you how much more/less growth the plant had compared to 0 g when 1 g of salt was added. The variable m_2g would similarly take on values of either 0 or 1, etc.)
      Another quick thing--if you go the route of performing multiple t tests (to compare each salt mass to the control group), then you want to watch out for family-wise error. You can do a quick Google search to see what this is. This is usually dealt with by lowering the p value that you need in order to reject the null hypothesis. For example, if you do 5 t-tests, and you'd normally use p < 0.05 as your significance level, then you might instead calculate 0.05/5 and use p < 0.01 as the significance level for each test, since performing multiple tests increases the likelihood of a false positive. There are different ways to account for this family-wise error, and the appropriate approach depends on the class you're in and how rigorous the statistics element is.

    • @danielm9463
      @danielm9463  3 роки тому

      @Raggul Kanakasabapathy Since it's an IA, I can't quite give you a direct answer--that will be up to you. All I'll say is that it sounds like you're saying you have a continuous IV, a continuous DV, and you seek the mathematical relationship.
      if you go the route of regression, I have a few thoughts: (a) I think regression and mathematical relationships are less common in IB Bio IAs. Because of that, I recommend asking your teacher if you're on the right track and if you have a suitable topic, question, and statistical analysis approach. (b) If you go the route of regression, I think you will want to be careful about how you phrase your research question. If your understanding is that regression is appropriate when seeking a mathematical relationship between a continuous IV and a continuous DV, then you want to phrase your research question as seeking that *mathematical* relationship between the variables. (c) In all of it, the ultimate goal is to use the analysis as a mechanism to talk about in-depth Biology analysis. So if you find a negative linear relationship, then you want to use the Biology you've learned to try and explain, in depth, *why* that relationship exists. (d) Since I've never taught IB Bio, definitely have a conversation with your teacher about all of this. The advice I'm giving you is good for IB Physics, but I'm not sure how much of it translates over to Biology.

    • @danielm9463
      @danielm9463  3 роки тому +1

      @Raggul Kanakasabapathy Hi Raggul, I was curious how your IA turned out! Were you happy with the final product?

  • @tekmepikcha6830
    @tekmepikcha6830 4 роки тому +334

    This is perhaps the most simpliest and yet full tutorial I've heard on statistical tests LOL.....Well put together Daniel. Thanks

  • @RG-sl6ix
    @RG-sl6ix 4 роки тому +172

    Finally, the sky is clear....

  • @14MCDLXXXVIII88
    @14MCDLXXXVIII88 2 роки тому +24

    This is by far the best material about SPSS I've ever seen in my whole life. Throw 100$ SPSS books into trash can. This video is very beautiful in a minimalist way. Greetings from a procrastinator doctor, who is trying to hurry analysis at 07:00 AM for the finish date of a paper :-)

  • @Tiatabs13
    @Tiatabs13 2 роки тому +34

    I have all this data I’ve collected for my masters thesis, I’ve been looking for a test to use for two days. This video, hands down, has just saved my life. THANK YOU!!!

  • @asmaaadventures1791
    @asmaaadventures1791 2 роки тому +7

    Wow! I cannot express how much I am grateful to you for making this video. I spent one week figuring out which suitable statistical test was for my case. THANK YOU

  • @LnPlayEng
    @LnPlayEng 2 роки тому +9

    Man, this guy nailed it. I fumbled in my undergrad research. I wish this was available to me then. Thank you, Sir.

  • @Lydia-yo4lo
    @Lydia-yo4lo 2 роки тому +10

    Thank you so much for this video! Simple, concise, well organised -- it's rare to see such a well-made tutorial to a somewhat confusing topic such as this, amazing :)

  • @rodsalomon6524
    @rodsalomon6524 3 роки тому +29

    Im from the Linguistics field in this is still helpful for my thesis, thank you!

  • @view5558
    @view5558 3 роки тому +11

    After struggling for years trying to figure out about the necessity of so many statistical tests, finally I have an overview of statistical tools an how to choose one from.

  • @1990poulami
    @1990poulami 2 роки тому +4

    I watched so many videos on this topic. But this is the best explanation so far. Thank you so much

  • @v9054
    @v9054 4 роки тому +5

    I fell in love with the way you explained both the qualitative and quantitative techniques.

  • @danielelieh-ali-komi6544
    @danielelieh-ali-komi6544 Рік тому +5

    This video includes more data and has better educational content than what I learned in my MSc. Thank you Daniel.

  • @mmmm7656
    @mmmm7656 4 роки тому +80

    Thanks to Allah.. I have found someone who is teaching the basics..I needed it badly

    • @asmasultana2732
      @asmasultana2732 3 роки тому

      Hello. I need some help from you

    • @mmmm7656
      @mmmm7656 3 роки тому

      @@asmasultana2732 yes

    • @danielm9463
      @danielm9463  3 роки тому +1

      I'm glad it helped! I often need the basics too. This is a good starting place, but this video really does make some simplifications, and as you study more, you'll see that things aren't always so clear cut. I'm glad it helped!

  • @robingriffin42
    @robingriffin42 4 роки тому +58

    9:57 minute video vs 6 weeks of lectures...just wow!

  • @jodieharth
    @jodieharth 4 роки тому +10

    Thank you! Was recommended for Business Data Analytics.
    4:08 is particularly useful if you're wondering what sort of test to use (or, what it's called!) Chi-Squared / t-Test / Correlation.

  • @darrenzerone765
    @darrenzerone765 2 роки тому +5

    Statistics are so simple and easy to understand after your wonderful explanation. Thank you for this amazing video.

  • @saggrawal
    @saggrawal 4 роки тому +10

    An amazingly simple description of statistical tests. Thank you so much!

  • @view5558
    @view5558 3 роки тому +4

    After struggling for years trying to figure out about the necessity of Sr many statistical tests, finally I have an overview of statistical tools an how to choose one from.

  • @michalinaprycka3692
    @michalinaprycka3692 4 роки тому +10

    This video has helped me more than my biology teacher, love you man

    • @danielm9463
      @danielm9463  3 роки тому

      Glad it helped! I sympathize with IB Bio teachers! They're expected to teach statistics, but it's not part of the curriculum!

  • @donharris8846
    @donharris8846 4 роки тому +11

    For a broad perspective, this is a remarkable video. Nice work! This helps a lot

  • @yusronnaim
    @yusronnaim 2 роки тому +1

    half semester of my statistics class didnt give such comprehension, thank you for this well put video.

  • @tameka4100
    @tameka4100 4 роки тому +1

    your english is impeccable. i can clearly hear the proper pronunciation and enunciation. Concept is well explained.

    • @danielm9463
      @danielm9463  3 роки тому

      It's funny--I think people think I'm smarter than I am because I enunciate well :)

  • @happyshappys
    @happyshappys 3 роки тому +1

    Simple, easy and best and proper way to have quick notes

  • @NileSon123
    @NileSon123 2 роки тому +4

    Best video I watched in statistics especially in experimental designs
    Thank you very much for that simple and valuable information

  • @Abdul-io8xh
    @Abdul-io8xh 2 місяці тому

    You are so amazing! So simple yet eloquent!! you KNOW how to teach stats.

  • @solomiiak1779
    @solomiiak1779 Рік тому

    Thank you sooo much for this video! It saved my life!I spent so many days trying to understand which test I should use for my master's thesis. You explained VERY well and simple! Thank you again.

  • @s1lverhawk
    @s1lverhawk 3 роки тому +3

    From the bottom of my heart THANK YOU for this

  • @_kaira4130
    @_kaira4130 3 роки тому +1

    The way to generalize test in a table is amazing!! Thanks! It really really helps a lot!

    • @danielm9463
      @danielm9463  3 роки тому

      Glad it helped! Just don't miss the correction I mentioned in the description

  • @maria_yourKARMA
    @maria_yourKARMA 8 місяців тому

    I like how the simplicity of your content, keep it up.

  • @ottobena
    @ottobena 3 роки тому +3

    Simply the best and clearest explanation video.

  • @Sarmilagiri12
    @Sarmilagiri12 4 роки тому +14

    The most simpliest and useful tutorial i ever heard and seen. Thank you soo much for giving us such a wonderful lessons. 😊😊😊

  • @tessaowens
    @tessaowens 4 роки тому +13

    Correlation can be used on categorical data too! Not just continuous data.

    • @jincymathewm362
      @jincymathewm362 3 роки тому +2

      Yes, spearmans rank correlation

    • @mrkhan.881
      @mrkhan.881 3 роки тому +1

      Yes. But the dependent variable should always be continuous variable. And if the independent variable is Nominal then use Phi coefficient test and if ordinal then use Spearman.

    • @danielm9463
      @danielm9463  3 роки тому +3

      @@mrkhan.881 That's true for linear regression. Other regressions (e.g., logistic regression) do permit the dependent variable to be categorical.

    • @danielm9463
      @danielm9463  2 роки тому +1

      @Tessa Owens, it's a good point. Really, a lot of the dichotomies in this intro video disappear in higher-level stats. You can reproduce the same p-values in Anova as a regression model, and you can set up dummy/indicator variables to encode categorical variables. The main audience for this video is high schoolers who haven't learned any stats but are expected to use it in their IB Biology IA.

    • @danielm9463
      @danielm9463  2 роки тому +1

      @@jincymathewm362 It's a good point. I've intentionally avoided ordinal data in this video because it really is a separate category from a stats perspective and usually requires nonparametric methods. Fortunately IB Bio high schoolers aren't expected to use it!

  • @datapeek
    @datapeek 2 роки тому +2

    I watched many videos on Hypothesis tests and always left with some sort of confusion. But you made it really simple. 🔥

  • @flyingfox707b
    @flyingfox707b 3 роки тому +19

    Very succinct and well put together. Thank you, Daniel!

  • @kaliswarijawacar1501
    @kaliswarijawacar1501 3 роки тому +1

    I am not clear about which technique is used for the which study before observing this video. Now I clear about it . Thank you sir

  • @novaxgreed2828
    @novaxgreed2828 3 роки тому +1

    This video basically summarized two semesters of my statistics subject ngl.

  • @m-tron5966
    @m-tron5966 2 роки тому

    The Simplest and Most Comprehensive Demonstration of Tests...Thank You

  • @mainecervillon5437
    @mainecervillon5437 3 роки тому +6

    Thanks for making it super simple, clear, and easy to understand!

  • @kerrielashley
    @kerrielashley 2 роки тому +2

    Thank you for this video. You did a great job of explaining the various statistical tests and their applicability.

  • @kenmakozume4781
    @kenmakozume4781 Рік тому

    I'm homeschooled and we were tasked with an experimental quantitative research. It is my first research and I'm all by myself. Thank you so much for your simple explanation! You saved me big time. My teacher dosent teach and her modules are hard to understand. Now, I have an idea what statistical analysis I should use for my experiment. Many many thanks again🙏

    • @danielm9463
      @danielm9463  Рік тому

      Feel free to post a comment if you have any specific questions about your research!

    • @kenmakozume4781
      @kenmakozume4781 Рік тому

      @@danielm9463 My research is about aquaponics, how fish waste affects the growth of a plant, specifically Gotu Kola. My experiment will use two containers with different number of fishes to manipulate the amount of fish waste and both have one sample of Gotu Kola. Should I use correlation or t test? Thank you for you Ur reply :)

    • @danielm9463
      @danielm9463  Рік тому

      ​@@kenmakozume4781 This sounds really interesting. There might be a couple ways you could take it. Do you have a way to measure the mass/volume of fish waste, or is the plan to count the number of fish? I'm also curious how you're measuring plant growth--is this done on e.g. a daily basis or at the end of a fixed study period?
      One rule of thumb is that you need ~10 points minimum for correlation or simple linear regression, although you can often get away with fewer data points. ("Simple" linear regression just means there's a single dependent variable and a single independent variable.) So t-test might tend to be more natural for that reason, unless you're planning to collect repeated measures for each tank. If you do a t-test, you're treating your independent variable as categorical, almost as though you simply labeled one "high waste tank" and the other "low waste tank."

    • @kenmakozume4781
      @kenmakozume4781 Рік тому

      @@danielm9463 thank you for the kind compliment. I measure the fish waste amount by weighing scale(grams) and by counting the fish but the scale is more specific. I will be observing the samples for four weeks. Particularly, measuring the diameter and height of the largest and smallest Gotu Kola. Is that a t test??

    • @danielm9463
      @danielm9463  Рік тому

      @@kenmakozume4781 it sounds like you might have two different independent variables, and possibly two different dependent variables. You'll either have to pick one of each, or simply have a few different research questions, each of which you answer separately through a different statistical analysis (e.g., #1 does mass of fish waste impact diameter of..., #2 does mass of fish waste impact height of..., etc.).
      Are you measuring the samples periodically (e.g., at 0 weeks, 0.5 weeks, 1.0 weeks, 1.5 weeks, ... 4 weeks), or are you measuring just at the beginning and the end (i.e., at 0 weeks and at 4 weeks)?

  • @WrenStanchen
    @WrenStanchen 2 роки тому +3

    This helped me a lot, and I'm taking a graduate level class. Thank you!

  • @scarsmakestars
    @scarsmakestars 4 роки тому +2

    I'm in IB Sports Exercise and Health Science but this video helped a lot for choosing the test for my IA. Thanks!

  • @mellizacosminatan7173
    @mellizacosminatan7173 3 роки тому +4

    Thank you so much, this is really helpful. I am even able to understand my own research thoroughly without thinking what was the reason why is it not relationship based.

  • @marshall-milkbread
    @marshall-milkbread 3 роки тому +1

    tysm!! i would've had a breakdown if not for your video 🤧 also, the dun dun dun at 0:08 made me laugh so thanks for that too

  • @razwanulislamasif1190
    @razwanulislamasif1190 2 роки тому +1

    best ever discussion in truly simple way. love from Bangladesh🇧🇩🇧🇩🇧🇩

  • @battlemud577
    @battlemud577 3 роки тому +2

    Omg... I am a year 5 dental student and this vdo finally opened my mind...

  • @marjiehoss1498
    @marjiehoss1498 4 роки тому +19

    I wish you taught me Econometrics. Thank you for this.

  • @arushikumar3044
    @arushikumar3044 2 роки тому +3

    This video is so well made! Thank you for your effort. It's really helping us out.

    • @danielm9463
      @danielm9463  2 роки тому +1

      Very glad to hear that! Check out the caveats I pinned at the top of the comments. Feel free to ask questions!

  • @dr.ujjvalrana5261
    @dr.ujjvalrana5261 4 роки тому +8

    Thorugh and simple to understand. Thanks

  • @anthonyoni4751
    @anthonyoni4751 3 роки тому +1

    I don't know how you did it but you have helped me to know this better.

  • @justiceoforiatta6826
    @justiceoforiatta6826 2 роки тому

    Thanks for enlighten me on how to choose these statistical tests. I will be grateful to learn more.

  • @sebastianheuchler436
    @sebastianheuchler436 3 роки тому +1

    Very brief and to the point. Great for a reasonable first overview of these topics. Thanks

  •  3 роки тому +4

    Haters will say this clip is way too simple, I will respond yes that’s why it’s almost perfect.

  • @BiotechArab
    @BiotechArab 2 роки тому

    Bravo! It is just the thing I needed to refresh my background after a long break

  • @ktemade
    @ktemade 7 місяців тому

    I'm slow when it comes to math (like "if johnny has 5 apples" type slow) and this breakdown finally made things clear

    • @danielm9463
      @danielm9463  7 місяців тому

      Don't hesitate to post any questions you have about your work or what you're studying!

  • @magdalenatobar1420
    @magdalenatobar1420 3 роки тому +1

    Is carbohidrate content continuous or qualitative? Im investogating the effect of carbohidrate percentage on the liberation of CO2 during yeast fermentation and dont know what test I can apply. The prodution of CO2 is presented in a time v/s producction graph. Can anyone help me ?

    • @danielm9463
      @danielm9463  3 роки тому

      This is a little trickier, because each trial produces a separate graph. In order to compare the different graphs, you'll have to figure out how you will distill each graph into a single value. For example, you could compute the average slope from each graph and compare those production *rates,* or you could set a fixed amount of time and compare the *total CO2 mass* during a constant ∆t. There are probably other ways to do it too. For example, maybe CO2 production follows a characteristic graph where it increases steadily, and then it flattens out. If this were true, then perhaps it would be meaningful to measure the amount of time ∆t before to get to the flat part of the graph. Or maybe you have experimental constraints that make certain data collection processes easier than others--for example, perhaps it's easier to measure the total time ∆t to achieve a fixed amount of CO2 production. So there are many different ways to extract a single statistic/value from each graph. The "correct" way to analyze those graphs will depend on the specific scenario you're dealing with, and I'm not as knowledgeable about the biology part of it.
      But there's something important to keep in mind: the way you process the data will slightly change your research question, and you may need to adjust your research question to reflect this. For example, if you choose to compare CO2 production rates (in grams/second) for different carbohydrate percentages, then maybe your research question should refer to the *rate* of carbon dioxide production. You want to make sure the question is phrased to match your analysis. Otherwise, your analysis is answering a different question.
      Ultimately, your dependent variable is most likely going to be numerical/continuous, because it will probably be something like CO2 production rate (e.g. in grams/second), or total mass (in grams), or total time (in seconds), etc. And your dependent variable is ultimately carbohydrate percentage, as you've said. So if you run 10 trials with 10 different carbohydrate percentages, then you're going to have 10 graphs, which you perhaps convert into 10 different rates. Then, you could produce a single final graph showing CO2 production rate against carbohydrate percentage. The point I wanted to make is that you'll probably have e.g. 10 raw data graphs, and then 1 processed data graph (or maybe more, depending on how advanced your analysis gets).

  • @ainazarmstrong5615
    @ainazarmstrong5615 2 роки тому +2

    This video perfectly cleared everything up, thank you so much.

  • @naziya9967
    @naziya9967 3 роки тому +9

    Iam from india , teaching research to post graduate students of education department.feel very helpful to understand overall category /families of statistics test.very excellent content of video 👍👍👍👍

  • @sowmyaramaswamy6686
    @sowmyaramaswamy6686 Рік тому

    So helpful. Thank you! The basics make it easy to understand everything

  • @zucledesma
    @zucledesma 3 роки тому +4

    Very clear explanation. Thank you so much.

  • @alziraangeli9707
    @alziraangeli9707 3 роки тому +3

    Awesome video! Thank you so much, Professor! Cheers from Brazil!

  • @bastawesybastawesy
    @bastawesybastawesy 3 роки тому

    JEEEZZ YOU JUST SAVED MY LIFE, best explanation video ever hands down and best 9 minutes of my life ! xx

  • @shahidbud3862
    @shahidbud3862 3 роки тому +1

    Comprehensive and over simplified.A great video.

  • @marwaa.6759
    @marwaa.6759 4 роки тому +1

    simply perfect and clear explanation . Thanks for your great effort

  • @roopalivs6782
    @roopalivs6782 4 роки тому +6

    Beautifully explained and simplified...... keep going 👍🏾

  • @albertsays2678
    @albertsays2678 3 роки тому +2

    Cant correlation analyses deal with both continuous and categorical data? Cant we say that some categorical data have relationship to some continuousa data?

    • @danielm9463
      @danielm9463  3 роки тому +2

      Absolutely. That sort of analysis doesn't happen in an IB Biology classroom, but in other contexts of applied statistics, what you describe is very common. It's possible for us to do a linear regression using categorical variables that we encode using 'dummy variables' that take on a value of 0 or 1. In multivariate contexts, we can even test for interactions between the variables by multiplying a categorical dummy variable by another independent variable that is continuous.

    • @danielm9463
      @danielm9463  3 роки тому +2

      And in many contexts where we do a linear regression, we want to look at how well the model fits the data. So we looks at R^2 values, etc. to evaluate goodness of fit.

  • @PriitPauklin
    @PriitPauklin 4 роки тому +3

    Great job! It helped a lot in explaining the basics! Nice work!

  • @NinjaCoderInTraining
    @NinjaCoderInTraining 5 років тому +5

    Thanks man. Best explanation I've seen so far 👍

  • @yumi-yw6fu
    @yumi-yw6fu Рік тому +1

    so if i have categorical data my research us qualitative?

    • @danielm9463
      @danielm9463  Рік тому +1

      Good question! The answer is "it depends." One correction: you asked about "categorical data," but instead we should say "categorical variable." The variable is categorical vs. continuous. The data are quantitative vs. qualitative. This video didn't talk about qualitative data and was only focused on quantitative data. Categorical variables can produce quantitative and qualitative data. So can continuous variables--they can generate qualitative and quantitative data. Some examples of this are below.
      EXAMPLE 1 (categorical variables):
      Research question: do children prefer pizza over salad at a higher rate than adults? I will stand on the street and random ask people who walk by what they prefer.
      Quantitative data collected:
      Pizza Salad
      Child 24 13
      Adult 6 17
      Qualitative data collected:
      Children: they asked more questions. Several children what type of pizza and what type of salad. Adults: they answered more quickly and didn't have conversations about the topic. They seemed to take the question seriously.
      EXAMPLE 2 (continuous variables):
      Research question: does height increase linearly with age during childhood? I will pick one person at each age 2, 4, 6, ... 18 and measure their height with a tape measure.
      Quantitative data collected:
      Age Height (inches)
      2 35
      4 39
      6 43
      8 52
      10 54
      12 58
      14 63
      16 62
      18 64
      Qualitative data collected:
      Some of the younger children had a difficult time standing up straight. The child aged 2 and the child aged 4 were from the same family. All participants were born in Canada.

  • @mohammedabdulrazzaq7193
    @mohammedabdulrazzaq7193 3 роки тому +1

    I am a med student at Orebro University Sweden, and yet this video was with no comparison far away better than the lecture we had at school, Gret work man, best of luck

    • @danielm9463
      @danielm9463  3 роки тому +2

      Tack för dina vänliga ord! I'm really glad the video helped. It is a bit simplistic, but if you have any questions, don't hesitate to ask them in a comment!

  • @erwincarlogonzales
    @erwincarlogonzales 3 роки тому

    This video is golden for MS and PhD students

    • @danielm9463
      @danielm9463  3 роки тому

      I'm glad it's helpful! If you're an MS or PhD student, it could be useful for big-picture frameworks. But just a word of caution--it does make some simplifications. I've gotten comments from grad school professors that the distinctions here are less valid at a more advanced level, and that this video ignores e.g. generalized linear models, which extend regression to categorical variables. Those things are all true, but they're too advanced for an introductory level. I made this video for high school students with no background in statistics who have the task of determining an appropriate statistical test to use for their research project.

  • @rediettekle7377
    @rediettekle7377 3 роки тому +1

    Wow this simplified everything so much, thank you

  • @zaina9011
    @zaina9011 3 роки тому +1

    Best ever teaching technique....

  • @sallsall3
    @sallsall3 2 роки тому +2

    I also use chi square test for my accounting research bcz it has nominal and ordinal scale
    Not just for biology subject

  • @joyofall30
    @joyofall30 5 місяців тому

    Just starting to watch the video and loving your video

  • @mylearning7504
    @mylearning7504 3 роки тому +1

    Best video on statistical tests

  • @ikmalhakim1101
    @ikmalhakim1101 3 роки тому +1

    How do you proceed from one-way ANOVA? How do you figure out how many and which one is statistically significant?

    • @danielm9463
      @danielm9463  3 роки тому

      One approach (though I'm not sure the extent to which this is done) is to proceed with individual t-tests that run through all possible pairwise comparisons. However, doing this introduces family-wise error. For example, if you do 10 follow-up pairwise t-tests using a significance level of 5%, then the cumulative chance of a false positive in any of the 10 tests is 1-(1-α)^10 = 40%. If the true Type 1 error rate is 40%, then it would be inappropriate to claim that our significance level is α = 5%. So the significance level must be adjusted to compensate for the family-wise error. One approach is the Bonferroni correction, which simply entails dividing the significance level α by the number of pairwise tests that are planned (or conducted post-hoc). If a cumulative Type 1 error rate of 5% is desired, then a significance level of 0.05/10 = 0.005 would be used. The more precise formula gives us the exact cumulative Type 1 error rate for 10 tests each using α = 0.5%. It is: 1-(1-.005)^10 = 4.9%.

  • @ma.angelicapattaguan7225
    @ma.angelicapattaguan7225 3 роки тому +1

    What if the purpose is to find the relationship of categorical data?

    • @danielm9463
      @danielm9463  3 роки тому

      Categorical IV and continuous DV: t-test
      Continuous IV and categorical DV: logistic regression (this isn't covered in the video)
      Counting frequency of 2 categorical variables in order to determine if they're independent: Chi squared test of independence

  • @DrAamir580
    @DrAamir580 Рік тому +1

    Sir you just nailed this video, you made me so damn clear about correlation. Thanks alot for your precious video😍

  • @Melissa_Official454
    @Melissa_Official454 Рік тому

    One of the best explanations. Thanks much.

  • @impossiblesoul
    @impossiblesoul 2 роки тому +1

    which family would a test which has the purpose as relationship but have both categorical and continuous data go into ?

    • @danielm9463
      @danielm9463  2 роки тому

      Can you give an example? It would depend on which of the independent and dependent variables are categorical vs. continuous. The answer also depends on the level you're working at (e.g., high school, undergraduate/ university, graduate). There's a lot of simplification in this video, and the amount of detail quickly grows for students doing higher-level statistics.
      For example, if you have a categorical dependent variable, some categorical independent variables, and some continuous independent variables, then you use logistic regression (for a binary dependent variable) or multinomial regression (for a dependent variable that can take more than 2 levels/values) to evaluate the relationships of the predictors on the outcome.
      Similarly, if you have a continuous dependent variable, some categorical independent variables, and some continuous independent variables, then you can use multiple linear regression (as well as generalized linear models, e.g., you can use a Poisson regression for count data) to evaluate the relationships of the predictors on the outcome.

    • @impossiblesoul
      @impossiblesoul 2 роки тому

      @@danielm9463 for example i have a question which asks which test to use if we are trying to understand the relationship between car speed and stopping distance using 3 makes of tyre. i thought you can use a two way anova but i keep getting confused because we have more than two variables.. the way the question is worded makes me think that the tyre make is the response variable but in that case wouldn’t we use a linear regression …
      (so the car speed increases at increments , the stopping distance is measured , and then the car tyre type is changed and the experiment repeated)

    • @danielm9463
      @danielm9463  2 роки тому

      @@impossiblesoul There are a couple ways you could go with this. Generally, though, I think the question is asking: how does [the tyre make] impact [the relationship between car speed and stopping distance]?
      Are you doing this in a physics class, and if so, have you learned what physical quantity is responsible for one tyre having longer vs. shorter stopping distance? The force that brings the car to rest is the force of kinetic friction (also called dynamic friction), but that's an extrinsic property that depends on how hard you push the breaks. There's a different quantity that is intrinsic to the tyre material (and the material you're driving on), which relates to friction. If everything else is the same (same car mass, same initial velocity, final velocity = 0, and g=9.8m/s^2), then can you derive an expression between the stopping distance s and the quantity? As a hint, you would need Newton's second law and this kinematic equation: v^2 = u^2 + 2(a)(s).
      But back to your question: there is a numeric quantity that could replace [the relationship between initial car speed and stopping distance]. Then, the question would be asking: "how does [the tyre make] impact [this numeric quantity]?" And now you have a categorical variable (tyre make) impacting a continuous variable (the numeric quantity you may have learned about which pertains to friction).
      If you're doing this in a higher-level stats course, and not a physics course, then you have two independent/predictor variables, and they interact (the effect of speed on stopping distance changes based on the tyre make). In this case, the question is asking you to quantify the interaction between tyre make and speed.

    • @impossiblesoul
      @impossiblesoul 2 роки тому +1

      @@danielm9463 Thank you for the reply. I am doing a stats course and this question is more of us trying to understand what test can be used in this situation.

    • @impossiblesoul
      @impossiblesoul 2 роки тому

      Using what you said with there being two independent variables, I am going to with a two way ANOVA test

  • @psychologykaTopper
    @psychologykaTopper 7 місяців тому

    In my research, there are two groups: control and experimental. Both of these groups gave pre- and post-tests. Which test should I use? In the experimental group, there are 100 people, and in the control group, there are 100 people.

    • @danielm9463
      @danielm9463  7 місяців тому

      If each person took a pre-test and a post-test, and assuming your outcome metric is numeric, then you could use a paired t-test where you compare the difference (post minus pre) among the two groups (control vs. treatment).

    • @psychologykaTopper
      @psychologykaTopper 7 місяців тому

      @@danielm9463 thanku so much, If my population is larger, do I still need to use a paired t-test over a z-test?

  • @dirkaloha
    @dirkaloha Рік тому

    Reaffirmed what I am beginning to grasp. Thank you!

  • @dharris6357
    @dharris6357 2 роки тому +1

    Best summary video ever!!

    • @danielm9463
      @danielm9463  2 роки тому

      Thank you!!

    • @dharris6357
      @dharris6357 2 роки тому +1

      @@danielm9463 The world needs an @danielm9463 full statistics course series. I’m taking a statistics for the social studies. I’ve watched tons of videos on UA-cam. I’ve not seen a tutorial done this well. So, we need more! You’ve got a gift ❤️

    • @danielm9463
      @danielm9463  2 роки тому

      ​@@dharris6357 ❤❤❤ thank you!! I would love to make a full stats video series and hope to have time for that some day soon. What types of topics do you think would be most useful? Given that you study stats in your class already, is it the big-picture overview that is most useful, or would it be helpful to have videos on the mechanics of e.g. how to do a t-test and why/how it gets modified when there are unequal sample sizes, unequal variances, etc.? Or is it more helpful have videos with big-picture overview / how to look at different scenarios and see which test would apply?

  • @rekharana6562
    @rekharana6562 10 місяців тому

    Although the video is in English and my native language is Hindi, but I found it very much clear and informative.... thank you so much

  • @shadowmonarch809
    @shadowmonarch809 3 роки тому +2

    How about if our study is fnding the relationship of modular learning but it is categorical and 1 continuous which is the academic performance? Will we use correlation or T-test?

    • @danielm9463
      @danielm9463  2 роки тому

      Check out the summary at 4:08 of the video. You have a categorical independent variable and a continuous dependent variable. You're comparing the categorical groups (the modular learning) on the basis of the continuous outcome (academic performance). Which do you think is more appropriate--t-test or correlation?
      When you get into higher-level stats, you'll learn that actually a t-test can be replicated in a regression model. But for now it's helpful to think about whether you're (a) testing if two groups are different on the basis of their continuous outcome variable (that's a t-test, or comparing groups) or (b) evaluating the strength of correlation / the quantitative effect of your predictor on the outcome (that's correlation/regression, or looking for a relationship).

  • @fazeelzubair2787
    @fazeelzubair2787 3 роки тому

    Hi. How to test association in a 2x2 table where 2 adj cells in a row have values 0.
    Data is non parametric

    • @danielm9463
      @danielm9463  2 роки тому +1

      If it's a 2x2 table, and 2 adjacent cells have counts of 0 in a single row, then doesn't that row simply have no observations at all? It sounds like you only have data for one of the groups. In general, the non-parametric approach for a 2x2 contingency table is Fisher's exact test, but each row needs to have some data.

  • @anettedab6036
    @anettedab6036 2 роки тому

    What type of question would be: How does UV light affect vegetable decay? and what type of test would you recommend.

    • @danielm9463
      @danielm9463  2 роки тому

      How are you measuring UV light? You could potentially measure it as a continuous variable (intensity of the light), or as a categorical variable (presence of UV light vs. absence of UV light). That will change which test is appropriate. Similarly, how are you measuring vegetable decay--is it continuous? (e.g., you're measure the remaining mass after some controlled amount of time, or you're measuring the time until a fixed quantity of decay has occurred.)

  • @christophers.villaraza2841
    @christophers.villaraza2841 3 роки тому +1

    Crystal clear explanation!

  • @victorsolomon977
    @victorsolomon977 Рік тому

    Thanks a million, you've done excellent. How do I make use of regression analysis for this topic " The impact of smartphone overuse on student academic performance "

    • @danielm9463
      @danielm9463  Рік тому

      I would be glad to help! First, can you state what your independent variable (IV) and dependent variable (DV) are? When you respond, please make sure your IV and DV are numeric. For example, "smartphone overuse" isn't numeric because you can't say "smartphone overuse=3.5." However, instead of "smartphone overuse," there's something very similar that you could use instead, which *is* measurable / quantifiable with a number value. Another important question--how will you measure your variables and collect data?

    • @victorsolomon977
      @victorsolomon977 Рік тому

      I AM GLAD YOU RESPOND! AS FOR THE DV, I WILL GET THE RESPONDENTS CGPA FROM THE SCHOOL RECORDS, WHILE IV, I'LL USE LIKERT SCALE (SA, A, N,D,SD) AND THEN QUANTIFY EACH WITH NUMERIC CODE, WITH EACH CODE REPRESENTING (FOR INSTANCE;SA-1,A-2 etc). THEN DURING THE ANALYSIS, I'LL ASSIGN A QUALITATIVE MEANING TO EACH CODE eg '1- VERY EFFECTIVE', '2- EFFECTIVE' etc, WHICH WILL AFTER ANALYSIS RESULT IN IV REQUIRED FOR REGRESION. THIS IS ALL WHAT I'M THINKING. I AM NOT SURE IF IT WILL BE POSSIBLE AT ALL
      @@danielm9463

  • @muzammilliaket1223
    @muzammilliaket1223 Місяць тому

    Thank you ain't enough for this. Rather, I have to say, love you.

  • @eslavathpuramdas8107
    @eslavathpuramdas8107 3 роки тому

    Which test will be employed for comparison of dissolution profile between two products having continuous variable. Thanks for your time.

    • @danielm9463
      @danielm9463  2 роки тому

      Do you want to compare two curves/functions? This would be a more complex procedure, and the appropriate technique will be guided by conventions specific to industry/domain. For example, you might read through this:
      blog.eglifesciences.com/dissolution-analyses-comparison-of-profiles-using-f2-analysis-calculation
      Techniques like the Kolmogorov-Smirnov test might be available to you, or if a single defining quantity can be extracted from each dissolution curve (with standard error), you could potentially do a comparison that way.

  • @arpitachopra27
    @arpitachopra27 Рік тому

    I am.a little confused. I am looking at "To what extent does Altering Cooking Methods (Boiling, Heating, Microwave) change the iron concentration (µg/dL) of Spinacia oleracea (50 grams)?". so how does different cooking methods impact the concerntration of iron concentration. I will do a spectrometer test, and find the iron concerntration of spinach using known concerntration of iron and measuring their absodance. i will createa a culliburation curve, and then use that to find relationship between the concentration of iron and the type of cooking method. what kind of test would i use to find the relationship?

    • @danielm9463
      @danielm9463  Рік тому

      This is a great question. Reply back to this comment, and identify (a) what your independent variable is, (b) whether your independent variable is categorical or continuous, (c) what your dependent variable is, and (d) whether your dependent variable is categorical or continuous. Once you've done that, we'll take the next question together which will help you pick a statistical test to evaluate whether different cooking methods lead to different mean concentrations.

  • @w.k7866
    @w.k7866 3 роки тому +1

    Thank you. Best explanation on youtube!

  • @rupalidesai6492
    @rupalidesai6492 3 роки тому

    the video cant be seen..is there a problem?

  • @melekayran
    @melekayran Рік тому

    Hello! Can I do a d
    statistical search as CHI SQUARE TEST for my IA to find out the relationship between two plants. Thank you.

    • @danielm9463
      @danielm9463  Рік тому

      Before we can answer that question, you'll need to define your independent variable and your dependent variable.
      - What are you changing intentionally and in a predetermined fashion from one trial to the next? (This is your independent variable.)
      - What are you measuring as the response variable, which changes in response to the independent variable? (This response variable is your dependent variable.)

  • @smitatharkar6798
    @smitatharkar6798 4 роки тому +2

    Great video, I understood the concept clearly.. thanks alot!

  • @bibekmalla8027
    @bibekmalla8027 3 роки тому +1

    Best teaching method

  • @preetamgurav4139
    @preetamgurav4139 3 роки тому +1

    Sooo good I literally cried