Correlation Doesn't Equal Causation: Crash Course Statistics #8

Поділитися
Вставка
  • Опубліковано 18 січ 2025

КОМЕНТАРІ • 234

  • @snowballeffect7812
    @snowballeffect7812 6 років тому +375

    This needs to be mandatory viewing for EVERYONE.

    • @cross4326
      @cross4326 6 років тому +7

      I second that!

  • @sugami82
    @sugami82 6 років тому +268

    "Correlation does not equal causation" was my old stats teacher's favourite phrase along with "always interpolate, never extrapolate." :)

    • @xsaberfaye
      @xsaberfaye 6 років тому +15

      Extrapolation is actually necessary in certain circumstances though - for example predicting growth of global human population, economic forecasts, environmental forecasts regarding climate change.... anything that has to do with the future.

    • @Unordung
      @Unordung 6 років тому +6

      Post hoc ergo propter hoc!

  • @SilortheBlade
    @SilortheBlade 6 років тому +225

    Bah. I know my rock keeps away tigers because I have never seen a tiger for as long as I have had it.

    • @Tuckems
      @Tuckems 6 років тому +2

      SilortheBlade Makes sense to me

  • @Pfhorrest
    @Pfhorrest 6 років тому +232

    Nicholas Cage movies are correlated by yet another unmentioned variable: summer. Nicholas Cage is an action movie star. Action movies are generally targeted for summer releases. Summer is also hot, which is the cause behind air conditioner sales and swimming, the latter of which is of course the cause of drowning.

    • @aido92
      @aido92 6 років тому +59

      Pfhorrest Or it could be that people who have endured a Nicholas Cage movie are more likely to drown themselves ...

    • @polyjohn3425
      @polyjohn3425 6 років тому +12

      That's true, but the data shows a close correlation over multiple years, not just over the seasons of a given year. It just so happens that the summers of years with more Nicholas Cage movies also happen to have more drownings.

  • @murphygreen8484
    @murphygreen8484 6 років тому +26

    This has been my favorite CrashCourse season by far. Really enjoying the material and the host!

  • @qilinxue989
    @qilinxue989 6 років тому +650

    *Me:* I used to think correlation implied causation.
    *Me:* Then I watched this video. Now I don't.
    *Friend:* Sounds like the video helped.
    *Me:* Well, Maybe.

    • @polyjohn3425
      @polyjohn3425 6 років тому +26

      lol. Well, probably.

    • @jedisentinel4879
      @jedisentinel4879 6 років тому +21

      The video explains that it's not because two elements are correlated that one is the cause of the other. One '''can''' be the cause, but it's not logical to imply it just from their correlation. It was not the floor itself that broke the glass even though it is related to the breaking, it was it's impact with the glass, '''caused''' by gravity.

    • @verdatum
      @verdatum 6 років тому +25

      XKCD is a pretty good comic :)

    • @HerodotusVon
      @HerodotusVon 6 років тому +8

      Kachimbo somebody missed the joke

    • @noobnoobyify
      @noobnoobyify 6 років тому +10

      Herodotus Von 8428 no, someone got the joke, but felt the need to expand our knowledge.

  • @aude1979
    @aude1979 6 років тому +14

    A class on non linear relationships would be FANTASTIC :) And more classes in general (e.g., on general versus mixed effects models; GAMs etc...) Thank you for your dynamism!

  • @laterkater4213
    @laterkater4213 6 років тому +34

    Better explanation then my university level stats class. 👍

  • @Deedj1
    @Deedj1 6 років тому +24

    Everyone needs to see this! Just because things seem connected on the surface doesn’t mean they’re related and Visa Versa!

    • @josephyml
      @josephyml 6 років тому

      psst. its vice versa, not visa versa

    • @legalfictionnaturalfact3969
      @legalfictionnaturalfact3969 6 років тому +1

      and if they're not connected then they DON'T CORRELATE. this shit's a red herring.

    • @legalfictionnaturalfact3969
      @legalfictionnaturalfact3969 5 років тому +1

      @@improover113 talking specifically about causal relationships, as the phrase states explicitly

  • @BlackCatGodess
    @BlackCatGodess 6 років тому +34

    Puppy cat! I didn't know that they'd made a stuffed animal of him. This has greatly improved my day.

  • @jesusosegueda422
    @jesusosegueda422 6 років тому +7

    Crash Course, thank you so much. This awesome course is definitively above the curve!

  • @akankshaandadityasingh9888
    @akankshaandadityasingh9888 6 років тому +74

    When she apologises for using imperial units......

  • @greyareaRK1
    @greyareaRK1 6 років тому +45

    I haven't watched Nicholas Cage movies, AND I haven't drowned. Aha!

  • @Runsheeg53633
    @Runsheeg53633 Місяць тому

    This is the best tutorial I have watched on this topic.

  • @ramseszeeman4076
    @ramseszeeman4076 Рік тому +2

    without you guys i would not pass my exams thank you so much

  • @aaronmarks9366
    @aaronmarks9366 5 років тому +6

    "Air Cons, and Con Airs"
    Amazing

  • @JackieChenpi
    @JackieChenpi 6 років тому +20

    Watching Stat for fun again.

  • @easysnake205
    @easysnake205 6 років тому +31

    I feel some people go so far in this argument that they seem to argue the correlation disproves causation.
    Eg. "thats only correlation it doesnt prove causation, obviously you are wrong"
    Yes correlation doesnt prove causation, but it most definitely does not disprove causation. Further it might suggest causation, or that a 3rd factor is causing both phenomena to occur. Its frustrating to give data in an argument, to have the other side counter with, "thats only correlation, it doesn't prove causation, you are wrong."

    • @patrickanderson1993
      @patrickanderson1993 6 років тому +1

      EasySnake 100% agree

    • @ShaudaySmith
      @ShaudaySmith 6 років тому +1

      i've seen this too! It irks me to no end.

    • @rosettahyatte8010
      @rosettahyatte8010 6 років тому

      This is crash course statistics and statistics is all about probability?

  • @KASSISHOT
    @KASSISHOT 6 років тому +11

    Every time I see one of these videos I look at the view count and know that there's that many more people out there that are better educated about this topic and that makes me very optimistic for the future keep up the great work guys

  • @earth2ellie
    @earth2ellie 6 років тому +4

    “Mr. Fluffy misses you.”
    *pouts thinking of the cat I don’t have missing me*

  • @mielthebee
    @mielthebee 2 роки тому +5

    "..if people blink more when they're lying!"
    Our Professor: 😳

  • @davidsweeney111
    @davidsweeney111 6 років тому +18

    This needs to be essential viewing for EVERYONE.

  • @txt3567
    @txt3567 5 років тому +2

    Thank you so much for sharing. You're so much better at explaining than my professor.

  • @ElforTheLandstander
    @ElforTheLandstander 6 років тому +1

    This was the funniest Crash Course video I've ever seen. Her comedic timing is excellent. Though I still don't know if that clever mayor was a man or a woman.

  • @thegodofalldragons
    @thegodofalldragons 6 років тому +3

    I've seen people both conflate correlation with causation in situations that are clearly coincidence and insist that correlation does not equal causation when the pattern of cause and effect are obvious.

  • @aaronmarks9366
    @aaronmarks9366 5 років тому +10

    I wish all my scatterplots ended up making pictures of dinosaurs.

  • @flippersnyder
    @flippersnyder 6 років тому +4

    So this was great. You are definetly one of my favorite crash course hosts. And I took statistics back in 1994. I have one question that boggles me. When and who is right, who determines the reality or that there is causation?
    Example .... cigarette smoking and lung health. The negative effects are clearly visible, the correlation is there ... but is it really the cause? When and how do we get to a positive causality?
    Or is it left to the interpreter? Or is it just all relative? Or by the end of the day it's meaningless and everyone can make the statement "correlation doesn't equal causality" and your data and beautiful charts and correlations just fizzle out?

    • @24680kong
      @24680kong 6 років тому +2

      That's the tricky part! Ultimately they all need to be interpreted. Overall, there is no true "proof", just higher levels of confidence. I am confident that the city of Paris exists, even though I've never been there. The process generally starts by asking "is this even possible?" and "Does this make some sense?" Then you can go back and try to find some other cause of the data you got. Eventually, you have to do experiments carefully. But even well-planned experiments can have hickups and biases (there have been many cases of seemingly high-confidence experiments not being repeatable by other professionals). Often, multiple experimenters need to come up with the same results on their own (and usually with their own equipment) before the scientific community is convinced. Overall, it's a difficult and time consuming process.

    • @sammyinengland
      @sammyinengland 6 років тому +3

      In health data like the lung example, there is a set of criteria called the Bradford-Hill criteria. Google it. This is criteria for determining if something can be considered causation. It is not a checklist: you still need to do your own scientific interpretation. But it’s a good way to get an idea of whether the data your looking at implies causation or not. The criteria are: effect size, consistency, specificity, temporality, biological plausibility, dose-response relationship, coherence, analogous results. Interestingly, Bradford Hill who came up with this list, is the same Hill who co-authored the original Doll and Hill paper that established the linked between smoking and lung cancer!

  • @PatrickMichaelOLeary
    @PatrickMichaelOLeary 6 років тому +3

    At 0:27, it must have taken everything you had to not blink.

  • @lovepeople951
    @lovepeople951 6 років тому +11

    Thank u Crash Course

  • @НикитаКорниенко-й6р

    Как же замечательно вы рассказываете! Даже переводить ничего не надо! (Russian is deliberate here)

  • @xmems
    @xmems 6 років тому +12

    Love this upload 😍

  • @rkpetry
    @rkpetry 6 років тому +2

    ...how do you fit a regression line through a circle (or fat ellipse) on a 2D-scattered, plot...
    ...how do you define accuracy where there are fewer data points, even though the fitted-curve looks similar, (do you overlay random information certitude measure sigma bars)...
    *_...(in case you missed the first question: flip the plot axes for a different regression line...)_*

  • @GameOver321
    @GameOver321 2 роки тому +1

    wow! Thank you

  • @gymotc
    @gymotc 4 роки тому +1

    Excellent video! Thank you!!!

  • @UnknownRefrigerator
    @UnknownRefrigerator 6 років тому +1

    I love this series! However, you made one, small lie: R^2 does not have to be between zero and one, but can in fact be negative.
    You spoke of the mx + b, but failed to mention what value it has to determine b (and if chose horribly wrong, it can give you negative R-values, due to estimate a model that is worse than random).
    Keep up the series! :)

    • @mishadonchenko4362
      @mishadonchenko4362 6 років тому +3

      Squares of real numbers are always nonnegative, by definition. They can never be less than zero -- the square of -5 is 25, for example.

  • @MakeMeThinkAgain
    @MakeMeThinkAgain 6 років тому +4

    I was TRICKED into watching this by the title. How hard would it be to add, "WARNING! THIS IS STATISTICS, DWEEB" to what appears on my temptation screen?
    It was really good.

  • @verdatum
    @verdatum 6 років тому +1

    Anecdotally, after playing Simpsons: Hit & Run (a GTA clone), I genuinely drove more recklessly for a little while. Not like I got into an accident, but like I was cutting corners tighter, and being a little heavier on the pedal. I had to work at it to knock it off. Really really good game though.

  • @maruisaiahnapa7381
    @maruisaiahnapa7381 6 років тому +1

    I was JUST reading up on this in class! 😂

  • @ginohobayan001
    @ginohobayan001 Рік тому

    Thank you!!!
    Learned so much from this video.

  • @youknuckle
    @youknuckle 6 років тому

    Love this video and the channel, also - @1:43 You've spelled eruptions wrong...

  • @ternvall
    @ternvall 6 років тому +4

    y = mx + b , is this some American standard? In Sweden it's y=kx+m

    • @HeinerS
      @HeinerS 6 років тому +1

      It doesn't really matter either way. The general consensus is that the last letters from the latin alphabet, i.e. x, y and z are being used as placeholderds for unknown quantities, whereas letters from the beginning (e.g. a, b and c) or middle (e.g. k, l, m and n) are being used as placeholders for known quantities (to be supplied or deduced when doing a specific example). The placeholders for know quantities may be different in different countries for many reasons (ease of pronounciation, legibility, tradition, etc.). Tradition usually also means that often the same equation uses different placeholders in math and physics. Example: in Math class the may use y = ax + b, in Physics class they may use y = mx + c, just because ... (and then of course in the kinetic equations this becomes e.g. v = at + v0 representing physical quantities).

  • @SangoProductions213
    @SangoProductions213 6 років тому +5

    Correlation does not neccesarily state that causation is found between two variable.
    However. don't walk away thinking correlation disproves causation. This isn't politics. There are more than two possibilities. (There are in politics too, but ignore that.) Thanks, and have a good day.
    As a final note: Time taken to get from point a to point b is negatively correlated with speed. There is (by definition no less) causation there.

    • @verdatum
      @verdatum 6 років тому +1

      Sango, that's a good tip. But I fear that addressing people as the "scientifically illiterate" might not be the best way to get your message across. (What I would give for Crash Course: Rhetoric).

    • @SangoProductions213
      @SangoProductions213 6 років тому

      Everyone was illiterate (scientific and otherwise) at one point. It is one's duty to make sure they do not continue to be.

    • @xsaberfaye
      @xsaberfaye 6 років тому

      There is no causation only chaos.

    • @verdatum
      @verdatum 6 років тому +1

      It is absolutely true that everyone begins illiterate, and there should be no shame in that. However, referring to people as such can cause them to misinterpret your message as being condescending, even though you had no intention to be that way. Regardless, they are now slighted, and in retaliation, they ignore your advice, no matter how reasonable it was.

    • @verdatum
      @verdatum 6 років тому

      kaizersabre, there is no Dana, only ZOOL.

  • @yetigriff
    @yetigriff 6 років тому +13

    That's not the graph Jim Carrey and Jenny McCarthy showed me.

  • @childfs6865
    @childfs6865 6 років тому +18

    Comment containing the word EVERYONE in caps lock.

  • @mariafranciscalopez3694
    @mariafranciscalopez3694 6 років тому +1

    Me: focus, you have a test this week
    Also me: OMG PUPPYCAT!!

  • @hem89180
    @hem89180 6 років тому

    Love the series!!!

  • @h0rban
    @h0rban 6 років тому

    You have mentioned that the steep line can have a strong correlation but there was no support of the graphics. Emphasis for users: the slope and correlation are different

  • @teen-at-heart
    @teen-at-heart 6 років тому

    Good episode, but some things would need exercise and ‘usage’ in order to be memorized well and longer-term, like r and r squared.

  • @kevinye1041
    @kevinye1041 4 роки тому +2

    Squared correlation r^2
    Line of regression
    Can anyone explain a little more in depth standard deviation? Im still not sure what information it tells us in a scatter plot

  • @darrenreuben4222
    @darrenreuben4222 6 років тому

    this was an awesome video

  • @MrGustaphe
    @MrGustaphe 6 років тому

    The example of changing the units on the y-axis is only relevant if you're not doing your dimensional analysis properly. If the slope of the feet-feet plot is 0.5, then the slope of the meter-feet plot is 0.15m/foot=0.5

  • @tvvt005
    @tvvt005 7 місяців тому

    Just noticed puppycat on her table! 💗

  • @MaureenMurphy_
    @MaureenMurphy_ 6 років тому

    Thank you for thissss!!

  • @luminias.upscmentor
    @luminias.upscmentor 6 років тому

    Gain in my knowledge is perfectly correlated with the number of crash course videos I watch and shows the value of absolute +1 as the correlation coefficient #CrashCourse ..... 😁😁😁

  • @xxtheswagger8xx263
    @xxtheswagger8xx263 4 роки тому +12

    who is here for school

  • @xionpentagast
    @xionpentagast 6 років тому

    Loved it!

  • @maddijackson_design
    @maddijackson_design 6 років тому +2

    Please do more literature!!

  • @gamereditor59ner22
    @gamereditor59ner22 6 років тому

    Great video!!!😊

  • @amohamoud3992
    @amohamoud3992 6 років тому +1

    While taking my stats course I started sleep talking and explained empirical rule to my mon

  • @怪兽宇
    @怪兽宇 6 років тому

    很棒的视频, 对学习统计学非常有帮助

  • @daniels4209
    @daniels4209 6 років тому

    Thank You.

  • @StKozlovsky
    @StKozlovsky 4 роки тому

    3:12
    > Hummer, the epitome of in-your-face Americanness
    > Russian license plate

  • @malteeaser101
    @malteeaser101 4 роки тому

    If A caused B then there is a correlation between A and B.
    The rising of the Sun caused the eating of an ice cream by John.
    Therefore, there is a correlation between the rising of the Sun and the eating of an ice cream by John.
    My question is, how would you quantify those events and plot the correlation between them on a graph? Would I count the number of times these events occurred? What if an event only causes another once? What if John died after the first ice cream? Can we still say that there was a correlation?

  • @elijahsassercollins3685
    @elijahsassercollins3685 5 років тому +4

    now go teach the media this so they can stop blaming video games for all the worlds problems

  • @ibnufajar8733
    @ibnufajar8733 6 років тому +2

    does the "r²=0.7" mean that we could predict accurately by 70% ?

  • @omarkhalaf7014
    @omarkhalaf7014 4 роки тому +3

    Wait... Technically everything is connected. Maybe the relationship between 2 variables are correlated even tho it doesn't make sense that they cause each other, but that happens because these 2 variables are connected to other variables that we didn't observe yet these variables can indirectly influence the relationship between the main 2 variables we are comparing. So I guess that means, one way or another, correlation DOES imply causation. Error 404

  • @ComedyCorner619
    @ComedyCorner619 6 років тому +1

    Hello great video

  • @AnanthaSKrishnan
    @AnanthaSKrishnan 6 років тому

    @crash course team, not all the graphs in the datasaurus dozen shown in the end doesn't seems like having same correlation coefficient. Few look like having r=1, few r=0. Please correct me if I'm wrong

  • @HrishikeshPalande
    @HrishikeshPalande 5 років тому

    We don't predict the temperature in Fahrenheit we calculate it using the formula (c*9/5)+32

  • @Angelusloco15
    @Angelusloco15 5 років тому

    EXCELLENT!

  • @ZoggFromBetelgeuse
    @ZoggFromBetelgeuse 5 років тому

    I watched this video without having seen the previous ones, and spent a considerable amount of time wondering "what the heck is an 'old faithful eruption' ?"

    • @ZoggFromBetelgeuse
      @ZoggFromBetelgeuse 5 років тому

      (For those who have the same problem: "Old Faithful" seems to be the name of a geyser. (I don't know where it is, but when an English UA-cam show refers to a location, person, event or sports ritual you have never heared of, you can be pretty sure it's in North America.)

  • @unacomn
    @unacomn 6 років тому +5

    I don't know, Nic Cage may be dragging people to the deep after they see his movies. The evidence is there.

  • @tvit
    @tvit 6 років тому

    Those movie computer tick noises (when charts are presented) drive me mad, and I don't even have EQ in my setup to damp them down. Good vid though!

  • @archiegent1357
    @archiegent1357 6 років тому +5

    Y = mx + b?i thought it was c

    • @ultimaterex
      @ultimaterex 6 років тому +1

      Phony Aardvark i learned it as y= ax+b

    • @verdatum
      @verdatum 6 років тому +1

      Bryan, what does m stand for? The mmmslope? (I actually don't know the answer, now that I think about it)

    • @verdatum
      @verdatum 6 років тому +1

      O_O *head-explosion*

    • @Samantha2209
      @Samantha2209 6 років тому

      I know it was c (at least in my part of the world)

    • @verdatum
      @verdatum 6 років тому

      "mx + c" is also reasonable in the sense that "c" is often used to refer to some "constant". This is also the explanation for e=mc^2. Because the speed of light in a vacuum is a constant.

  • @rkpetry
    @rkpetry 6 років тому +1

    *_...there'd be a negative-correlation where reducing air conditioning increases swimming..._*
    *_...or, an overriding 'cause' leading to watching-speeding or doing-it, another, negrelation..._*
    *_...so...what's the mathematically-concisely-stated-statistical-rule for causality-guessing..._*
    *_...(making statistics, like modulo arithmetic: where compounded moduli may get better)..._*

  • @twiggyvlogs6441
    @twiggyvlogs6441 6 років тому

    Any chance of crash course architecture (history of?)

  • @polarablues64
    @polarablues64 6 років тому

    When it's hot, people with no A.C. tend to go to the movies. Movie theaters are usually quite air conditioned and you get to enjoy it for a couple of hours.

  • @mielthebee
    @mielthebee 2 роки тому +1

    2:56 to the height of the Holy Spirit- 😮‍💨

  • @nickwilsonxc
    @nickwilsonxc 5 років тому +1

    I’ll have you know that my cat, Mr. Whiskers, loves me.

  • @nathanm4539
    @nathanm4539 6 років тому

    Yes. I am so sick of hearing people not know that correlation does not equal causation

  • @renovationgaming5438
    @renovationgaming5438 6 років тому +1

    The first eruption scatter plot has a typo

  • @NataPal
    @NataPal 6 років тому

    i love this

  • @DudeWhoSaysDeez
    @DudeWhoSaysDeez 6 років тому

    Are regression lines ever parabolic?
    What would be some examples if so?

    • @dabomba1951
      @dabomba1951 6 років тому

      optimum angle for maximum range. Range in terms of angle would have a turning point around 45 degrees where it reaches its max range then goes back down. one example

    • @chelseaparlett8069
      @chelseaparlett8069 6 років тому

      you can use a parabola to fit data. It would be polynomial regression where your x's are taken to various powers. Sometimes it's really useful to do so, since often data isn't perfectly linear.

  • @AdamShaiken
    @AdamShaiken 6 років тому +9

    This was very interesting...though, I wonder, just how significant it is ? Can you give me a chi squared on that ?

  • @NaihanchinKempo
    @NaihanchinKempo 6 років тому

    wish you'd touch on poker. Math and Data is very important in poker

  • @comareja4
    @comareja4 6 років тому

    Its was hillarious ,the data present by the reporter.

  • @BCsenge97
    @BCsenge97 5 років тому

    I love this chanel

  • @redstone8513
    @redstone8513 6 років тому

    1:20 They spelt eruptions wrong on the y-axis...

  • @PatrickMichaelOLeary
    @PatrickMichaelOLeary 6 років тому

    In Pearsons study, did he take into account that people often shrink as they get older?

  • @diamoniqueallen2231
    @diamoniqueallen2231 6 років тому +2

    The Bee and Puppy-cat doll in the back is sooo cute (๑>◡

  • @bigpapi3636
    @bigpapi3636 6 років тому

    The narrator is very easy to listen too. Even I understood the content

  • @Canada4evr
    @Canada4evr 6 років тому +1

    Cool-Cage Act; hilarious.

  • @gregoryfenn1462
    @gregoryfenn1462 6 років тому

    Can we do a talk on how you DO identify causation, not just rule out plausible causal relations? Or are we taking a Humean view of causation and saying there is no real force of causation at all, just a fixed regularity that humans imagine happens?

    • @CarlyDayDay
      @CarlyDayDay 6 років тому

      I think it requires an experimental study

  • @muhammadabdulhakeem7152
    @muhammadabdulhakeem7152 2 роки тому

    i will like to confirm that is the equation of a line equals y=mx +b or y=mx+c

  • @fame2011xoxo
    @fame2011xoxo 6 років тому

    Does anyone know how to interpret a Bland-Altman analysis?

  • @micanikko
    @micanikko 6 років тому +1

    So no one's commenting how she's got a *puppycat plush toy* behind her?

  • @dbuyandelger
    @dbuyandelger 6 років тому

    Hmm. I may have needed this video 2 years ago when I was toiling in the halls of grad school

  • @brittbrat756
    @brittbrat756 6 років тому

    omg! PUPPYCAT 😭💗

  • @danielmclaughlin5573
    @danielmclaughlin5573 6 років тому

    Mr. Fluffy does not miss me.
    Mr. Fluffy ran away...

  • @robertpalumbo9089
    @robertpalumbo9089 6 років тому

    Wow this is my doctor and his funny science