How statistics can be misleading - Mark Liddell
Вставка
- Опубліковано 2 жов 2024
- Explore the statistical phenomenon known as Simpson’s paradox, and how it can lead to incorrect conclusions about data.
--
Statistics are persuasive. So much so that people, organizations, and whole countries base some of their most important decisions on organized data. But any set of statistics might have something lurking inside it that can turn the results completely upside down. Mark Liddell investigates Simpson’s paradox.
Lesson by Mark Liddell, animation by Tinmouse Animation Studio.
Support Our Non-Profit Mission
----------------------------------------------
Support us on Patreon: bit.ly/TEDEdPat...
Check out our merch: bit.ly/TEDEDShop
----------------------------------------------
Connect With Us
----------------------------------------------
Sign up for our newsletter: bit.ly/TEDEdNew...
Follow us on Facebook: bit.ly/TEDEdFac...
Find us on Twitter: bit.ly/TEDEdTwi...
Peep us on Instagram: bit.ly/TEDEdIns...
----------------------------------------------
Keep Learning
----------------------------------------------
View full lesson: ed.ted.com/less...
Dig deeper with additional resources: ed.ted.com/les...
----------------------------------------------
The numbers will confess to anything if you torture them long enough.
That applies to anything mate
Lol...Interesting....and a very good analyst with integrity can find where the numbers were tortured and why. =))
Just like people, and bunnies.
Well, the bunnies died but it should’ve worked.
The NUMBERS, MASON!!!
THE NUMBERS!!!
A perfect way of looking at it.
That's what makes statistics both beautiful AND scary. It doesn't have to lie to trick you.
Which is why I always look into the methodology and prefer to look at data directly and never trust a conclusion someone else says the statistic would provide.
The problem is that you can NOT trust the data itself!!! You are just re-interpreting biased and faulty data!!
I am fast learning this necessary to do if you want to form a truly educated opinion.
Hiding a fact is a lie
@@rahulpandey2022 thus complicates the problem further. Relevancy is in the eye of the beholder.
Accordion to a recent study, 90% of people fail to realize that the beginning of this sentence is an instrument
.
Ns ouch that hurts
nice sentence
Papa Shpee got me
I was gonna say, that isn’t even a sentence because there’s no period.
I remember that in middle school, our teacher would say that people in the past didn't live longer than 40.
The statistics showed that the average lifespan was not more than 40, but that was because the child mortality rate was so high. People could still get 70 - 80 :P My teacher didn't know that!
It was average but compared to the past most people die at early age because of the diseases that were not curable at that time
@@user-ss1ky4tu8y the thing is that the distribution for life expectancy is U-shaped.
Then and now. Ppl mostly either die when young or when old.
Most other distributions we encounter are bell shaped (Gaussian, also called Normal). In a bell shaped distribution, the mean is very meaningful - most individuals will be around the mean. Eg height - most ppl will be of approximately average height, only few will be either very short or very tall.
With U-shaped distributions, the mean isn't intuitively useful. There are very few ppl who die at the average life expectancy age, most ppl die when older or when very young.
In former times, child mortality was a lot higher, so the average life expectancy was farther from the old age at which ppl would die who had managed to survive childhood. Nowadays, child mortality is quite low in developed nations and therefore, the average age is now closer to the end of life dying age of ppl. It's still U-shaped, though.
Thank you for explaining this bc this never made sense to me
@@TheFeldhamster That's why persons who do statistics corectly first do test if distribution of data is normal/gaussian. If not then they don't count mean. Since some so called mathematician did it, it actually proves that he was rather uneducated botcher ( like me when I use mean... But I'm not mathematician :) )
@@greyrock420 elaborate pls.🌝
"A good liar tells mostly the truth. A masterful liar tells only the truth." I don't remember where I heard this but..
ah, lying by omission
another great statistic: In avarage, every human has one boob and one ball, ever thought about that?
And 1.99 legs
+Stingy Greindger Vsauce?
+BuFufilms lol
exactly what I remembered it from
+BuFufilms
could you please link me the vsauce video?
thanks in advance :)
+Stingy Greindger And the average penis length is 7cm:D
Man, I thought the "Simpson's Paradox" came from Homer saying "people can come up with statistics to prove anything. 14% of people know that." XD
+Richard Hannay What a load of rich, creamery butter you just spouted.
@@empty_cognizance2234 is that a complement or an insult
Nova Star yes
@@eshelsh1905 thats the joke r/whoooosh
wasn't homer gay>+?
This guy's voice is so calming
true
+That Guy u w0t m8?
+That Guy LOOOOOL
+Cassandra D I know right, it gets me wet as fuck
+Cassandra D I think you find it trustworthy. That is calming. If he seems to have knowledge that's also calming. I find it more meant to reassure that this is how it is.
To many people in the comments are dismissing statistical analysis completely. That is a mistake. The moral here is that you need to be careful and properly recognize and incorporate relevant variables. Statistics when properly used can show trends and help policy makers implement solutions to negative trends.
For example in the hospital example once you incorporated the information about the health of the patients before they were admitted to the hospital you could see,that perhaps hospital A has some, serious deficiencies.
+Tony Zan Go away. Statistics show that more than 99% of the statistics have flaws and not to be trusted.
Statistics can be minimally useful but they should Never be taken at face value or as being genuinely accurate. Statistics, at most, should be just one of many many tools for discerning Facts! The entire process of attaining statistics is highly flawed and shouldn't carry the weight it does!! In the end, Statistics are not much better than Anecdotal Evidence...It's something to be considered but that's it!!
David Belcher "Statistics are much better than anecdotal evidence".
^ That is a flawed sweeping generalization.
David Belcher disagree, obviously statistics can be biased, but they can also represent an accurate portrayal of reality. And are much more likely to show an accurate portrayal of reality than anecdotal evidence.
To compare statistics to anecdotal stories is insane. Anecdotal stories take the perspective that one event is representative of a larger phenomenon.
At the very least statistics use a much larger range of variables. That alone makes this takes more relevant than anecdotal evidence.
To say that statistics are never accurate to say that polling is never accurate. And if that were true then someone like Nate silver must be a psychic because of how accurate his predictions of the 2008 and 2012 elections were.
To say that statistics are not more relevant than anecdotal evidence. Is the dismiss the fact that statistics like the gini coefficient show that the u.s. is the most economically unequal developed nation in the world.
The say that statistics aren't true used to say that the statistics that show, government provided Healthcare increases life expectancy is all by accident.
***** Are you so sure?
I love how TED-Ed never has ads. They're here to spread knowledge, no make profits.
Yeah sure, but they still only make $ 3539 on patreon. A single guy- CGP Grey makes 15 times as much. Also, my point is the quantity:quality ratio of both channels
I agree that TED-ed is an organization but still Grey also has writers, animators, editors, etc. too!
that's because their profit is made from the 3k or so price to attend their conferences
Idk if they've changed but I got an ad on this but I think they deserve the income, regardless of how they get it (well with legitimate means ofc)
So you're implying the people making these animations, doing the research etc. Don't deserve to be paid
@@tanvikhare9710 shut up capitalist swine
Ted-Ed don't you ever stop making these videos.:)
That first example is huge. Better hospitals often have lower survival ratings because it’s better, so sicker people often to to that hospital instead.
It's like Mark Twain said. "There are three kinds of lies. Lies, damn lies, and statistics."
+Shawn Ravenfire That quote was first used by a British prime minister not Mark Twain.
Hash Kazi I stand corrected.
Yeah, but Twain won the rights to the quote in a Poker game!!
He didn’t say that.
Statistical error.
It's like gerrymandering of data!
Theres a nice saying i wanna share: Statistics are like bikini, what they reveal is suggestive. But what's hidden is vital :)
Rafiq me too
Caveat: doesnt apply to nudist colonies. :0)
I like this! I’m stealing it! Thank you!!
.
@@waynejshynzhinds4434 They are genuine statistsics!
I would like to congratulate the animators of this TED-Ed! The guest artists usually do a good job in the videos, but I presume this topic was really hard and the animation not only richly ilustrated it, but also was able to make me have fun while watching. Thank you very much. I hope to see more colaborations of this company with TED.
Did you know that there's a 100% chance that you're currently reading my comment?
Did you know that your sentence is actually correct? "You" comes to be whoever is reading that sentence; so yeah, there is a 100% chance. I found that funny and interesting.
Yes, and it's taken from the sample size of all peoples that will be reading your comment. I did my own expert calculations and well, well, I arrived at the same percentage! (Although in scientific notation: 1*10^2 %)
YAHALLO!!!
@@AloeEdits What? Lmao
I’m glad this is mentioned in the video and I imagined this is what it would be about. This kinda just goes down into simple scientific method really. Just because one thing is found doesn’t mean that there can’t be more factors affecting the situation. I do take statistics into consideration for a lot of decisions of course but I always try to separate it into separate factors if possible and try to find stuff that might disprove certain personal ideas that come up either because of the statistics or because of emotional causes.
Sadly sometimes there just isn’t enough statistics to check for everything. So at that point you do the best with what you got
Lurking variable background sound is what i have been missing my whole life
if you put the intro and the outro in reverse, you get piano sounds, i just checked it
this fucked me up
I'm not convinced. It sounds more like a string instrument like a violin.
Or a trumpet or whatever it's called. All I know is that it's a continuous instrument
+Maddix I literally checked it, its a piano, with some more unknown instuments
+ChaoSpiritZ rotmg
Could you perhaps upload verification?
Animation,sound effects are simple yet so good and interesting you guys
Its engaging and edutaining so i have lesser chance of being distracted n is easily comprehend...keep it up ted ed..we need channels like these..thanks so much for the information.
I think it's important to understand that there is still a purpose for statistics. Just be selective with your sources, consider if what you are reading a biased source such as left or right wing media. What does the writer want to you to feel, angry? shocked? What is their objective? Will they be able to give you a balanced view? If you can't find an unbiased resource read opposing biased sources to get a better idea of reality. Always read critically.
Got news for you, EVERY STATISTICAL SOURCE IS BIASED! Someone had to pay for the study.
@@deltavistastudio124 some might be not, if every result is equally good for paying side, or accuracy is actually the most beneficial. For example statistics telling what kind of paying side's product people like the most. Also not every company is immoral probably and in some cases falsing statistics would be very immoral (for example about deadliness of some substance)
What is the purpose of statistics?
“There is no one size fits all answer” the Ted Ed equivalent of “get gud”
Unless you have green eyes.
Pretty good explanation of one reason why correlation doesn't guarantee causation
Yup, this is why i don't care about "Vague" statistics.
someone should make this into a videogame
Amazing educational piece. This is relevant in every aspect of our lives. Hearing politicians using manipulated statistics for their agenda is common
thanks Ted Ed
there's so much learning in this video ,
it guided me
Geosquare moment
He was right anyway.
The whole world should have to watch this video on their first day of high school or its equivalent....it's common sense to some of us and painfully absent in others. I have no stats on these two groups :)
I never stopped trusting something as fast as I stopped trusting statistics after I took a statistics class.
Yes so very true, it's scary how often stats are used to determine where money is placed in government, policing, health, education- people's lives are determined by the faith in statistical gathering and interpretation as if it were incontrovertible truth. Terrifying.
taking statistics class should be mandatory we're all getting fooled ='D
Gender wage gap anyone?
+Short Burrito This
There IS a gender wage gap, but its much smaller than that one study says it is (its around like 5-6% at worst if I recall correctly), and only about half of it is (possibly) due to muh sexism, or muh patriarchy
+Short Burrito Yeah the whole women make 70% men do does not account for a some very important variables. First there are some women who still prefer the stay at home mom lifestyle. Their husbands make enough to support the family and they are happy. and that is fine if that lifestyle is what both of them want. But with a large group of women making no income then that drags that wage gap down. if you account for that then the wage gap is much smaller. Another variable is, maternity leave something that effects women wages but not men. And finally male dominated and female dominated fields. If a male dominated field has higher salaries while a female dominated field has lower. Then even if the males and females within those fields receive the exact same pay for the same work then that wage gap will still appear. right there is an example of the Simpson's paradox. i do understand to that these different fields is probably the result of gender bias or cultural and social norms preventing women from pursuing higher advancing their career. However the point still is that this is not due to any kind of wage disparity. there are also other variable that I might not have touched on.
TL:DR Even if women are payed exactly equal there will still be a wage gap because the issues are more complicated then just men get payed more.
+MegaAwesomeNick Gender gap statistics are usually controlled per industry so those who don't work aren't included.
+Kamizushi Akinari Ok, but my point still stands that that the issue is a lot more complicated then just saying women make 70% what men make. there are other variables that need to be taken into account like before mentioned motherhood, differences in career fields, age and many other things. and that the actual wage gap is probably a lot different then what is spouted all around the media.
***** I agree.
this is a great video. definitely an example of simpson's paradox is the wage gap as evidence of sexism as it only takes into account the average earnings of men and woman and doesn't go into any variables that may affect that statistic such as the average hours men and woman work and the different fields men and woman go into.
This video pretty much summarises the whole book called "FACTFULNESS"
Like when "campus rape statistic" data is determined by the surveyor, not the surveyed, and includes things such as feeling verbally threatened or groped as a rape?
_A recent study shows that anyone will believe anything as long as the study in question is prefaced with "a recent study shows". This, of course, is all according to a recent study using recent statistics._ No, but in all seriousness, try googling or looking up "a recent study shows" or "according to a recent study". You'll be surprised by all the "recent studies" you'll find. Most of them claiming to be from or citing "reputable resources".
Good statistical analysis requires (1) expertise and (2) integrity. Statistics give power, so they will always attract (1) incompetent people and (2) immoral people.
I laugh everytime the grandma jumps up with an "eeh!"😂😭💀
is as simple as to look at as much data as posible, things are often more complex than we think
I saw a stat where two different acts were compared . One was 99.99% safe. The other was reported as being "25× more deadly". Sounds scary, but that means the second one is 99.75% safe.
Bruh
@@pustota7254what
That dude’s laugh drove me insane.
Thank you so much, you probably won't see this since this video is 4 years old but this helped me a lot with one of my math classes.
This could be pretty useful to know someday!
Chipperz Right now.
Rn with covid it pretty useful.
Funny how the last example this video gives on it self can be misleading. You need to look into the context of the data(as you state so many times), it is possible that the victims that were black were due to gang violence and the victims that were white were due home invasions.
They both are murder cases, but the context of them can greatly change the sentence, the only way is to look for the data(as was done in example one) and see it. In this case it seem that was omitted, giving it the(possible false) notion that the determining factor was racial bias instead of the type of murder and the way it was carry out. Yet another fact that was omitted was the age and sex of the victims.
I just found somewhat ironic that in a video that's supposedly about addressing bias and manipulation of the persuasive nature of statistics(for possible ideological motives) might be committing the very same thing.
Whoa nelly! Rikortez, can't even suggest such a thing without triggering the snowflakes!!!
"It is possible that" is a nice way of saying you don't know. Everything you're saying is technically correct of course, but it would be way more interesting and useful if you had some real data
That's his point, making the assumption that the race of the convicted affects how likely they are to die rather than the crime itself is a manipulation of statistics.
You are saying, it's not wrong unless you prove it by looking at the data. But it's not right until looking at all the data, that is the point of the video.
And like the guy said, that's exactly the point the video is making, yet the video is being misleading in the same way the video is trying to prevent viewers from being fooled.
I don't get it, they say that blacks are sentenced to death comparatively higher than whites, and then proceed to say whites have a higher sentencing rate😅
Great video. But it's not statistics to blame here because without a method to organize any of our data we would be in a greater disorder today.
It is true that statistics have deliberately lied and data has been manipulated. But most of the time (am I saying this statistically?), it's the loopholes in statistical graphs that people (I said people, which includes politicians and companies) exploit to fulfill their agendas. I mean how can a SINGLE, error-filled, filtered, non-contextual graph prove a theory, claim or anything like that? Representation is key, especially when dealing with unobservant audiences.
I very much appreciate the author's point that the issue of lurking variables isn't easily resolved. I would add: it's good to have the same phenomenon, even the same data sets, examined by statistics with multiple perspectives.
Thanks a lot Ted ed for posting this video. It is really informative. I have always thought statistics can be relied. Today I learnt to look at different perspectives of statistics and the information it provides. Thanks a lot! Keep posting such videos!
You know that misleading smoker study was done by big tobacco.
اخوكم من عند الدحي
مي توو
Study shows that more than 80% viewers consider the lurking variable's animation to be scary
This is why WE JUST DONT KNOW who is right😂😂😂 even what seems to be “facts” have hidden facts. And you thought data couldn’t be used as a fallacy. 🤷🏼♂️
An easy example how statistics can being tricky:
We have 2 men and 2 breads, and one of them eats the 2 breads, but statistically says every man eats 1 bread.
Great video. Um, but... if you have good health, why do you even go to a hospital?
Good health = just sick, bad health = almost dead.
Well, you could have a broken wrist or something similar, or maybe you’re giving birth.
I'm going to show this to everyone that comments to me with statistical data..especially if it is from some bias source like dogsbite.
omg, this is so cool, it really caught the first time, thank you so much!
🎯 Key points for quick navigation:
[00:06] 📊 Statistics can mislead decisions by hiding crucial details.
[01:32] 🏥 Simpson's paradox reveals contradictory trends in grouped data.
[02:01] 🧠 Lurking variables significantly impact statistical outcomes.
[02:29] 🚬 Misleading survival rates can occur when key variables like age are overlooked.
[02:58] ⚖️ Dividing data by factors like race exposes hidden biases.
[03:52] 🤔 Avoiding Simpson's paradox requires careful data interpretation.
📉 Overall numbers often provide clearer insights than segmented data.
🛡️ Awareness of lurking variables prevents data manipulation for biased agendas.
Made with HARPA AI
This is how the world works: A green frog gets its legs cut off. A white frog does not. Therefore, scientists conclude green frogs cannot jump
Video on one of TED channels talking about misleading statistics? I'm sure this aged well :)
If you suspect a lurking Factor missed out in a statistic (on accident or intentionally), just look if things that should obviously be considered (the health of patients in their Hospital example is a good one) are missing or not mentioned and maybe look up other statistics of the same thing. Usually every statistic tells you what kind of data they took. So if you see a statistic that says "tested 1000 healthy people, around 50% of each gender", you might realize that that other statistic said nothing about the initial health of the subjects, but on turn said they where between age 15 and 50. This is a great way to get closer to the truth with statistics.
I used this paradox in a lesson i've held to highschoolers as part of my last year exam. SO thank you for introducing me to it. It was a great finishing touch that got the kids really excited. And my lesson was about philosophy of science and the role of common sense.
I like how this video show how statistic can be misleading and then quantify all murders happening in florida to be of the same value, discarding context as if judgment case are broadly the same thing and are not unique in their combinations of circumstances and factors. Was it murder of children? First or second degree? Judgment rendered by judge or jury? The real world is not the world of theorics macroeconomics where every single product that does the same basic function has the same value.
Covid19: you've exposed me
Another good example of statistics is that there’s 104 days of summer vacation
but Phineas and Ferb has 189 episodes
Basically the stats don't mean anything without context
feminism at it's finest.
A nice and important video. Can also recommend a TED Talk by Peter Donnelly about statistics.
This is why you need to question results that may seem favorable for the narrative pushed by main stream media. Who sponsored the study and what variables were used and what were left out?
in life experience, statistics are statistically incorrect most the of timr
That's why you need to test their reliabilities. Cronbach's Alpha is one of the most common ones.
All of science is correct until proven false. Statistics is no exception.
so who didn't do this during their collage research?
something that always annoyed me about statistics is that 50% doesnt mean one every two
70% of statistical data is misleading. Remaining 40% of statistical data is errorful. Wait, what did I say just now?
Or you could simply use decent statistical analysis software to find determining variables... Just sayin'
Simpson’s paradox is really scary and beautiful at the same time.
The 97 dislikes r people who make misleading graphs and now less people fall for them
plot twist: the statistics used in this video are misleading
If we can click the 'Like' button multiple times to upvote this video, I'd keep on clicking for the next hour
I still don't get the hospital thing. How can hospital A have a better survival rate when hospital B is better for patients with poor and good health? They said it's Simpson's paradox but..can they explain more with this case in particular
+Harreh Pottah
Yeah, it's really unintuitive. Here's a visual Information Theory explanation that includes a visualization of Simpson's paradox:
colah.github.io/posts/2015-09-Visual-Information/#aside-simpsons-paradox
+Harreh Pottah Basically, hospital A had a HIGHER rate of people arriving to their hospital in better health, and therefore overall had a better statistic.
To simplify, imagine 20 people go to hospital A. 18 of them arrive in good health and only 2 arrive in bad health. The hospital only looses 2 patients from the group of people with good health, and loses 1 from the people arriving in bad health, giving them success rates of 88% and 50%, respectively, and an overall survival rate of 85%
20 people also go to hospital B. At that hospital, 10 arrive in bad health, and 10 arrive in good health. They save 6 people that arrived in bad health (80%), and all of the people arriving in good health (100%). They have better survival rates in both categories, but when taken overall, hospital A only has a survival rate of 80%.
So, again, it was because hospital B, in both the video's situation and my simplified one, had a higher rate of people arriving in bad health who had less of a chance of surviving, and even though they saved more of them, it still brings down their survival rate relative to the other hospital.
Thank you! I understand now
You don't know what you're talking about
Great explanation, thanks!
Although, weird thing, I'm not seeing the `Read more` link on youtube comments anymore.
+Alex Knauth You may already know a way by now, but what you can do is either reload the page, or change from "Top Comments" to "Newest First" and that makes the 'Read more' link show up again.
Very enlightening, thanks for the video!
Unlock "This works," they simply translating video!
Kind of dark scenarios. I realize those are high stakes examples.
To this day I'm still fascinated by Simpsons paradox
As a statistician, I must agree with you, but there are more useful things in statistics. AI, machine learning, data mining is some example of it. Without it, there would be no search engines like google.
Do not let politicians make you skeptic about statistics.
Does anyone have the source for the smoking study I can't find it anywhere.
This is Donald Trumps FAVORITE way to show Covid and racial statistics. I wish more people would look at this today.
Simpson's paradox is kind of like the inverse of the logic behind gerrymandering if you think about it
great example - the gender wage gap & college rape statistics. the wage gap study that is most often quoted did not account for factors such as how many hours a woman works during her lifetime vs a man (due to needs, choices or life changes). the rape study most often referenced included skewing factors such as both parties engaging in consensual sex while under the influence of drugs or alcohol.
Don't know why I'm watching this, my GCSE Statistics Examination has been cancelled, due to the COVID-19 pandemic. lol
EXCELLENT !!
Ive been saying this for so long, nobody ever listened me.
I didn't understood that Florida death sentence thing.
Great example, something that people should know to understand society better.
Statistics are true 100% of the time, every time it's true.
Given all the completely bunk research done by gender study majors, who couldn't read nor assemble a statistic analysis to save their lives; this feels like the least of our worries about statistics. Extremely nice explanation nonsense the less :)
So we interviewed people who drove at full speed at the highway both of them sustained no injuries therefore driving at full speed is not reckles driving and it is 100% safe
Not agree. Statistics also include the fields not taken into consideration in these examples.
Counting the poor health ration for two hospitals comparison is also a part of statistics.
Grabbing representative samples is also a part of statistics.
Wow a TED presentation that isn't garbage.
this is ted-ed, not ted i think.
I have an idea! Ted-ed is a bit redundant, let's shorten it to Ted!
TED-ed is the best one, and TEDx is just a bunch of ideological crap, normal TED is hit and miss.
Lack of awareness of Simpson’s Paradox is basically the bane of all doctor’s existence.
I think about some other random thing for one second and then we start talking about a paradox,
THANKS A LOT ATTENTION SPAN