Thank you for your excellent explanation! This video is like a sufficient statistic for the topic of sufficient statistics, because you don't have to go back to other sources to solve the exercises :)
This is the best explanation of the definition of sufficient statistics I have ever seen. Thank you for sharing this awesome experience.
Thank you! Fair warning: a couple of professional statisticians have contacted me saying I'm wrong on a technicality I don't really follow, namely that "parameters are not random variables". It remains true that this is a correct result and an easy way to remember it.
@@robertcruikshank4501 It is true. Thank you for the clarification. That is why I'm Bayesian. Your explanation is great.
absolutely great video
Studying for an advanced statistics exam rn, so this helped a lot, thank you! =)
Thank you for this video! I’ve been having a lot of trouble wrapping my head around sufficient statistics
Finally! This helps resolve the discrepancy between the YouTube videos trying to explain the concept and my inference book trying to define it. Good work and thanks a lot!
Glad it was helpful!
Thanks for the upload, very clear thinking!
This is the best video on this topic I have watched so far. Thanks.
Thank You
Thanks very much for this video! I spent the last couple of hours trying to wrap my brain around exactly this. Thanks for saving me from further headache!
Also, my guess as to why they flipped it is that, while the intuitive definition makes sense, it seems less straightforward to compute. If we know the underlying distribution of the data (even if theta is unknown), P(data|U) is more straightforward to calculate than P(theta|U). It would have helped if they clarified that in my textbook though.
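To make the "more straightforward to compute" point concrete, here is a minimal numeric sketch (my own example, not from the video), assuming the data are i.i.d. Bernoulli(theta) coin flips and U is their sum: P(data | U) comes out the same for every theta and is mechanical to evaluate, whereas P(theta | U) would first require choosing a prior on theta.

```python
from math import comb

def prob_data_given_u(data, theta):
    """P(data | U = sum(data)) for i.i.d. Bernoulli(theta) flips.

    Since U is a function of the data, P(data, U = u) = P(data),
    so the conditional is just P(data) / P(U = u).
    """
    n, u = len(data), sum(data)
    p_data = theta**u * (1 - theta)**(n - u)  # P(data)
    p_u = comb(n, u) * p_data                 # P(U = u), binomial probability
    return p_data / p_u                       # theta cancels: 1 / C(n, u)

data = (1, 0, 1, 1, 0)  # n = 5 flips, u = 3 heads
for theta in (0.2, 0.5, 0.9):
    # prints 1 / C(5, 3) = 0.1 every time (up to float rounding)
    print(theta, prob_data_given_u(data, theta))
```

The cancellation of theta in the last line is exactly the sufficiency of U.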
Exactly what I was looking for, thank you
Thanks Robert, really useful explanation!
THANK YOU SO MUCH
awesome explanation!
You are a godsend, bless you, sir, and thank you.
Hello, nice video. The reason probably goes back to the guy who coined the concept of sufficiency: Fisher. Your definition, while indeed more intuitive, treats theta as random. Fisher was "against Bayesianism", thus against treating unknown parameters as random. The advantage of the usual definition of a sufficient statistic is that it can be formulated in a frequentist framework.
Thanks for the interesting commentary!
@@robertcruikshank4501 Dear Robert! As far as I understand the matter, treating \theta as a random entity is not good at all. My point is that when you try to write all your arguments in a rigorous manner, you will feel that a conditional probability involving \theta as a parameter, rather than as a random quantity, does not make sense. This is my feeling and I may be wrong, although I checked the calculations. However, I appreciate your video, since it confirmed to me that my son and I are not the only ones who have trouble understanding the definition of sufficient statistics. I have a PhD in Probability, my son is a second-year bachelor's student in Statistics, and I had a tough time explaining sufficient statistics to him.
If I find time, I shall write down my explanation and send it to you. Thanks once more. Sincerely, Vladimir Belitsky.
Thank you 🙏🏾
Thank you so much for your videos! They make life a lot easier, really appreciate it
Glad you like them!
Thanks a lot!
Thanks a lot sir
I had trouble remembering this definition because to me it didn't quite match the intuitive concept that was presented before the definition was given (i.e. you don't need to go back to the data for a better inference on theta after you observe U). Thanks to you, this finally makes sense and I don't need to check my notes whenever the definition of sufficient statistic is mentioned.
However, I want to remark that I didn't find the implication "P(data | U, theta) = P(data | U) -> P(theta | U, data) = P(theta | U)" obvious, because whenever I tried to prove it with the axiomatic properties of conditional probability (the ones on the right of your whiteboard) I found it difficult to deal with the doubly conditioned probability P(data | U, theta). So I decided to write every conditional probability out by its definition and prove the implication that way (to make things clear, I wrote the first equality as P((X in A), (U in B), (theta in C)) / P((U in B), (theta in C)) = P((X in A), (U in B)) / P(U in B), and I hope this is clear). A shorter route via Bayes' theorem is sketched after this comment.
As to why the definition is given backwards, I think it is because the intuitive definition doesn't make sense in the frequentist approach: P(theta | U, data) doesn't mean anything since theta is a parameter and doesn't have a law. I hope this makes sense, thank you again for your video!
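Following up on the remark above, here is the shorter route via Bayes' theorem, under the video's Bayesian reading of theta (a sketch only, assuming every conditioning event has positive probability). Applying Bayes' theorem inside the conditional given U, and substituting the assumed equality P(data | U, theta) = P(data | U) in the middle step:

$$
P(\theta \mid U, \text{data})
= \frac{P(\text{data} \mid U, \theta)\, P(\theta \mid U)}{P(\text{data} \mid U)}
= \frac{P(\text{data} \mid U)\, P(\theta \mid U)}{P(\text{data} \mid U)}
= P(\theta \mid U).
$$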
Dude yes. I had this lightbulb moment myself a while ago. I am usually really good at figuring out the intuitive notion behind mathematical definitions, but this one took me a while. Good work.
Thank you!
Wow appreciate the great insight! Very concise too. Thanks!
My pleasure!
Super helpful...
Thanks a lot!!
Thank you so much for the insightful video
Glad it was helpful!
thanks sir
Thank you! Very useful explanation
Glad it was helpful!
amazing, thank you
When we say that a distribution doesn't depend on θ, we mean that we do not see θ in its formula. It's not the same thing as independence of random variables, so we can't really use the corresponding theorems. Plus, θ is not a random variable. (A concrete example of this is worked out after this thread.)
In Bayesian statistics, theta IS a random variable. I will have to work on figuring out how a frequentist can make sense of this definition. Thank you for pointing out this problem.
@@robertcruikshank4501 I haven't studied Bayesian statistics, so I don't know. If I don't take it too strictly, you make sense, though. Thanks for explaining it!
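Picking up the point above about θ not appearing in the formula: a standard textbook illustration (my choice of example, not from the video) takes $X_1,\dots,X_n$ i.i.d. Poisson($\lambda$) with $U=\sum_i X_i$. For any counts $x_1,\dots,x_n$ summing to $u$,

$$
P_\lambda(X_1 = x_1, \dots, X_n = x_n \mid U = u)
= \frac{\prod_{i} e^{-\lambda}\,\lambda^{x_i}/x_i!}{e^{-n\lambda}\,(n\lambda)^{u}/u!}
= \frac{u!}{x_1!\cdots x_n!}\, n^{-u},
$$

and the right-hand side contains no $\lambda$: the conditional law is one fixed formula, valid for every value of the parameter, with no need to treat $\lambda$ as random.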
Thank you 🙏🏻
You’re welcome 😊
Thanks for the explanation! I wonder if they avoided defining sufficient statistics via the posterior because of certain regularity conditions, like needing the marginal distribution to be non-zero wherever you condition on it?
It's been pointed out to me that with a frequentist interpretation my description makes no sense. I'm not 100% sure of that, but my expertise is limited. I wasted ten hours wrapping my head around it so I wanted to spare everyone else those ten hours if I could.
Thanks!
You're welcome!
Thank you, this helped a lot! :) (in Korean)
천만에요 (You're welcome according to Google Translate)
yes it was really nice
Thank you, sir! (in Somali)
Adaa mudan ("You're welcome" in Somali, according to Google Translate)
love you man
The reason why they have to switch it up in general is that the expression P(theta | U, data) = P(theta | U) does not make sense unless you're a Bayesian. If you're doing frequentist statistics, the parameter is not random, just unknown. This means that the expression P(theta) simply does not make sense.
This also goes to show how many people are intuitively Bayesian to begin with hahaha.
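For reference, the two formulations being contrasted in this thread, in my own hedged paraphrase:

$$
\text{Bayesian (}\theta\text{ random):}\quad P(\theta \mid U, \text{data}) = P(\theta \mid U);
$$
$$
\text{Frequentist (}\theta\text{ fixed, unknown):}\quad P_\theta(\text{data} \mid U = u)\ \text{is the same function of the data for every }\theta.
$$

The second statement never assigns a probability to θ itself, which is why textbooks can state it without committing to a prior.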
Yes, I have heard this argument. As far as I can understand (which is limited), it leaves math behind and dives into philosophy. Granted, the philosophy of probability theory is seriously messed up to begin with. It wasn't until I tackled advanced statistics that I realized I owed QM an apology for calling it nonsensical: it merely inherited most of its problems from probability theory. But to get back on point: if I fully understood the issue you are describing, I would have made another video about it. Sadly I must leave that to better minds than my own.
Thanks!