Causal Inference - EXPLAINED!

  • Published Dec 2, 2024

COMMENTS • 86

  • @CodeEmporium
    @CodeEmporium  2 years ago +110

    I put a lot of effort into this one to make it as descriptive as possible. It's also a new style of delivering content / animation. Please let me know how you like this. :)

  • @gemini_537
    @gemini_537 8 months ago +2

    Gemini: The video is about causal inference. It explains what causal inference is and the challenges of performing causal inference using observed data. It also explains different techniques to address these challenges.
    The video starts by explaining randomized controlled trials (RCTs), which are the gold standard for causal inference. But RCTs are not always possible, so the video talks about causal inference using observed data.
    Causal inference using observed data is challenging because there can be confounding variables that affect both the treatment and the outcome. The video uses an example of a medical trial for the flu cure to illustrate this point. In the example, age is a confounding variable. The treatment group (people who received the elixir) has an average age of 35 while the control group (people who did not receive the elixir) has an average age of 65. Even if the people in the treatment group recover from the flu faster than the people in the control group, it might be because they are younger, not because of the elixir.
    Another challenge of causal inference using observed data is selection bias. Selection bias happens when the group chosen for the treatment is not representative of the population. For example, if the people who received the elixir in the medical trial were all young and healthy people, then the results of the trial would not be generalizable to the whole population.
    The video also talks about counterfactuals, which are what would have happened if a person had not received the treatment. Counterfactuals are necessary to estimate the causal effect of the treatment. There are two techniques for estimating counterfactuals: matching and machine learning.
    Matching involves finding people in the control group who are similar to the people in the treatment group on all observable characteristics except for the treatment. The outcome of the people in the control group can then be used as an estimate of the counterfactual for the people in the treatment group.
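A minimal sketch of the matching idea described above, assuming age is the only observed characteristic and that each treated person is paired with the control person closest in age (with replacement). The records and the helper names `nearest_control` and `matched_effect` are hypothetical and are not taken from the video.

```python
# Hypothetical records, not data from the video: (age, recovered).
treated = [(30, 1), (42, 1), (67, 0)]
control = [(29, 0), (45, 1), (70, 0), (55, 1)]


def nearest_control(age, pool):
    """Return the control unit whose age is closest to `age`."""
    return min(pool, key=lambda unit: abs(unit[0] - age))


def matched_effect(treated_units, control_units):
    """Average of (treated outcome - matched control outcome)."""
    diffs = []
    for age, outcome in treated_units:
        # The matched control's outcome stands in for the counterfactual.
        _, matched_outcome = nearest_control(age, control_units)
        diffs.append(outcome - matched_outcome)
    return sum(diffs) / len(diffs)


effect = matched_effect(treated, control)
print(f"Matched estimate of the effect on the treated: {effect:.3f}")
```

Real analyses usually match on several characteristics at once (or on a propensity score); single-variable nearest-neighbour matching is used here only to keep the sketch short.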
    Machine learning can also be used to estimate counterfactuals. A machine learning model can be trained on data from people who did not receive the treatment. The model can then be used to predict what would have happened to the people in the treatment group if they had not received the treatment.
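The model-based approach described above can be sketched as follows. This is an illustration under stated assumptions: a plain least-squares line stands in for the machine learning model, the outcome is days to recover, and all numbers and the helper `fit_line` are hypothetical rather than taken from the video.

```python
from statistics import mean

# Hypothetical data, not from the video: (age, days_to_recover).
control = [(30, 4.0), (40, 5.0), (50, 6.0), (60, 7.0)]
treated = [(35, 3.0), (55, 5.0)]


def fit_line(points):
    """Ordinary least squares for y = intercept + slope * x."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    x_bar, y_bar = mean(xs), mean(ys)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in points) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    return y_bar - slope * x_bar, slope


# Train only on people who did not receive the treatment.
intercept, slope = fit_line(control)

# For each treated person, predict the untreated outcome (the counterfactual)
# and compare it with what was actually observed.
effects = [days - (intercept + slope * age) for age, days in treated]
avg_effect = mean(effects)
print(f"Estimated change in recovery time for the treated: {avg_effect:.2f} days")
```

A negative value here would mean the treated group recovered faster than the model predicts they would have without treatment.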
    The video then talks about the assumptions that need to be made for causal inference using observed data. These assumptions are necessary to make the analysis possible. One of the assumptions is called the causal Markov condition. This assumption says that the treatment only affects the outcome through the variables that are included in the causal graph.
    Another assumption is called SUTVA (Stable Unit-Treatment Value Assumption). This assumption says that the outcome of a unit would be the same no matter what treatment the other units receive.
    The last assumption is called ignorability. This assumption says that there are no confounding variables that have not been included in the analysis.
    The video then shows how to calculate the average treatment effect (ATE) and the conditional average treatment effect (CATE). The ATE is the average difference in the outcome between the treatment group and the control group. The CATE is the average treatment effect for a specific subgroup of the population.
    In the example of the medical trial, the ATE was 0.1. This means that the people who received the elixir were more likely to recover from the flu than the people who did not receive the elixir. However, the CATE for people over the age of 35 was 0.4, while the CATE for people under the age of 35 was -0.2. This means that the elixir was effective for older people but not for younger people.
    The video concludes by saying that causal inference using observed data can be a powerful tool for making decisions, but it is important to be aware of the challenges and assumptions involved.
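As a rough illustration of the ATE and CATE calculations summarized above, the sketch below assumes both potential outcomes are available for every person (one observed, the other an estimated counterfactual). The data and the age cutoff are hypothetical, so the printed values are not the 0.1, 0.4, and -0.2 quoted from the video.

```python
from statistics import mean

# Hypothetical units: (age, outcome_if_treated, outcome_if_untreated).
# In practice one of the two outcomes per unit is an estimated counterfactual.
units = [
    (25, 1, 1),
    (30, 0, 1),
    (33, 1, 1),
    (50, 1, 0),
    (62, 1, 1),
    (70, 1, 0),
]


def ate(rows):
    """Average treatment effect: mean of per-unit outcome differences."""
    return mean(y1 - y0 for _, y1, y0 in rows)


def cate(rows, condition):
    """Average treatment effect restricted to units whose age meets `condition`."""
    return ate([row for row in rows if condition(row[0])])


overall = ate(units)
older = cate(units, lambda age: age > 35)
younger = cate(units, lambda age: age <= 35)

print(f"ATE: {overall:.3f}")
print(f"CATE (age > 35): {older:.3f}")
print(f"CATE (age <= 35): {younger:.3f}")
```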

  • @andrews9719
    @andrews9719 2 years ago +12

    I’m taking a masters in data analytics/program evaluation, and am learning this rn. You summarize the information really well, picking out the really important parts of causal inference to explain. Good job! The later part of the video even helped me conceptualize quasi experimental designs, which use matching like you described. Thanks for the help.

  • @kinanradaideh5479
    @kinanradaideh5479 1 year ago +5

    Absolutely beautiful, incredible explanation; I like that it's explained through a practical example!
    You're very underrated; the future of this channel is bright!

  • @tomasroosguerra8338
    @tomasroosguerra8338 1 month ago +1

    Best video I've seen on this topic and I've seen MANY.

  • @LNJP13579
    @LNJP13579 6 months ago +1

    Brother, you have summarized really well in such a short video. Every second was GOLD 🙂

  • @narkprix2
    @narkprix2 1 year ago +1

    Incredibly well explained and very illustrative examples. Many thanks for the work you put into it.

  • @poniard123
    @poniard123 2 years ago +2

    Very useful video. I spent two days reading the actual paper on causal inference. This video is concise but gives me a very good foundation to read the theory.

    • @CodeEmporium
      @CodeEmporium  2 years ago +1

      That’s the hope! Thanks a ton for watching

  • @chrislin5938
    @chrislin5938 2 years ago +2

    This content is gold. Thank you so much for making these kinds of videos!! Can’t wait to see more!!

  • @rajns8643
    @rajns8643 9 months ago

    Absolutely beautiful and incredible explanation. I also like the fact that it's explained through a practical example :)
    Cheers!!

  • @syhusada1130
    @syhusada1130 2 years ago

    Clear and good explanation.

  • @tnmyk_
    @tnmyk_ 4 months ago

    Amazing explanation! Got to learn a lot and understood everything. Thanks a lot!

  • @andrewchen7710
    @andrewchen7710 1 year ago

    reading a paper utilizing causal inference rn - this cleared so much up, props!

  • @franklynLP
    @franklynLP 2 years ago

    My prof needed 3h to explain this (and failed - that's why I'm here). Thanks for the video, helped a lot!

  • @ravisawhney8677
    @ravisawhney8677 1 year ago

    Really good video - appreciate the effort that it must have taken to convey the concepts intuitively whilst being as succinct as possible. Not easy!

  • @won20529jun
    @won20529jun 2 years ago +2

    Amazing explanation! It must've been almost painful to not discuss all the details and caveats and technicalities, but that's what made it valuable for me
    Love the music as well :D

  • @clarezhou1652
    @clarezhou1652 1 year ago

    Great video. I read a lot of materials and couldn't digest them. This one is the best I've seen. Thanks

    • @CodeEmporium
      @CodeEmporium  1 year ago

      Glad it was helpful! And thanks for watching!

  • @InCaseofEconStruggles
    @InCaseofEconStruggles 1 year ago

    I think this is a really good overview of Causal Inference and the main assumptions! Good DAG explanation as well!

    • @CodeEmporium
      @CodeEmporium  1 year ago +1

      Thanks so much for watching! And for the comment

  • @AndyHo-j3f
    @AndyHo-j3f 9 months ago

    This is a really really really well done video, thank you!

  • @mark2292
    @mark2292 2 years ago

    Thank you so much, your explanation is way more clear than my prof in this class

    • @CodeEmporium
      @CodeEmporium  2 years ago

      Super glad you feel this way! Thanks so much for watching!

  • @tiff-anniekenny1222
    @tiff-anniekenny1222 2 years ago

    Thank you for this exceptionally well-presented video - rich in content and succinct.

  • @GhemonA
    @GhemonA 2 years ago +1

    How do you create the Treatment and Control groups at 8:55?

  • @tomoki-v6o
    @tomoki-v6o 2 years ago +1

    I think the job you're doing in these videos is better than some research papers, by simplifying these topics for the public.

  • @kunaldang7611
    @kunaldang7611 2 years ago +1

    Thanks a lot for this video! Keep up the good work, and please try to cover Causal Graphs (Directed Acyclic Graphs) vs Bayesian Network structure learning (also in detail) if you can. Thanks in advance.

  • @zbigniewz.lewandowski8629
    @zbigniewz.lewandowski8629 4 months ago

    Your presentation is missing a key element, which you mention at 11:31, namely how to estimate missing data. Could you send a link to an explanation of this element of the presentation?

  • @zbigniewz.lewandowski8629
    @zbigniewz.lewandowski8629 4 months ago

    All of the calculations are simple and clear, but a key element is missing, which you mention at 11:31, namely how to estimate missing data. Could you send a link to an explanation of this element of the presentation?

  • @hameddadgour
    @hameddadgour 8 months ago

    Great video. Thank you for sharing!

  • @mahrym984
    @mahrym984 5 months ago

    Great summary thank you🙏🏾

  • @victorrodriguez5981
    @victorrodriguez5981 2 years ago +1

    great explanation, i've been studying c.i. for the past 6 months and your way of explaining was very clear.
    Cheers from Bolivia.
    P.S. can you share your discord link again plz

    • @CodeEmporium
      @CodeEmporium  2 years ago

      Thank you! Appreciated! The discord link should be in the description of this video :)

  • @stavs1792
    @stavs1792 2 years ago

    Great video! what tool do you use to create the presentation and the animations?

  • @masteronepiece6559
    @masteronepiece6559 2 years ago +2

    Great video. 👏

  • @MSalman1
    @MSalman1 2 years ago

    Awesome job explaining!!!!

  • @alghanimaa
    @alghanimaa 2 years ago

    hello, what is the problem with the following approach which aims to account for age without counterfactuals?
    you can do mean(treatment) - mean(control) for the older group ((0+1+1)/3 = .67) - ((1+0)/2 = .5) resulting in a difference of .17 for the older group and a similar calculation for the younger group yields ((1+0)/2 = .5) - ((1+0+0)/3 = .33) resulting in a difference of .17 for the younger group as well.
    using this approach, there does not seem to be a difference due to age!
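The arithmetic in the comment above can be reproduced with a short sketch. The outcome lists are copied from the comment (1 = recovered, 0 = not recovered); the dictionary layout and the helper name `difference_in_means` are illustrative choices, and the sketch only checks the numbers rather than settling whether the approach is valid.

```python
from statistics import mean

# Outcomes as listed in the comment above (1 = recovered, 0 = not recovered).
outcomes = {
    "older": {"treatment": [0, 1, 1], "control": [1, 0]},
    "younger": {"treatment": [1, 0], "control": [1, 0, 0]},
}


def difference_in_means(group):
    """mean(treatment) - mean(control) within one age stratum."""
    return mean(group["treatment"]) - mean(group["control"])


diffs = {name: difference_in_means(group) for name, group in outcomes.items()}
for name, diff in diffs.items():
    print(f"{name}: {diff:.2f}")
```

Both strata come out at roughly 0.17, matching the figures in the comment.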

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w 1 year ago

    Well explained.

  • @suhailshaikh5091
    @suhailshaikh5091 1 year ago

    Hey Ajay, thanks a lot for making this video. Super helpful. Best video I came across on Causal inference.
    I have a question regarding the balance check between the treatment and control groups.
    Is it necessary to satisfy the balance criteria if I am using an ML model to predict the counterfactuals? Is it okay if some confounders are not balanced between the treatment and control groups? Would really appreciate help with this.

  • @02dddanny
    @02dddanny 2 years ago

    at 10:06 you mention that the age difference was large enough to warrant age being labeled as a confounding variable. what exactly was the magnitude of difference that leads to that assumption? if the age means were 35 and 40, would that be a large enough difference? thanks.

    • @CodeEmporium
      @CodeEmporium  2 years ago

      You have 2 distributions of ages. You can conduct a statistical test to check if the 2 distributions are different. If the difference is significant, then yes.
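One way to run such a test is sketched below. The reply does not name a specific test, so the exact permutation test on the difference in mean age is an assumption made for illustration, and the ages are hypothetical rather than taken from the video.

```python
from itertools import combinations
from statistics import mean

# Hypothetical ages, not data from the video.
treatment_ages = [28, 31, 35, 38, 43]
control_ages = [58, 61, 66, 69, 71]


def permutation_p_value(a, b):
    """Two-sided exact permutation test on the difference in means."""
    pooled = a + b
    total = sum(pooled)
    observed = abs(mean(a) - mean(b))
    at_least_as_extreme = 0
    n_splits = 0
    # Enumerate every way of relabelling len(a) of the pooled ages as "treatment".
    for idx in combinations(range(len(pooled)), len(a)):
        sum_a = sum(pooled[i] for i in idx)
        diff = abs(sum_a / len(a) - (total - sum_a) / len(b))
        n_splits += 1
        if diff >= observed - 1e-12:
            at_least_as_extreme += 1
    return at_least_as_extreme / n_splits


p = permutation_p_value(treatment_ages, control_ages)
print(f"p-value: {p:.4f}")
```

A small p-value suggests the two age distributions differ by more than chance relabelling would explain. Full enumeration is only practical for small groups; larger samples would need random resampling or a parametric test.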

  • @sanjaykrish8719
    @sanjaykrish8719 8 months ago

    Great video. small correction, pronunciation is causal and not caushal

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w 1 year ago

    An idea. Can you consider a video comparing this to Bayesian network.

  • @patpattamon
    @patpattamon 9 months ago

    Thank you so much.

  • @spencerantoniomarlen-starr3069

    Could you please upload a separate dedicated tutorial teaching us how to say "efficacy" the way you do!

  • @HGZie
    @HGZie 20 days ago

    Thanks

  • @pushkarparanjpe
    @pushkarparanjpe 1 year ago

    Thanks!

  • @taotaotan5671
    @taotaotan5671 2 years ago

    Does including the confounder variable in a multiple regression also “control” for false association?

    • @leongyinyee2182
      @leongyinyee2182 2 years ago

      Yeah, I have the same question. But the matching approach seems very common in biostatistics. I am still figuring out their difference.

  • @isaidhs
    @isaidhs 7 months ago

    gold

  • @KnowNothingJohnSnow
    @KnowNothingJohnSnow 2 years ago

    wow!!!! your explanation is better than my epidemiology professor's. thanks a lot!!! By the way, is there any recommended paper on RCT design or on Causal Inference?

    • @CodeEmporium
      @CodeEmporium  2 years ago

      Thank you! As for specific resources, I put them in the description of the video. I don't think there is a single research paper that is one-size-fits-all for the topic, but a collection of these resources does paint a good picture. Also, the next video's description has other resources from a Machine Learning perspective

    • @KnowNothingJohnSnow
      @KnowNothingJohnSnow 2 years ago

      @CodeEmporium Thank u so much! I learn a lot from your channel

  • @markella3699
    @markella3699 2 years ago

    Would age in this case be an effect modifier?

  • @primary0207
    @primary0207 22 days ago

    Nice

  • @vaffapsychology6143
    @vaffapsychology6143 2 years ago

    Your voice is soo ..beautiful 😍

  • @1UniverseGames
    @1UniverseGames 2 years ago

    Sir, can you make a video on using DeepSpeed on Pytorch Cifar10? How to implement it on it.

    • @CodeEmporium
      @CodeEmporium  2 years ago

      I can look into this and see what the most palatable format is for a video. Thanks for the suggestion

  • @Anonymous-be4qf
    @Anonymous-be4qf 1 year ago

    50% recovered, RIP the other 50% elderly 😭

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w 1 year ago

    Can causal inference be applied to time series data?

  • @karannchew2534
    @karannchew2534 7 months ago

    "Control other effect through randomisation"

  • @goldmandrummer
    @goldmandrummer 1 year ago

    The counterfactuals seem questionable... Is it really reasonable to say Sam would not get better with the treatment if he did get better without the treatment? That seems highly unlikely, doesn't it?...and the inverse for Rondo seems highly unlikely as well...
    I'm admittedly clueless about statistics but I'm always on the lookout for bad logic and this was a red flag for me.
    I don't mean to suggest a bad example on your part but rather that, in general, it seems there is a huge opening for error to sneak in through counterfactuals.

  • @twoheadsy
    @twoheadsy 1 year ago

    David Cross, is that you?

  • @uctube5502
    @uctube5502 1 year ago +1

    Your face being a distraction is some sort of error, you should release one with it to find the counterfactual

  • @kurtgodel5236
    @kurtgodel5236 1 month ago

    caushl? caushl?

  • @dato007
    @dato007 2 years ago

    This is missing statistical testing. All these are potentially non significant marginal results.

  • @AllUserNamesTaken111
    @AllUserNamesTaken111 2 years ago +1

    does this guy not know how to pronounce "causal"?

  • @ExtraterrestrialIntelligence
    @ExtraterrestrialIntelligence 2 years ago

    I am a causality denier! I don't believe in causality. At least not the causality that we are familiar with. I think we need higher-order logic of at least the 69th degree to come up with an explanation for causality. I don't wear a tinfoil hat. I wear a quantum metamaterial protective helmet.