Це відео не доступне.
Перепрошуємо.

Interaction Terms in Stata

Поділитися
Вставка
  • Опубліковано 19 чер 2017
  • This video will explain how to use Stata's inline syntax for interaction and polynomial terms, as well as a quick refresher on interpreting interaction terms.

КОМЕНТАРІ • 88

  • @timocallaghan4408
    @timocallaghan4408 3 роки тому +39

    finally I have a clue at what I am looking at in this assignment. You have a statistically significant positive effect on many of our grades!

  • @AN-yr7nm
    @AN-yr7nm 4 роки тому +3

    Dear Nick, big thanks, you just helped one more thesis! I really wish you that you get to enjoy the help you've been giving us!

  • @BB-se9bl
    @BB-se9bl 6 років тому +33

    Dude you’re saving my thesis. Thanks!

  • @abdullahrabaya9235
    @abdullahrabaya9235 4 роки тому +1

    Great video, for me this is first time I saw analysis of moderating variable like this simple way. Thank you so much for time n efforts.

    • @abdullahrabaya9235
      @abdullahrabaya9235 4 роки тому

      I have some questions
      1) what are the main differences between (##)(#) and which case I can interpret the results. For me, (#) seems to me like you split sample.
      2) when you calculate the total effect of moderator (b3 &b4), the P-value were not significant. So how you can do that?
      3) is there any academic paper published use this method? Pls If you have could you please share with me abdullah.rabaya@siswa.ukm.edu.my. Thank you in advance

  • @cheerioz83
    @cheerioz83 2 роки тому

    Wow. Just what I was looking for. Thank you!

  • @Garrettthethief
    @Garrettthethief 6 років тому

    A really valuable video. THANK YOU NICK!

  • @bapaknyaayadanfathan4686
    @bapaknyaayadanfathan4686 2 роки тому

    thank you.. you really save my dissertation.

  • @takesuretozooneyi4836
    @takesuretozooneyi4836 5 місяців тому

    video of the century

  • @amyheadley6429
    @amyheadley6429 4 роки тому +3

    Hi Nick, thanks for the video. Just wondering how to interpret odds ratios following a regression using the logistic command? Is the odds ratio for the interaction term still an additional effect or is it interpreted differently?
    Many thanks

    • @bmaclin
      @bmaclin Рік тому

      I have the same question!

  • @cate-pillar
    @cate-pillar 5 років тому +1

    Thank you Nick, this really helped!

  • @elizabethdonkor3274
    @elizabethdonkor3274 4 роки тому +1

    Hi Nick, thanks for the video. Quick question pls, do we always have to include the IV's in the model if we only interested in the interaction results? So say just having the control variables, interaction terms and DV only?

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  4 роки тому +1

      It's pretty rare that you'd want to leave out the independent variables, as doing so makes the interaction terms very difficult to interpret and sometimes meaningless. Because they have no base independent variable coefficient to contrast to, the interactions lose the "how the effect of one variable changes with values of the other" interpretation

    • @elizabethdonkor3274
      @elizabethdonkor3274 4 роки тому +1

      Thank you very much Nick. I understand it now.

  • @jekamito
    @jekamito Рік тому

    thank you, crystal clear!!!

  • @Garrettthethief
    @Garrettthethief 6 років тому

    Nick, two questions:
    • If you use a fixed-effects regression instead: Is the interpretation still the same?
    • Would you present interaction after interaction or all interactions in one fixed effects regression for a presentation about these interaction effects?

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  6 років тому +1

      Keep'nCalmProductions Yes, the interpretation remains the same, with the caveat that, like everything else with a fixed effects regression, you're looking only at Within variation. And if you think there are multiple interactions, then you should be estimating the correct model with all the interactions included (unless your sample size is small enough that doing so runs you into power issues). Unfortunately that does also make the results harder to present.

  • @kselev1708
    @kselev1708 3 роки тому

    Super simple and interesting explained. Thank you very much!!!
    Dear Nick, could you maybe create a set of videos about using Stata for econometric analysis E2E? That would be soooo great and helpful for multiple master/PhD students!

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  3 роки тому

      Glad you like the material! Unfortunately I'm not planning to create any new Stata material in the near future. Plenty of E2E material in R though!

  • @dataman6744
    @dataman6744 2 роки тому

    Nicely done thanks

  • @takesuretozooneyi4836
    @takesuretozooneyi4836 5 місяців тому

    Will the interpretation of B4 remain the same if family income was a dummy?

  • @marinmes1722
    @marinmes1722 4 роки тому

    Hey Nick!
    I have a question: is there a command in Stata that makes 1=highest rank (like you have for classrank)? I'm doing an analysis which includes ranking in a Dutch football league and am curious to know how you did it. I only figured it out by making an additional variable which is constructed by doing: 19-rank (there are 18 teams in the league I am researching).
    Let me know!

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  4 роки тому

      If you have a rank variable to start wtih, doing 19-rank sounds like a pretty good way to do it. If you're constructing ranks from a raw variable you might look at the rank function in egen

  • @LenaGlitzert
    @LenaGlitzert 6 років тому

    This is just awesome. Thanks a lot!

  • @amaranwar1249
    @amaranwar1249 4 роки тому +1

    It's the mean difference between the base category and the reported category.

  • @nikitapestov7602
    @nikitapestov7602 2 роки тому

    The video sound is pretty good, beyond my imagination

  • @TheFruitInvasion
    @TheFruitInvasion 4 роки тому

    You really helped me thanks!!

  • @TinaTina-xn9on
    @TinaTina-xn9on Рік тому

    Sir, what do with the high multicollinearity between the interaction term and one of its components variable?

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  Рік тому

      It's not a concern. In fact it will be correlated with both of its components by construction.
      If the correlation is extremely high, that may indicate that the other component doesn't have much variation, at worst.
      In general multicollinearity is way overblown as a concern.

  • @JAY-ft1dw
    @JAY-ft1dw 4 роки тому

    Thank you for your video Nick, but do you also happened to know how to "outreg2" of multiple regression output at the same time? (I basically want to outreg regression output of each set of groups (300+groups) and hoping that I don't have to copy and paste that many times.....)

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  4 роки тому

      Do them one after the other. Do a regression, then do outreg2 with the "replace" option. Then do another regression and do outreg2 with the "append" option, and then do as many more append versions as you like

  • @ramachandrant2719
    @ramachandrant2719 5 років тому

    Dear Nick, Thank you for the video. How is the interpretation if the interaction term is a factor variable with three levels. For the third level, is it the difference from the omitted level or the second level?

  • @antonio23merino1
    @antonio23merino1 6 років тому

    Nick one question:
    If you use two categorical variables (sex and education) interaction, how will be the interpretation? 1=female for the first variable and for the second one the categories are: primary, secondary, and university education. Thank you so much!

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  6 років тому

      Same basic idea, but everything relative to the omitted category. So if primary is omitted, then the coefficient on secondary is "difference between secondary and primary for men" and hte coefficient on secondary plus the coefficient on female*secondary is "difference between secondary and primary for women"

    • @antonio23merino1
      @antonio23merino1 6 років тому

      thank you so much!

  • @JaredForsyth
    @JaredForsyth 3 роки тому

    So in this example, the interaction term doesn't have a significant p-value -- doesn't that mean that we don't have evidence of an interaction? And would we still add the coefficient (b4) in that case?
    Thanks!

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  3 роки тому

      In general you never want to pick which variables to include or exclude on the basis of significance. The interaction should be included or excluded on the basis of theory or which hypothesis you're trying to test (often, the interaction itself is what you're curious about so you'd need to keep it in). If your really want to determine whether to keep or remove it on a predictive basis, something like LASSO would be more principled than a significance test

  • @abeypp
    @abeypp 2 роки тому

    Hi Nick, may i ask if we can use the interaction term as a control variable in the same model? For example, age or ethnicity.

  • @emilienneyameogo3525
    @emilienneyameogo3525 4 роки тому

    Hi Nick, thank you very much. Is it possible to use interaction terms with the PMG model? Because when I try, it is telling me (note: lnKOF dropped because of collinearity//initial vector: extra parameter lnKOF found//specify skip option if necessary). How can I use all my variables in the same model with the interaction term without collinearity issues?

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  4 роки тому +1

      Yes it is. That error generally means that you have perfect multicollinearity, ie you can perfectly predict lnKOF with a linear mix of your other variables. Try running the same model as a regression and you should get the range result. Your model is misspecified

    • @emilienneyameogo3525
      @emilienneyameogo3525 4 роки тому

      @@NickHuntingtonKlein I will try that. Thank you.

  • @eddiele644
    @eddiele644 4 роки тому

    So when do we actually interact our variables? Is there a way to see if it is necessary or do we just do it and then see if the coefficient on the interaction term is statistically significant?

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  4 роки тому

      Ideally, build your model using theory. Should the effect of one variable be mediated by another? But outside of that, yes, adding the interaction and checking for significance is commonly done.

    • @eddiele644
      @eddiele644 4 роки тому

      @@NickHuntingtonKlein Thanks Nick

  • @dehiole6463
    @dehiole6463 4 місяці тому +1

    is it still true if i put white = 1; not white = 2??? 6:50

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  4 місяці тому

      Yep it will work for any value of white (although in this case white is binary so it can only be 0 or 1 anyway, but in a case with a variable with a wider range, yes)

  • @camillepouchucq1133
    @camillepouchucq1133 4 роки тому

    Hi Nick, thank you very much. Is it possible to use interaction terms with stepwise selection ? Because when I try (like this for example : xi : stepwise, pr(.2) lockterm1 : logistic success (i.classage i.sex) i.surgery##i.sex) it answers = "interactions not allowed"
    Thank you for your help.

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  4 роки тому

      Sounds like it's not compatible with interactions (some commands aren't). You can usually use interactions anyway by creating them as their own variables, but in the case of stepwise regression in guessing that would make it work improperly since it wouldn't know what to drop. Maybe do a LASSO instead? That's usually better than stepwise anyway

  • @obeydzomonda9821
    @obeydzomonda9821 3 роки тому

    Hi Nick. Thank you for the video. I wanted to ask if its possible to test moderation effect on 5 independent variables and one dependent variable using panel data in stata

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  3 роки тому +1

      I don't see why that wouldn't be possible. Anything where you're testing that many independent variables for their effects at once is going to be a bit jumbled and likely difficult to identify though

    • @obeydzomonda9821
      @obeydzomonda9821 3 роки тому

      Thank you for your response. Well appreciated.
      Regards
      Obey

  • @andersengbergpetersen4437
    @andersengbergpetersen4437 5 років тому

    Really great video! I'm looking at social origin effect on occupational attainment in three different countries and want to run a regression with an interaction between the social origin dummies and country. However, in the dataset (ESS) the country variable is a string variable (no numbers assigned). Is it possible to do an interaction when one of the variables is a string variable?

  • @shanschannel2682
    @shanschannel2682 6 років тому

    this was really helpful thank you!

  • @AK-md1mr
    @AK-md1mr 4 роки тому

    Hey Nick,
    First of all thanks a lot for your video!!
    Can I use interaction terms with longitudinal data as well? For example I want to look how the effect of "female" and "r_white" on "classrank" change in the last lets say 4 years ("year"). Would xtreg classrank female##year r_white##year give me any information about that?

  • @WeyNurSame
    @WeyNurSame 5 років тому

    Thank you so much my man

  • @blessingsam7198
    @blessingsam7198 3 роки тому

    Very helpful, Please can I generate a variable instead though, like gen whitefeminc= 1.white*feminc, would this count as an interaction term too, I want my table to look good, without the # in the variable name.

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  3 роки тому

      That would work (without the 1.). It will stop functions like margins from working properly afterwards though. Generally if you want the table to look nice it's better to do it properly with the interaction in the regression, and then use a table making function like outreg2 to change the variable names

    • @blessingsam7198
      @blessingsam7198 3 роки тому

      @@NickHuntingtonKlein thank you.

  • @DPREY1234
    @DPREY1234 Рік тому

    great!!!!!

  • @annedenhartog9389
    @annedenhartog9389 6 років тому +1

    Can you also use control variables?

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  6 років тому +1

      Yep, just add 'em in there.

    • @annedenhartog9389
      @annedenhartog9389 6 років тому

      Thankyou! I want to interpret if income differs between Men and women within different education levels (middle and upper). So I get interaction term 0.2258 for middle if someone is a male and 0.7203 for upper is someone is male. How do I interpret that?

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  6 років тому

      I'd recommend writing out your regression equation and plugging in values like in the video. I couldn't really say the interpretation without knowing your model.

  • @CoryBradley
    @CoryBradley 5 років тому

    This is tremendous. Does anyone know if a video like this exists for logistic regressions?

    • @ellenkileyfriends
      @ellenkileyfriends 11 місяців тому

      Mike Crowson has a ton of great videos on logistic

  • @anikaholmnielsen
    @anikaholmnielsen 6 років тому

    Very helpful!!!!

  • @dr.wanmaslizawanmohammad349
    @dr.wanmaslizawanmohammad349 5 років тому

    tq sir really helpful

  • @thommievg6008
    @thommievg6008 5 років тому +1

    legend

  • @selcenzorlu
    @selcenzorlu 3 роки тому

    Where can we download the data you are using?

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  3 роки тому

      I don't think I have this data on hand any more, but it comes from the NLS Investigator

  • @Non-disjunction
    @Non-disjunction 3 роки тому

    Learned some tricks with the i. and c. commands, but overall this video went quite fast and I had to pause a lot.

  • @FemkeHuisman
    @FemkeHuisman 3 роки тому

    Wait, but when should you use # and when ##?

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  3 роки тому +1

      # is for interaction only, ## is for interaction plus each variable by itself

  • @jawadtariq88
    @jawadtariq88 4 роки тому

    I am so confused right now, can any one tell me that is moderation and interaction are same ??

    • @NickHuntingtonKlein
      @NickHuntingtonKlein  4 роки тому

      Roughly, if z moderates the relationship between x and y, then the coefficient on x will shrink when you include z as a control. If z mediates the relationship between the two, then the coefficient on the interaction between x and z will be nonzero.

  • @deeg1385
    @deeg1385 5 років тому

    Thank u

  • @tomsrol
    @tomsrol 3 роки тому

    thanks nerd

  • @ellenkileyfriends
    @ellenkileyfriends 11 місяців тому

    Helpful but moves too fast for me.

  • @matto4187
    @matto4187 3 роки тому

    dear nick, i kiss you on the mouth

  • @deeg1385
    @deeg1385 5 років тому

    Thank u