Link functions for GLMs... MADE EASY!!!

Поділитися
Вставка
  • Опубліковано 9 січ 2025

КОМЕНТАРІ • 16

  • @Arct22
    @Arct22 2 місяці тому +2

    Man this is pure gold. No BS, just esence. Subscribed!

  • @sabinaharding1990
    @sabinaharding1990 6 місяців тому +2

    This is a great explanation. I love the visuals showing how they are all related. Thank you.

  • @Akhil.Velati
    @Akhil.Velati 5 місяців тому +2

    Can u create a whole playlist for the GLM's. Please do consider doing this

  • @Indioharp
    @Indioharp Місяць тому

    Great explanation Brian! I have a small question, though. If the response variable has to be normal (in a normal linear regression), why do you think most statistics articles insist that only the residuals have to be normal and not the variable? What tests do you think should be done before a GLM, besides residual plots?

    • @statswithbrian
      @statswithbrian  Місяць тому +1

      Saying the response is normal and the residuals are normal means the same thing basically. The response is normal (around the mean for that X value), which just means the response’s distance from the mean (residual) is normal with mean 0. If we want to evaluate normality of residuals, it’s then easier to look at a graph of residual since they all have the same mean so we can easily visualize if they seem normally distributed.

    • @Indioharp
      @Indioharp Місяць тому

      @@statswithbrian Thank you.

  • @brazilfootball
    @brazilfootball 3 місяці тому

    Great work, quick question! Why is it ok to use a normal distribution for response variables like weight if weight can't be negative, or zero? I see it a lot, but don't understand why it's so common.

    • @statswithbrian
      @statswithbrian  3 місяці тому +1

      There's pretty much nothing that *really* follows a normal distribution - it's all approximations. Take height for example - and suppose the height follows an approximately normal distribution with mean = 64 inches and sd = 4 inches. Even though a normal distribution has some probability of being less than 0 (which is impossible), because that is 16 standard deviations away from the mean, the probability is basically 0 anyways (less than 1 in a billion billion billion billion billion billion). So yes, you're totally right that it's impossible, but assuming it's normal makes things easy and the probability calculations are often pretty accurate!

    • @brazilfootball
      @brazilfootball 3 місяці тому

      @@statswithbrian Works for me, thank you!

  • @DrewAlexandros
    @DrewAlexandros 3 місяці тому

    In your final slide, you say that the link function maps from the original scale to "the parameter of the relevant probability distribution". You also say the parameter is personalised....
    Is your final slide saying that in general, the link function maps to the parameter of the data's distribution? e.g. "p" in Bernoulli, "sigma" in Rayleigh?
    Apologies if i haven't understood this correctly.

    • @statswithbrian
      @statswithbrian  3 місяці тому

      Yes, the link function is just transforming a real number with no restrictions (negative infinity to infinity) to something with the correct possibilities for the parameter of interest.
      In logistic regression, if we were predicting the probability of having diabetes based on weight, you and me would each get a personalized parameter p based on our weight. The heavier person might have p = 0.7, reflecting the fact that their weight makes it more likely that they may have diabetes. The lighter person might have p=0.3. But they will both be between 0 and 1 no matter eat because the link function transformed the scale to ensure that it’s between 0 and 1, which regular linear regression did not do.

  • @gabrielplzdks3891
    @gabrielplzdks3891 8 місяців тому

    But you missed the best part, how we can engineer any combination we want to fit our data. We can model different types of trends, heteroscedasticity and of course, sample from either pdf or pmf. They are incredibly flexible.
    By the way, ultimately what's the scope of this channel? Can we eventually expect videos on things like measure theoretic probability, stochastic processes and the like?

    • @statswithbrian
      @statswithbrian  8 місяців тому +1

      There might be one video on measure theory sometime, but no, I plan to stick more on the statistics and data science end. Any more probability videos would probably be similar to the Markov/Chebyshev's inequality videos.

  • @santiagodm3483
    @santiagodm3483 8 місяців тому +1

    Finally it came!!!

  • @qkdnrnskfirnsvabk
    @qkdnrnskfirnsvabk 7 місяців тому

    Thanks!