5 Probability Distributions you should know as a Data Scientist

Поділитися
Вставка
  • Опубліковано 24 лип 2024
  • Here are a few probability distributions you should know as a data scientist
    Follow me on M E D I U M: towardsdatascience.com/likeli...
    JOIN US ON DISCORD: / discord
    SPONSOR
    Kite is a free AI-powered coding assistant that will help you code faster and smarter. The Kite plugin integrates with all the top editors and IDEs to give you smart completions and documentation while you’re typing. I've been using Kite. Love it!
    Learn more: www.kite.com/get-kite/?...
    REFERENCES
    [1] Where do we see normal distributions? stat.ethz.ch/~stahel/lognorma...
    [2] Useful probability Distributions with applications: towardsdatascience.com/6-usef...
    [3] Assumption of Normality explained very well here: www2.psychology.uiowa.edu/facu...
    [4] Monte Carlo Simulations in practice: • Running Simulations as...

КОМЕНТАРІ • 46

  • @rishisharma8311
    @rishisharma8311 2 роки тому +11

    The real life examples for each of the distribution were amazing !!

    • @CodeEmporium
      @CodeEmporium  2 роки тому

      Glad you liked them. Many thanks :)

  • @hkumar7340
    @hkumar7340 2 роки тому +12

    1:00 Normal Distribution
    5:37 Log-normal Distribution
    7:30 Uniform Distribution
    8:48 Beta Distribution
    10:33 Chi-squared Distribution.

  • @mohammadrezaghiasy6618
    @mohammadrezaghiasy6618 2 роки тому

    Hey buddy. Awesome as always. THANK YOU 💓

  • @bipinkapri9986
    @bipinkapri9986 2 роки тому +1

    That was really helpful! Amazing content!

    • @CodeEmporium
      @CodeEmporium  2 роки тому

      Many thanks and very glad you enjoyed it :)

  • @harry8175ritchie
    @harry8175ritchie Рік тому

    Counts are often distributed under a Poisson. The domain is very important to mention when selecting a distribution, and the discrepancy between probability mass functions and density functions. Keep it up man, love your stuff.

  • @McMurchie
    @McMurchie 2 роки тому +5

    Ahhh, there are like a million Data/ML channels but this is still the only one I subscribe to (after being burned a bit by Siraj). Love this guys ability to articulate complex phenomena in a way that makes sense.

    • @CodeEmporium
      @CodeEmporium  2 роки тому

      Thanks for being a part of the community 🙂

    • @McMurchie
      @McMurchie 2 роки тому

      @@CodeEmporium pleasure!

  • @ronin2963
    @ronin2963 2 роки тому

    Nice summary of five different topics that could be their own lessons

    • @CodeEmporium
      @CodeEmporium  2 роки тому

      Thank you. Will def dive into these topics in thier videos in some consumable form. I just need to think of the best way to deliver this content

  • @hamzadata
    @hamzadata 6 місяців тому

    Man you are awesome!

  • @gokulkurup1584
    @gokulkurup1584 2 роки тому

    Really good content

  • @yensteel
    @yensteel 2 роки тому +2

    Is there a way to create a custom probability distribution from a sample dataset? It can then generate new data with similar characteristics while remaining completely continuous?

    • @CodeEmporium
      @CodeEmporium  2 роки тому +2

      Yes that is possible. In python, scipy has distributions where you call a "fit" function and pass in sample data. For example, check out scipy.beta.fit.

    • @yensteel
      @yensteel 2 роки тому

      @@CodeEmporium Thank you so much for the reply!

    • @harry8175ritchie
      @harry8175ritchie Рік тому +1

      @@CodeEmporium Man, I feel like I'm really annoying here. I'm sorry! Be careful with this. Understand your data first: if there's any domain expertise you can throw into this, the data may be enforced to be a certain distribution, despite it not looking like it yet.
      For example: counting the number of times you see cars drive past your house within one hour blocks. Maybe you collect a handful of data. You notice a small tail at 2-5 cars, a peak at 7 cars, and a tail at 10 cars. You might think this is normal, but from the definition of the experiment, this is indeed a Poisson distribution: counting within set intervals.

  • @erickballesteros4531
    @erickballesteros4531 2 роки тому

    good vid :)

  • @scott7948
    @scott7948 2 роки тому

    You missed tweedie distribution which is used in insurance modelling

  • @doristhebrowndog
    @doristhebrowndog Рік тому

    how are y’all so smart… i left everything i learned about statistics back at where it started, at Uni :(

    • @CodeEmporium
      @CodeEmporium  Рік тому +2

      Honestly I did the same. But the more you work with this stuff on applications, the better you’ll remember it. :)

  • @timz2917
    @timz2917 5 місяців тому

    The sample means can still be normal even if the samples arent

  • @SiyaMedia
    @SiyaMedia 2 роки тому

    poison ooops we need to talk about the poisson distribution as well

  • @monkyebrain
    @monkyebrain 2 роки тому +1

    Weibull gang stand up!

  • @k.alipardhan6957
    @k.alipardhan6957 2 роки тому

    start at 1:00

    • @k.alipardhan6957
      @k.alipardhan6957 2 роки тому

      i think 4 & 5 needed much more details, as much as we got for 1. but good video, thank you

  • @lucio8794
    @lucio8794 2 роки тому

    My man, I love your videos, but the audio is often out of sync, just a heads up

    • @CodeEmporium
      @CodeEmporium  2 роки тому

      Yep. Thanks for the heads up. I'm trying to get better with this for future videos :)

  • @dragonman101
    @dragonman101 2 роки тому

    does anyone else see a lag between audio and video?

    • @CodeEmporium
      @CodeEmporium  2 роки тому

      Sorry about that. It happens a couple of times through the video. Will try to correct for future videos

    • @dragonman101
      @dragonman101 2 роки тому

      @@CodeEmporium no worries! :) I just couldn't tell if the issue was my computer or the video itself hahaha

  • @shaelanderchauhan1963
    @shaelanderchauhan1963 2 роки тому

    Data is is just a game of giving 100 different fancy names for the same concept to make it Extremely confusing for learners

  • @larrybird3729
    @larrybird3729 2 роки тому

    no gamma :(

  • @tusharbedse9523
    @tusharbedse9523 2 роки тому

    R u lipsing bro

    • @CodeEmporium
      @CodeEmporium  2 роки тому +1

      Nah. It's your imagination

    • @tusharbedse9523
      @tusharbedse9523 2 роки тому

      @@CodeEmporium thanks for replying.... Was watching sm of ur videos ...awseome stuff...thanks!!

  • @ssshukla26
    @ssshukla26 2 роки тому

    One those videos where it's implicitly assumes that you know stats before hand and explicitly follow that assumption throughout the video...

    • @CodeEmporium
      @CodeEmporium  2 роки тому

      I think only the normal distribution is technical here. The other 4 are a lot easier to pick up. Looking back, maybe could have easier explained the normal distribution. But I'll keep this mind for other videos

  • @kushagrachaturvedi2144
    @kushagrachaturvedi2144 Рік тому

    when i hear u first time its very weird u r voice does not match u. means don't know why its feels like that u r lisping and someone else is talking