How to check assumptions of linear regression in Python | How to check linear regression assumptions

Поділитися
Вставка
  • Опубліковано 26 лип 2024
  • How to check assumptions of linear regression in Python | How to check linear regression assumptions
    #LinearRegressionAssumptions #UnfoldDataScience
    Hello ,
    My name is Aman and I am a Data Scientist.
    About this video:
    In this video, I show the python explanation of how to check assumptions of linear regression in python. I show the demo and give explanation of checking assumptions of linear regression in python. Below topics are explained in this video:
    1. How to check assumptions of linear regression in Python
    2. How to check linear regression assumptions
    3. Checking assumptions of linear regression in python
    4. Checking linear regression assumptions in python
    5. check assumptions of linear regression in python
    About Unfold Data science: This channel is to help people understand basics of data science through simple examples in easy way. Anybody without having prior knowledge of computer programming or statistics or machine learning and artificial intelligence can get an understanding of data science at high level through this channel. The videos uploaded will not be very technical in nature and hence it can be easily grasped by viewers from different background as well.
    If you need Data Science training from scratch . Please fill this form (Please Note: Training is chargeable)
    docs.google.com/forms/d/1Acua...
    Book recommendation for Data Science:
    Category 1 - Must Read For Every Data Scientist:
    The Elements of Statistical Learning by Trevor Hastie - amzn.to/37wMo9H
    Python Data Science Handbook - amzn.to/31UCScm
    Business Statistics By Ken Black - amzn.to/2LObAA5
    Hands-On Machine Learning with Scikit Learn, Keras, and TensorFlow by Aurelien Geron - amzn.to/3gV8sO9
    Ctaegory 2 - Overall Data Science:
    The Art of Data Science By Roger D. Peng - amzn.to/2KD75aD
    Predictive Analytics By By Eric Siegel - amzn.to/3nsQftV
    Data Science for Business By Foster Provost - amzn.to/3ajN8QZ
    Category 3 - Statistics and Mathematics:
    Naked Statistics By Charles Wheelan - amzn.to/3gXLdmp
    Practical Statistics for Data Scientist By Peter Bruce - amzn.to/37wL9Y5
    Category 4 - Machine Learning:
    Introduction to machine learning by Andreas C Muller - amzn.to/3oZ3X7T
    The Hundred Page Machine Learning Book by Andriy Burkov - amzn.to/3pdqCxJ
    Category 5 - Programming:
    The Pragmatic Programmer by David Thomas - amzn.to/2WqWXVj
    Clean Code by Robert C. Martin - amzn.to/3oYOdlt
    My Studio Setup:
    My Camera : amzn.to/3mwXI9I
    My Mic : amzn.to/34phfD0
    My Tripod : amzn.to/3r4HeJA
    My Ring Light : amzn.to/3gZz00F
    Join Facebook group :
    groups/41022...
    Follow on medium : / amanrai77
    Follow on quora: www.quora.com/profile/Aman-Ku...
    Follow on twitter : @unfoldds
    Get connected on LinkedIn : / aman-kumar-b4881440
    Follow on Instagram : unfolddatascience
    Watch Introduction to Data Science full playlist here : • Data Science In 15 Min...
    Watch python for data science playlist here:
    • Python Basics For Data...
    Watch statistics and mathematics playlist here :
    • Measures of Central Te...
    Watch End to End Implementation of a simple machine learning model in Python here:
    • How Does Machine Learn...
    Learn Ensemble Model, Bagging and Boosting here:
    • Introduction to Ensemb...
    Build Career in Data Science Playlist:
    • Channel updates - Unfo...
    Artificial Neural Network and Deep Learning Playlist:
    • Intuition behind neura...
    Natural langugae Processing playlist:
    • Natural Language Proce...
    Understanding and building recommendation system:
    • Recommendation System ...
    Access all my codes here:
    drive.google.com/drive/folder...
    Have a different question for me? Ask me here : docs.google.com/forms/d/1ccgl...
    My Music: www.bensound.com/royalty-free...

КОМЕНТАРІ • 40

  • @gowthamsr3545
    @gowthamsr3545 2 роки тому +3

    There are so many videos explaining the assumptions of linear regression but no one was explaining how to do it............I was searching for this from last 3months,,, thank you sir.....
    Thank you 🙏

  • @nikhilgupta4859
    @nikhilgupta4859 2 роки тому +2

    Heyy Aman, I am your subscriber from past 1.5 year and I feel honoured to tell you, after following you I finally got a job transition as a senior data scientist at an MNC 6 month back. Now I have understood the datascience project ecosystem in my company. You are one of the contributors for my success.
    Thanks a Ton!!!!!
    Also I would like to open my hands for helping learners. So learners you can tag me asking any doubts. I would be more than happy helping you.

    • @UnfoldDataScience
      @UnfoldDataScience  2 роки тому +1

      Thanks Nikhil, your comments are precious.

    • @nikhilgupta4859
      @nikhilgupta4859 2 роки тому

      @@UnfoldDataScience Thank you Aman!!

    • @UnfoldDataScience
      @UnfoldDataScience  Рік тому

      Join here for free to share your experience live with me
      www.lighthall.co/class/6fdae050-85a4-48c4-9ecc-8dd2fa6ff175

  • @spicytuna08
    @spicytuna08 Рік тому

    among all uTubers, i would rank this guy #1 for insightfulness. what a gift of teaching!!!

  • @mayankmehta8480
    @mayankmehta8480 2 роки тому +1

    Well explained. I was very useful. Pls continue uploading lectures.

  • @commonNigerian
    @commonNigerian 2 роки тому +1

    You're the best bro! I understood this explanation of yours more than I understood anybody else's. I've also saved a copy of the notebook (from Google Drive) and imported it into my DataSpell IDE so I can easily refer to it whenever I want to check assumptions - I've heard that memorising syntax is not important as long as one understands the logic :). Much love from Nigeria. Subscribed and liked. Cheers!

    • @UnfoldDataScience
      @UnfoldDataScience  2 роки тому

      Thanks, Goriola. Your words mean a lot to me. Keep learning.

  • @shekharkumar1902
    @shekharkumar1902 2 роки тому

    The title of your Channel could be " unfolding the untold data science" . Aman ji you reach and teach what no one dare to teach or explain . Amazing job!

  • @developerboy8341
    @developerboy8341 2 роки тому

    Very intuitive video.

  • @asitnayak636
    @asitnayak636 2 роки тому +1

    Clean !!!

  • @kalam_indian
    @kalam_indian 2 роки тому

    you are excellent always sir

    • @UnfoldDataScience
      @UnfoldDataScience  2 роки тому +1

      Thanks for your positive feedback. Please share with others as well who could be benefited from such content.

  • @sixlifeofficial4820
    @sixlifeofficial4820 Рік тому

    Great video! I wanted to ask, in order to find whether there is linearity in the first 4 scatterplots, shouldn't one plot the line of best fit? Also, regardless of whether that's true or not, how would one plot the line of best fit?
    "a, b = np.polyfit(X,Y,1)
    plt.plot(X,a*X+b)" doesn't seem to do anything and obviously im doing something wrong I'm just not sure what

  • @anmolanreja1227
    @anmolanreja1227 Рік тому

    Can you please tell me if we have large data means we have more than 30 columns in our data so linear regression will be good for training that data or not?

  • @naveenkrish4037
    @naveenkrish4037 2 роки тому

    When do we need to check these assumptions? After we do the Train Test Split and prediction, or before the start of Train Test Split?

  • @andrewhenderson593
    @andrewhenderson593 Рік тому

    How can we make a Residuals vs. Leverage plot which displays the Cook's distance?

  • @ajaykushwaha4233
    @ajaykushwaha4233 2 роки тому

    Hi Aman, if we have encoded any categorical variables to numerical variables by count or frequency, or by onehotencoding or by top categories by any mean so that columns also need to be converted to Gaussian distribution form ?

    • @UnfoldDataScience
      @UnfoldDataScience  2 роки тому +2

      Good question. No, not needed because gaussian is primarily defined for continious only.

  • @beautyisinmind2163
    @beautyisinmind2163 2 роки тому

    Sir, does it mean relation between predictor variable and target variable isn't linear means it does non follow Normality condition(Non normally distributed) and vice versa?

    • @UnfoldDataScience
      @UnfoldDataScience  2 роки тому

      No, it does not mean it. Neither way. It's quite possible that a variable is not normally distributed however has a linear relationship with target variable and vice versa.

  • @sanyamsingh4907
    @sanyamsingh4907 2 роки тому

    Hi Aman can you Please advise us on "How to keep the relations with different Data Science Managers long lasting on Linked In?

  • @pradeept328
    @pradeept328 11 місяців тому

    Actually what should be normal in linear regression? The training data(target variables) or the residuals only?

    • @UnfoldDataScience
      @UnfoldDataScience  11 місяців тому

      Both ideally

    • @pradeept328
      @pradeept328 11 місяців тому

      But after we create a OLS regression model only we get the residuals and residuals normal distribution checks right?
      But we are saying normality of residuals should be confirmed before making the regression model.
      How the both cases satisfies each other?

  • @chalmerilexus2072
    @chalmerilexus2072 2 роки тому

    One question though not related to this video. At what point we do train - test split. BEFORE preprocessing like normalization, imputation etc or AFTER preprocessing?

  • @karthebans2420
    @karthebans2420 2 роки тому

    Can you please explain about errors and residuals, I am not able to get the concept clearly in websites