Python Feature Scaling in SciKit-Learn (Normalization vs Standardization)

Поділитися
Вставка
  • Опубліковано 5 сер 2023
  • Today we take a look at how we can apply feature scaling to data sets within scikit-learn in python. This is useful when applying Normalization or standardization to data which allows for machine learning models to perform better.
    Dataset is available on my Github
    Interested in discussing a Data or AI project? Feel free to reach out via email or simply complete the contact form on my website.
    📧 Email: ryannolandata@gmail.com
    🌐 Website & Blog: ryannolandata.com/
    🍿 WATCH NEXT
    Scikit-Learn and Machine Learning Playlist: • Scikit-Learn Tutorials...
    Cross Validation: • A Comprehensive Guide ...
    Train Test Split: • Train Test Split with ...
    Logistic Regression: • Hands-On Machine Learn...
    MY OTHER SOCIALS:
    👨‍💻 LinkedIn: / ryan-p-nolan
    🐦 Twitter: / ryannolan_
    ⚙️ GitHub: github.com/RyanNolanData
    🖥️ Discord: / discord
    📚 *Practice SQL & Python Interview Questions: stratascratch.com/?via=ryan
    WHO AM I?
    As a full-time data analyst/scientist at a fintech company specializing in combating fraud within underwriting and risk, I've transitioned from my background in Electrical Engineering to pursue my true passion: data. In this dynamic field, I've discovered a profound interest in leveraging data analytics to address complex challenges in the financial sector.
    This UA-cam channel serves as both a platform for sharing knowledge and a personal journey of continuous learning. With a commitment to growth, I aim to expand my skill set by publishing 2 to 3 new videos each week, delving into various aspects of data analytics/science and Artificial Intelligence. Join me on this exciting journey as we explore the endless possibilities of data together.
    *This is an affiliate program. I may receive a small portion of the final sale at no extra cost to you.
  • Наука та технологія

КОМЕНТАРІ • 17

  • @user-rw3vn5om3t
    @user-rw3vn5om3t 4 місяці тому +5

    underrated channel great video

  • @v.jananayagan3284
    @v.jananayagan3284 Місяць тому +1

    you teach very well than other channels but i don't know why pepoles are not spend time on your channel really helpfull man

  • @sandeep-kc9hs
    @sandeep-kc9hs 25 днів тому

    learned a lot from this. excellent teaching🙌

  • @qaisshefa4846
    @qaisshefa4846 22 дні тому

    Thanks so much

  • @Welcomereddy
    @Welcomereddy 11 місяців тому +1

    Excellent brother !

  • @sara-sx7gm
    @sara-sx7gm Місяць тому

    Helpful . Thank you so much

  • @onurbltc
    @onurbltc 11 місяців тому +1

    Great video!

  • @lilikoimahalo
    @lilikoimahalo 7 місяців тому

    Could you also explain how the choice of feature_range affects the output processing please? Trying to understand in which case it should be (0,5) and when it should be (0,10), and how you then interpret the output, for example? Also, I am wondering: you are applying scalers to the whole dataset, but what if you have a regression type task (predicting an actual number)? If you apply scalers to all columns then your targets also change

  • @lancerkind
    @lancerkind 4 місяці тому

    Very good video! I learned a lot. If I was to ask for more, it would be to fill in WHY normalize or standardized. You mention some about “getting your numbers in order.” Add to that there are reasons for visualization tools, comparison analysis, and whatever else. I have some ideas why, but I’m guessing as a Pandas user you have encountered many more.
    Thank you for sharing.

    • @RyanNolanData
      @RyanNolanData  4 місяці тому

      No problem and I may make a statistics course video in the future. Just waiting on my job to apply more skills

  • @rishikeshjadhav4774
    @rishikeshjadhav4774 20 днів тому +1

    can you please post the jupyter notebook containing code , it will be very healpful

    • @RyanNolanData
      @RyanNolanData  19 днів тому

      Will be on my website soon, I’m moving the code from the vids into articles