Feature Scaling | Standardization Vs Normalization | Data Preprocessing | Python | Data Magic

Поділитися
Вставка
  • Опубліковано 14 жов 2024

КОМЕНТАРІ • 26

  • @Iranvlog_
    @Iranvlog_ 3 роки тому +2

    hi thank you. Totally you mean we can use standaridzation for any type of data distribution for example non normal distribution like uniform. and it is recommended to use normalization just for normally distributed data sets? Does the normalization or standasdization change the distribution of a data?

    • @DataMagicAI
      @DataMagicAI  3 роки тому +2

      -Yes, you can use standardization for any type of data distribution
      -Normalization or Standardization never changes the distribution of your data.
      Hope I answered your questions.

    • @Iranvlog_
      @Iranvlog_ 3 роки тому

      @@DataMagicAI Thank you.

  • @alndr4u
    @alndr4u 3 роки тому +2

    Thank you so much.... How to convert standardized values between 0 to 100 ? Plz help me 🙏🏻

    • @DataMagicAI
      @DataMagicAI  3 роки тому +1

      I am not sure what's your purpose behind it. But you can use simple maths, multiply each scaled value bet ween 0 to 1 with 100.

    • @alndr4u
      @alndr4u 3 роки тому +1

      @@DataMagicAI z score values are between -3 to +3 range, then that multiply with 100, then values are remain -ve and +ve, but i want only 0 to 100 range

    • @alndr4u
      @alndr4u 3 роки тому

      @@DataMagicAI thank you for rply

  • @MohdFadzilHassanIsmail
    @MohdFadzilHassanIsmail Рік тому

    Hi. Can I perform this feature scaling before spliting training and testing dataset? Any effects regarding data leakage?

    • @DataMagicAI
      @DataMagicAI  Рік тому +1

      Perform feature scaling after dataset splitting to avoid data leakage and ensure realistic model evaluation.

  • @fernandoborges3044
    @fernandoborges3044 2 роки тому

    Thank you, very helpful to me, I liked it a lot!

  • @rmb1106
    @rmb1106 3 роки тому +1

    Hi I want to use normalisation for 4 numbers, to make it between 0&1 , but Xmin should be 0 and Xmax should be the max number from the 4 numbers we are taking.
    Can you tell me the code for that

    • @DataMagicAI
      @DataMagicAI  3 роки тому

      You can solve with simple math equation. ((Xmax - Xmin)/100)*(Xvalue-Xmin) = scaled Xvalue

    • @rmb1106
      @rmb1106 3 роки тому

      @@DataMagicAI I meant is Xmin =0( for all set of values) and our formular of normalisation (x - Xmin)/(Xmax-Xmin) becomes x/Xmax , so is it right to use this x/Xmax and call it normalisation

  • @sudippandit1
    @sudippandit1 4 роки тому +2

    Awesome videos

  • @davideje8841
    @davideje8841 3 роки тому

    Why do we still need to fit our test data, will it be improper to just transform both the train and test data after fitting only the train data?

    • @DataMagicAI
      @DataMagicAI  3 роки тому +1

      If you are training your model with scaled dataset and try to validate with test dataset which not scaled then you will get the wrong results. You always scale your test dataset along with train dataset so you will get the correct results for prediction/validation.

    • @davideje8841
      @davideje8841 3 роки тому +1

      @@DataMagicAI Alright, thanks

  • @ali.mazeed
    @ali.mazeed 3 роки тому

    thank you but where is method for Normalization to make the value between 0 to 1
    where is the code and how make it

  • @zainularifin-ll4th
    @zainularifin-ll4th 3 роки тому

    Umm sir where's the next step where after scaling we use some regression model?

    • @DataMagicAI
      @DataMagicAI  3 роки тому

      You may checkout the ML Algorithms playlist on Data Magic UA-cam channel for the same.

  • @AshokKumar-qh5vc
    @AshokKumar-qh5vc Рік тому

    sir you didn't told about normalisation

    • @DataMagicAI
      @DataMagicAI  Рік тому

      I will add new short episode on it