Handling categorical data

Поділитися
Вставка
  • Опубліковано 22 бер 2020

КОМЕНТАРІ • 18

  • @user-gc5xl4vp5r
    @user-gc5xl4vp5r 3 дні тому +2

    super sir ,thank you soo much

  • @bhulekhyag4189
    @bhulekhyag4189 Рік тому +1

    i have been working on it for long time , you made it simple .. thank you

  • @sasikala_chowdary
    @sasikala_chowdary Рік тому +1

    Time saver ,thank you so much🙏

  • @aartisharma4793
    @aartisharma4793 2 роки тому +2

    Thank you so much.
    These concepts were not that easy for me as you made these now.Any average learner can understand and implement these concept.

    • @sukamal_das
      @sukamal_das  2 роки тому

      Glad that I could help you 🙂👍

  • @gpoojitha2614
    @gpoojitha2614 4 місяці тому

    Can you please give a piece of code from which we can get the number of categorical columns and numeric columns in the whole dataset

  • @abdulwahabchudhary6269
    @abdulwahabchudhary6269 6 місяців тому

    Excellent work! I have been working on it for two days, but I did not grasp the main concept. However, after watching the video, I now understand the whole concept. Is my use of 'dose' correct? Also, please check the entire sentence

  • @JyotirmoyeeRoy
    @JyotirmoyeeRoy 2 місяці тому

    I'm getting true or false instead of 0 and 1 after applying the dummies. Why is that?

  • @mkeremyucedag
    @mkeremyucedag 3 роки тому

    Hey! Great video as always. I have a question for you. In the end you're doing fit_transform with for loop. How can I do it with map, list ? When I do list(map(le.fit_transform(df_cat),df_cat)) it gives this error :
    y should be a 1d array, got an array of shape (513, 2) instead.
    How would you do map,list as an alternative to for loop ?

    • @sukamal_das
      @sukamal_das  3 роки тому +1

      You can also use this technique -
      df_cat = df_cat.apply(lambda x : LabelEncoder().fit_transform(x))

    • @mkeremyucedag
      @mkeremyucedag 3 роки тому +1

      @@sukamal_das Oh, thanks! That works and surely is an easy way to work things. Wish you good luck, thanks again !

  • @shahfahad3682
    @shahfahad3682 2 роки тому

    Great explanation!
    I have a question though,
    When we apply label encoder and the categorical column has more than 3 unique values it assigns the value as 1,2,3,4 etc. Are there any chances that our model prioritizes the category which has a higher number compared to others?

    • @sukamal_das
      @sukamal_das  2 роки тому

      Yes you are right. To avoid this problem we can go for One Hot Encoding technique.

    • @shahfahad3682
      @shahfahad3682 2 роки тому +1

      @@sukamal_das but what if we have 100 categories? Then it would create 99 extra columns right?
      How do we handle this?

  • @SusmitaMelodies
    @SusmitaMelodies Рік тому

    Thankyou❤

  • @SusmitaMelodies
    @SusmitaMelodies Рік тому

    Im having error even after converting categorical values df value still shows object type