КОМЕНТАРІ •

  • @pankajgoikar4158
    @pankajgoikar4158 Рік тому +8

    You are amazing bro. Don't have words to thank you. you have cleared my many concepts. Lots of love from UK and god bless you. 😊

    • @HackersRealm
      @HackersRealm Рік тому +2

      Thank you so much for your kind words ❤️

  • @grandson_f_phixis9480
    @grandson_f_phixis9480 2 місяці тому +1

    Thank you very much sir!!

  • @negusuworku2375
    @negusuworku2375 5 місяців тому +1

    This is very helpful. Excellent.

  • @insight_generator
    @insight_generator 5 місяців тому +1

    This video helped me a lot. Thanks!

  • @ocraking
    @ocraking Місяць тому +1

    what an amazing video

  • @debangshubarua5345
    @debangshubarua5345 Рік тому +2

    Good vedio... Do i need check for all the numeric columns one by one and perform capping operation??????

    • @HackersRealm
      @HackersRealm Рік тому +1

      You can use a loop to do it for all numeric columns at once...

  • @sushmitarawat6438
    @sushmitarawat6438 11 місяців тому

    Too good....and simple thanks a lot☺️🙏🏼

    • @HackersRealm
      @HackersRealm 11 місяців тому +1

      Glad you like it sushmita!!!

    • @sushmitarawat6438
      @sushmitarawat6438 11 місяців тому

      @@HackersRealm could you suggest some paid internship which I can start off with the very next month

    • @HackersRealm
      @HackersRealm 11 місяців тому +1

      @@sushmitarawat6438 For ML based internship, it's better to compete in hackathons or contest to get internship.. You could checkout hackerearth, techgig, etc., for that

    • @sushmitarawat6438
      @sushmitarawat6438 11 місяців тому

      @@HackersRealm ok

  • @vietttt0104
    @vietttt0104 Рік тому +1

    Greate Tutorial!! Thanks a lot!! I have a question that How could I do it with the whole dataset? not a single one

    • @HackersRealm
      @HackersRealm Рік тому

      you can iterate the columns and process the whole data

    • @aniketlode4808
      @aniketlode4808 Рік тому

      @@HackersRealm So to iterate it we will be using for loop passing each column name as I??

    • @HackersRealm
      @HackersRealm Рік тому

      @@aniketlode4808 yeah

  • @DJnaidu22
    @DJnaidu22 3 місяці тому +1

    really a great explanation

  • @massoudkadivar8758
    @massoudkadivar8758 5 місяців тому

    Thank you so much,
    I have a question, do we need to do this process for each column one by one?

    • @HackersRealm
      @HackersRealm 5 місяців тому

      yes, that's correct, you can use loops to automate this.

  • @DJnaidu22
    @DJnaidu22 3 місяці тому +1

    Bruh I have a doubt..... please explain briefly..... These three techniques are used for trimming or capping outliers in the dataset...... But why don't we use only z-score to find outliers. Then what's the diff between these three techniques??

  • @adityachoudhari3596
    @adityachoudhari3596 2 роки тому +2

    Yo bro I m also learning ai and ml concepts I just need to work one some project or get the training in this
    Plz tell me if you can help

    • @HackersRealm
      @HackersRealm 2 роки тому +1

      check the iris dataset analysis project in the playlist for start

  • @mohamads9759
    @mohamads9759 2 місяці тому +1

    Very Great.

  • @titi-cu8dx
    @titi-cu8dx 6 місяців тому +1

    What about dealing with categorical columns in the context of outliers?

    • @HackersRealm
      @HackersRealm 6 місяців тому

      I don't think there will be outliers in categories

  • @santoryuu989
    @santoryuu989 2 роки тому

    what do you think is the best method out of these three ?

    • @HackersRealm
      @HackersRealm 2 роки тому

      You can use any method as it's producing similar results, but instead of deleting samples, trim it in the range

  • @Serene__Soul98
    @Serene__Soul98 2 роки тому

    Hii..my dataset has 19 columns and at least 10 colums shows outliers..
    So do I have to perform this process for every column each time?

    • @HackersRealm
      @HackersRealm 2 роки тому

      Yes it's better to do the process in a loop and fix it for better results

    • @avashchand9623
      @avashchand9623 2 роки тому

      @@HackersRealm Can you kindly show this process too. Searching for it everywhere can't find it.

    • @HackersRealm
      @HackersRealm 2 роки тому

      @@avashchand9623 what process you're referring?

    • @aniketlode4808
      @aniketlode4808 Рік тому

      @@HackersRealm I think he is asking for the process of looping the columns

    • @nihalkausar2215
      @nihalkausar2215 2 місяці тому

      Pls after I have handled each column outlets how do I save it and which data frame should I continue using

  • @karthika8610
    @karthika8610 Рік тому

    Which method is the most preferred?

    • @HackersRealm
      @HackersRealm Рік тому +2

      It's not about preference, it depends on where and which use case you're trying to solve

    • @madhulikasuman2803
      @madhulikasuman2803 3 місяці тому +1

      @@HackersRealm if there are 40% outlier then ?

    • @HackersRealm
      @HackersRealm 3 місяці тому

      @@madhulikasuman2803 it depends on the nature of data, need to understand the domain, and see why this is the case. We could do some data transformation like log transformation to change it

  • @ricesweat9951
    @ricesweat9951 8 місяців тому

    why you decided to use residual sugar as a column to find outliers? any tips and tricks on which columns should be used to find outliers within the dataset?

    • @HackersRealm
      @HackersRealm 8 місяців тому +1

      we can use boxplot or violinplot to find the outliers. You can see some dots outside the line which can be considered as outliers.

  • @nihsacinan19
    @nihsacinan19 10 місяців тому

    8:35 outliers=26

  • @Niyati_11
    @Niyati_11 7 місяців тому +1

    My df is empty while finding the outliers. Any idea why it is so?

    • @HackersRealm
      @HackersRealm 6 місяців тому

      which cell you faced the issue?