Hindi Machine Learning Tutorial 10 - Decision Tree

Поділитися
Вставка
  • Опубліковано 6 жов 2024
  • Decision tree algorithm is used to solve classification problem in machine learning domain. In this tutorial we will solve employee salary prediction problem using decision tree. First we will go over some theory and then do coding practice. In the end I've a very interesting exercise for you to solve.
    #MachineLearningHindi #PythonMachineLearning #MachineLearningTutorial #Python #PythonTutorial #PythonTraining #MachineLearningCource #DecisionTree
    Code: github.com/cod...
    To download csv and code for all tutorials: go to github.com/cod..., click on a green button to clone or download the entire repository and then go to relevant folder to get access to that specific file.
    Website: codebasicshub.com/
    Facebook: / codebasicshub
    Twitter: / codebasicshub

КОМЕНТАРІ • 83

  • @PankajVerma-p8y
    @PankajVerma-p8y 3 місяці тому

    my mode score is 98% .. Really amazing videos sir. Aap jo samjhate ho an maza aa jata hai and concept clear ho jata hai...

  • @poojabehera8675
    @poojabehera8675 4 роки тому +9

    sir, please make separate videos on Info gini, Gini Impurity & Information gain

  • @izainonline
    @izainonline Рік тому

    Many Thanks finally I found the clear guidance for example how and why transform the data into numeric and how to train,Inplace?,drop or add column reason.
    Thanks again

  • @muhammadjaffarrazadar967
    @muhammadjaffarrazadar967 4 роки тому +8

    waiting for you to upload Deep learning tutorials in hindi

  • @ajaykushwaha4233
    @ajaykushwaha4233 3 роки тому

    Sir aap itna accha explain kerte hain ki kya batayein. Sab kuch samajh aa jata hai.

  • @sanjayrajbanshi7698
    @sanjayrajbanshi7698 2 роки тому

    I got 80% accuracy by doing train_test_split method (Changed null value in 'Age' column and Converted 'Sex' column into dummy variable column).

  • @as-it-is6627
    @as-it-is6627 22 дні тому

    Wow, what an explanation.

  • @niku237
    @niku237 Рік тому

    Sir... ANN algorithm bhi kra dijiye hindi mei... hmare liye kaafi benefit hoga..

  • @ashishmachcha8626
    @ashishmachcha8626 4 роки тому

    Can you make more video of machine learning in python in hindi....it is very helping me.

  • @DEV-un3ok
    @DEV-un3ok 4 місяці тому

    For NaN(missing values) values I calculated Mean of Age column and replaced the NaN values in the Age column with the mean of Age column and when I split (train_test_split method)the data I got 80.44 % accuracy but when I did not split the data I got 97.41 % accuracy.
    But when I visualise my Decision tree then the one with the train_test_split method looks more accurate so which one is correct?

  • @waidapapa1514
    @waidapapa1514 11 місяців тому

    If we see the actual documentation of scitkit learn of label encoder we will know that label encoder ia used for target or output variable only.

  • @VishalMalik2023
    @VishalMalik2023 Рік тому +1

    my model predict without using train_test_split so they predict score is 97% , then i using train_test_split with test_size = 0.2 then they predict score is 82%

  • @nidhivishwakarma1339
    @nidhivishwakarma1339 Рік тому

    bro you are just amazing..

  • @rushikeshdargeai5154
    @rushikeshdargeai5154 4 роки тому +1

    97.97%
    Thank you!!!

  • @farman4164
    @farman4164 Рік тому

    Make a video on Deep learning pls..

  • @kainaatmakhani6550
    @kainaatmakhani6550 Місяць тому

    good job.

  • @Gemreviewer
    @Gemreviewer 6 місяців тому

    Hello sir, just wanted to confirm that, in the 4th step in the Jupiter notebook, instead of creating 3 objects for Label Encoder, we could have created just one object and the used it for the 3 columns. Was there any particular reason to use it 3 times? Please let me know

  • @abhasharma9455
    @abhasharma9455 3 роки тому

    Sir according to initial 1st minute how can we check data is distributed like this so that we will apply DT.
    Thank u very much Sir for great lectures.

  • @shahidmalik6107
    @shahidmalik6107 Рік тому

    very well explaind

  • @Ishowmeat8
    @Ishowmeat8 3 роки тому

    Everyday i visit this channel and i watched videos after liked it .

  • @rockfighter9974
    @rockfighter9974 Рік тому

    Really very nice video

  • @sudarshandev6369
    @sudarshandev6369 3 роки тому

    lajwaab sir awesome explanation

  • @ashkraze
    @ashkraze 2 роки тому

    i read that LabelEncoder should only be used for dependent variables only , can we use it for independent values also??

  • @debdyutimohanta9007
    @debdyutimohanta9007 2 роки тому

    There are lots of Nan values in age column

  • @NationalistNewsNetwork
    @NationalistNewsNetwork Рік тому

    How it behave on irrational input.
    pls reply

  • @varshaasabe1192
    @varshaasabe1192 Рік тому

    Hello sir, i am seen all your vedio and thanks for that , I understand but I am not understand how import csv file in R. Can me explain I am doing data science. T

  • @ashwiniawati2297
    @ashwiniawati2297 3 роки тому

    Hi Sir, you are doing really a great job. Kudos to you 🙌. I just solved the Titanic exercise with inputs Age, sex, Pclass and Fare. I got my score 0.75. Please advise about the score. Is it recommended to choose single column as an input and then check the score?

    • @owais20017
      @owais20017 2 роки тому

      please send source code

  • @abhasharma9455
    @abhasharma9455 3 роки тому

    Sir u explain great.

  • @musakhan9779
    @musakhan9779 2 роки тому

    Why is Decision Tree a regression model if it classifies. Isn't Regression model expected to have continuous data and prediction.

  • @govind1937
    @govind1937 4 роки тому

    sir firstly i did without splitting into train and test, then it gave score 0.9777
    and after splitting it gives 0.822222 taking .05 as test_size.
    and there is a strong correlation between Pclass and Fare, so we can use only one.

  • @uddiptasaikia3121
    @uddiptasaikia3121 Рік тому

    Sir , after coverting "male" and "female" to 1 & 2 respectively , it is not showing 1 and 2 while i type inputs.head()...it is showing NaN....please help.

  • @mazharbukhari786
    @mazharbukhari786 3 роки тому

    Sir, one question, from where you learnt?

  • @me_debankan4178
    @me_debankan4178 2 роки тому

    I have tried the exercise and each time i am getting more accuracy using Logistic regression than decision trees

  • @sagar8460830871
    @sagar8460830871 4 роки тому

    how to drop variable please explain correlation and p-value. i am totaly confuse

  • @neerajrokaya1132
    @neerajrokaya1132 2 роки тому

    Sir, I got 81.6% with test_size = 0.25 and random_state = 10 . I checked with your github repo and found that you have taken mean to fill the nan value in age column and got ans as 79.3%. I did the box plot and found out that there are many outliers. So, will you please explain in short about the mean and median selection.

    • @data_science_06
      @data_science_06 Рік тому

      mean is used for when data is normally distributed...
      median is used for when some outliers are existing in data.....
      mod is used .... when your data has category..... like:- 0,1,2,0,1,1 and male , female and so on....

  • @shivam7164
    @shivam7164 Рік тому

    some values in age are missing.

  • @apnadesicricket
    @apnadesicricket 4 роки тому

    Titanic program ka score aaya 0.98765 is it correct please tell first time i done the program from my own

    • @codebasicsHindi
      @codebasicsHindi  3 роки тому +1

      IT is correct

    • @apnadesicricket
      @apnadesicricket 3 роки тому

      @@codebasicsHindi thanks so much Sir, watching your videos great help for me, then i understand the ML in simpler way. Thanks again.

  • @prathameshmore5262
    @prathameshmore5262 2 роки тому

    Got 0.97 score on titanic dataset using decision tree

  • @akshatsingh6036
    @akshatsingh6036 4 роки тому

    sir tensorflow kb aaegi

  • @ashmitsingh5473
    @ashmitsingh5473 4 роки тому

    I got 1.0 is it fine..??

  • @anantsharma5330
    @anantsharma5330 4 роки тому

    I am getting 98.59% Model Score. I have Label Encoded all columns of inputs. Is it wrong?

  • @preetiranichaudhary9758
    @preetiranichaudhary9758 4 роки тому

    my score is 0.7867(input column is pclass and sex and target variable is survived) then i got score is 0.7867..is it correct or not???

  • @poojabehera8675
    @poojabehera8675 4 роки тому

    why my fit.transform() throwing error as:
    transform() missing 1 required positional argument: 'y'
    I have written corect code

    • @fooglefirst9738
      @fooglefirst9738 3 роки тому

      fit_transform()
      ye wala correct karloo aap ko aagayegaa

    • @oldisgold5803
      @oldisgold5803 2 роки тому

      mine too is throwing the same error..... Did you find way out of it?

    • @DEV-un3ok
      @DEV-un3ok 4 місяці тому

      df['column'] = column.fit_transform(df['column'])

  • @muhammadjaffarrazadar967
    @muhammadjaffarrazadar967 4 роки тому

    I tried to do it by my own and did it with 0.9797979797979798
    test scores. but I didn't bread the data into test and train. maybe that's why.

    • @codebasicsHindi
      @codebasicsHindi  4 роки тому +1

      Yes splitting data in test and train is important otherwise your model is biased and you get a high score

  • @DharmendraKumar-pf4fs
    @DharmendraKumar-pf4fs 4 роки тому

    My score is 0.977 is this correct for Titanic exercise .???

    • @codebasicsHindi
      @codebasicsHindi  4 роки тому

      Yup.97.7 is a pretty good score. Good job 👏👍

  • @mashalamjadkhan9351
    @mashalamjadkhan9351 2 роки тому

    please share the csv file

  • @DopeGaming719
    @DopeGaming719 4 роки тому

    got a score of 79.10 is it ok??

  • @soumyaprakashdasmohapatra7717
    @soumyaprakashdasmohapatra7717 4 роки тому

    got a score of 74.8% is it correct?

  • @mashalamjadkhan9351
    @mashalamjadkhan9351 2 роки тому

    Sir please share csv file argent

  • @karmakarishan56
    @karmakarishan56 2 роки тому

    result is 1

  • @shivanshsingh5555
    @shivanshsingh5555 4 роки тому

    0.82

  • @ajmeershahtricks6666
    @ajmeershahtricks6666 2 роки тому

    sir i got 1.0 score (accuracy) but im confused that what does it means ? is this the accuracy in %?

    • @mohammedmouizuddin4403
      @mohammedmouizuddin4403 2 роки тому +1

      1.0 means 100% accuracy rate
      and 0.98... , 0.95 ... means 98% , 95% accuracy rate ....

    • @ajmeershahtricks6666
      @ajmeershahtricks6666 2 роки тому

      @@mohammedmouizuddin4403 thank you bro❤

    • @owais20017
      @owais20017 2 роки тому

      please @Ajmeer Shah tricks send me sorce code

  • @subhamsaha2235
    @subhamsaha2235 3 роки тому

    Sir aapne to isme decision tree banaya hi nhi????????????????

  • @darshankachhadiya8014
    @darshankachhadiya8014 3 роки тому

    My model score is 0.81

  • @kamilazim5498
    @kamilazim5498 4 роки тому

    For Titanic Dataset
    Test_size = 0.3
    model.score = 0.8059701492537313

  • @anilkumarsharma8901
    @anilkumarsharma8901 2 роки тому

    Apney subscribe waley ko artificially intelligent ka support dilva do phir Duniya following karegi

  • @RahulVerma-ud1rr
    @RahulVerma-ud1rr 3 роки тому

    :D 97.75% accuracy

  • @gaurangsharma5428
    @gaurangsharma5428 3 роки тому

    My score is 0.997755310886644