Gini Index and Entropy|Gini Index and Information gain in Decision Tree|Decision tree splitting rule
- Published 13 Jan 2020
#GiniIndex #Entropy #DecisionTrees #UnfoldDataScience
Hi,
My name is Aman and I am a data scientist.
About this video:
How does a Decision Tree work? A Decision Tree recursively splits training data into subsets based on the value of a single attribute. Splitting stops when every subset is pure (all elements belong to a single class).
This video explains Gini and entropy with examples.
Below questions are answered in this video:
1. What is the Gini index?
2. What is information gain?
3. What is entropy?
4. What are the tree splitting criteria?
5. How is a decision tree split?
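As a quick companion to the formulas discussed in the video, here is a minimal Python sketch of the two impurity measures. This is only an illustration of the standard textbook formulas, not code from the video:

```python
import math

def gini(counts):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

def entropy(counts):
    """Entropy: -sum(p * log2(p)) over the non-empty classes."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c)

print(gini([5, 5]))     # 0.5 -> maximally impure binary node
print(gini([10, 0]))    # 0.0 -> pure node
print(entropy([5, 5]))  # 1.0 -> maximally impure binary node
```

A pure node scores 0 under both measures; a 50/50 binary node scores 0.5 under Gini and 1.0 under entropy.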
About Unfold Data Science: This channel helps people understand the basics of data science through simple examples, in an easy way. Anybody without prior knowledge of computer programming, statistics, machine learning, or artificial intelligence can get a high-level understanding of data science through this channel. The videos uploaded are not very technical in nature, so they can be easily grasped by viewers from different backgrounds as well.
Join Facebook group :
groups/41022...
Follow on medium : / amanrai77
Follow on quora: www.quora.com/profile/Aman-Ku...
Follow on twitter : @unfoldds
Get connected on LinkedIn : / aman-kumar-b4881440
Follow on Instagram : unfolddatascience
Watch Introduction to Data Science full playlist here : • Data Science In 15 Min...
Watch python for data science playlist here:
• Python Basics For Data...
Watch statistics and mathematics playlist here :
• Measures of Central Te...
Watch End to End Implementation of a simple machine learning model in Python here:
• How Does Machine Learn...
Have question for me? Ask me here : docs.google.com/forms/d/1ccgl...
There is a mistake in your video:
You said to choose the attribute that has less information gain, but actually we have to choose the one that has high information gain...
Yes Naat, thanks for pointing that out. I have pinned the comments related to it on the video for everyone's benefit.
@@UnfoldDataScience Pleasure sir
If you are saying that we have to choose high information gain, then as per the video we should take the impure node. For a pure node, the Gini would come out to 0 and hence 0 IG. Isn't something wrong?
At what time that has been said and corrected?
@@DK-il7ql At 10:37 he said "low information gain" by mistake instead of "high information gain".
I usually don't like commenting on YouTube videos. But for this one, I felt like I had to show appreciation, because this video was truly extremely helpful. University professors spend hours explaining what you just explained in 11 minutes. And you are the winner. Perfect explanation.
Thank you so much!!!!
I appreciate it Ahmed. Your comments motivate me :)
Institutes spend two hours explaining these two concepts and you made it clear in a few minutes. Excellent explanation.
Thanks a lot :)
@@UnfoldDataScience I agree
This has become my favourite channel for ML/Data Science topics,thank you very much for sharing your knowledge
Thanks Jehan, your words are my motivation.
Wow! Not only was your explanation amazing but you also answered every single comment! True dedication. Keep it up!
Thanks a ton Zain.
If i feel any concept is hard to understand, first thing i do is search for your videos. Very intuitive and easy to understand. Thank you so much!
Your comments are my motivation Akhil. Thanks a lot. Happy learning. Tc
Excellent, to the point, good examples. Great work!
Best channel to learn ML and Data science concepts. Thank you sir
Thanks Indrajit. Kindly share video within data science groups if possible.
I have an assignment due tomorrow and this helped a lot!
You have really good explanation skills, thank you man, I finally understand it.
Most Welcome :)
With your clear explanation, I finally understand what Gini index is. Thank you so much!
You are welcome. happy learning. Stay Safe!!
The simplest and best explanation so far.
Glad it was helpful Shyam.
Thanks bro... explained in an easy manner...
short simple and sweet, thank you so much
You're welcome Kunal.
Finally I am getting some clear explanations for various concepts.
thanks Indra.
Thanks for the video ! It was really clear and well executed. Would have been great to detail the entropy calculation though, I find it a bit elusive without an example
thanks for clear and easy explanation
This is brilliant. Thank you so much!
Thanks Abhijit. Keep Watching. Stay Safe!!
Your first video that I came across. Subscribed!
Thanks Vishesh.
I just discovered this channel what a gem
Thanks a lot. Please share it with others in various data science groups as well.
This is On point, thank you so much.
You are so welcome Bhargav.
Thank you so much sir. Before watching this video I had watched 4 videos related to impurity, but everyone was mixing up entropy and impurity, and it was not really clear what exactly the formula is and how it works. But after watching your video it is totally clear now. Thank you for this beautiful and clear explanation.
Glad you understood
Very good and clear. I'm French-speaking and I understood almost everything.
Thanks Hassan.
Great Explanation !! very helpful . Thank you :)
Glad it was helpful!
thank you so much, this helps me a lot!!!
I'm so glad!
Thank you for this video very helpful
love it, very clear explanation
Thanks Eider. Happy learning. Tc
Amazing explanation sir
Crystal Clear Sir!! Keep Going!!
Thank you Anandram.
I appreciate your concepts for Gini and Entropy
Thanks Awanish.
Awesome video.. Thank You so much!
Thank you.
Sir, your explanation really helps me, thank you.
You are welcome.
Simple & clear
Thanks a lot.
Thanks for this clear and well-explained Gini index video... Thanks...
Glad it was helpful!
Your explanation is awesome, thanks.
Thanks a lot for your valuable feedback.
Very clear explanation and very helpful.
Glad it was helpful Deepika.
Thank you for your wonderful explanation.
Please make a video on PSI and KS index.
Will do soon Sager. Thanks for feedback.
You are doing a great job, sir.
Thanks a lot.
It was very informative, Sir. Thank you :)
Most welcome Prerna.
Thanks man
Amazing explanation
Thanks Ravi.
Thank you, sir!
Very welcome!
thank you so much
Awesome work and very intuitive explanation! Thank you. I have an exam in Data Mining and you helped me sir!!
Glad it helped! Happy Learning!
Thank you, well explained
Glad it was helpful!
Thank you, no one could have done better
Your comments mean a lot to me.
great explanation
Glad it was helpful!
Great video
Thanks a lot.
Nicely explained....! Subscribed :)
Thanks Lalit. So nice of you :)
Very nice explanation and icing on the cake for comparing their performance at the end.
Just to confirm, is Gini/IG only for classification?
For the regression trees we would use loss functions like sum of squared residuals?
That's a good question. Since it's based on probability, it is applicable to classifiers. For regression, we minimize something like SSE or another error measure.
@@UnfoldDataScience Hi sir, as per my knowledge, "Information Gain" is used when the attributes are categorical in nature, while the "Gini Index" is used when the attributes are continuous in nature.
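As a rough illustration of the classification-versus-regression point above, here is a small pure-Python sketch with made-up labels and values (not from the video): classification trees score a split with an impurity measure such as weighted Gini, while regression trees score it with an error measure such as SSE:

```python
def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def sse(values):
    """Sum of squared errors around the node mean (regression score)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)

# Classification: score a split by the weighted Gini of the children
left, right = ["yes", "yes", "yes"], ["no", "no", "yes"]
n = len(left) + len(right)
weighted_gini = len(left) / n * gini(left) + len(right) / n * gini(right)

# Regression: score a split by the total SSE of the children (lower is better)
left_vals, right_vals = [10.0, 11.0, 9.0], [25.0, 27.0, 23.0]
total_sse = sse(left_vals) + sse(right_vals)
```

In both cases the tree prefers the candidate split with the lowest score; only the scoring function differs.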
Great video !
Thanks for the visit
Great content.
Thank you.
Thanks a lot
Welcome
Thanks a lot!
You're welcome!
Thank you
Welcome Jarrell.
Very Good!!!
Thank you Chris. happy learning. stay safe. tc
Good teaching
Keep watching
So, the only difference between Gini and information gain is the performance speed, right? I assume that with the same decision-making state and data, both Gini and information gain will pick the same best attribute, right?
Great video btw!
That is correct. Also the internal mathematical formula is different.
Thank you ❣️
Welcome.
very well explained
Thanks for watching Subhangi.
Hi Great explanation. Thank you so much. Do you have any videos explaining the criteria for Decision Tree regression?
Thanks a lot. for Regression, not yet, will upload soon.
Amazing explanation Aman. I have one doubt: suppose there are 5 columns (4 independent and 1 target). For the split I have used columns 1, 2, 4, 3 and another person is using 3, 2, 1, 4. Then on what factors can we decide whether my splits are best or the other person's splits are best?
It's the algorithm's decision which columns to use.
Awesome Explanation, very sharp! I have 2 questions:
1. Since this algorithm calculates Gini index for ALL splits in EACH column, is this process time-consuming?
2. What if the algorithm finds TWO conditions where GINI Index is 0. Then how does it decide which condition to split on?
Thank you in advance!
1. It is compute-intensive, but internally it does not happen one by one for numerical columns; the algorithm smartly figures out in which direction it should move. For categorical columns it happens one by one and is time-consuming.
2. A Gini of 0 means homogeneous sets, hence no further split will happen.
Hi! i want to make sure about gini index. You said that "criteria of the split will be selected based on minimum GINI INDEX from all the possible condition". Is it "gini index" or "weighted gini index"? Thanks a lot tho! Learn a lot from this video!
Thanks Steven. "Gini Index".
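For what it's worth, many textbook descriptions do weight each child's Gini by its share of the samples when comparing candidate splits. A small sketch with made-up splits of the same six labels:

```python
def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def weighted_gini(left, right):
    """Each child's Gini weighted by its share of the samples."""
    n = len(left) + len(right)
    return len(left) / n * gini(left) + len(right) / n * gini(right)

# Two made-up candidate splits of the same six labels
split_a = (["yes", "yes", "yes"], ["no", "no", "no"])   # both children pure
split_b = (["yes", "no"], ["yes", "yes", "no", "no"])   # both children mixed

scores = {"A": weighted_gini(*split_a), "B": weighted_gini(*split_b)}
best = min(scores, key=scores.get)  # "A": its weighted Gini is 0.0
```

Split A separates the classes perfectly (weighted Gini 0.0), so it wins over split B (weighted Gini 0.5).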
Thankyou sir
Welcome Siva.
Hello Aman,
Hope you are well. I have a question. Hope you can help me here.
If probability (P) = 0,
then Gini impurity becomes 1,
as per the formula. Then why does it always range from 0 to 0.5?
Thank you,
Subhajit
thanks
Welcome.
Good one again! Please add more technical videos as well, where the audience is not laymen but people who are into data science.
Thanks for your feedback. I'll definitely cover advanced topics as we move forward with subsequent topics.
Thank you so much sir please do some projects
Thanks Vishal.
✌🏻✌🏻
Great explanation. I have a question: can the Gini index be negative?
Hi, no it can not be.
Hello, very insightful. You almost explained the best times to use either of the criteria. Can you shed more light on that, i.e. the best kind of criterion to use for the data in a model?
Hi Anthony, it is usually not easy to say beforehand which method (Gini/entropy) works on what kind of data. Usually we check with various options to see model performance and then choose one. Hope this clarifies. Thank you.
@@UnfoldDataScience Yeah Thank you.
can i get your email? I'd like to stay in touch
Sure, it's there on my YouTube.
Sir kindly explain entropy in detail just like the way you presented gini index
Sure Karthik. Keep watching.
Does CART go through all the possible numerical values under "loan" to find the best condition? If you have a large amount of data, wouldn't it be very slow?
That is a good question, thanks for asking. In general, for a numerical variable, the first split point is chosen randomly and then the point is optimized based on the direction in which the loss function is moving. Please note, the loss in this case is the node purity after the split.
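Implementations differ here; one common textbook approach is to scan candidate thresholds at the midpoints between consecutive sorted values and keep the one with the lowest weighted Gini. A sketch with hypothetical loan amounts and default labels (illustration only):

```python
def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def best_threshold(values, labels):
    """Scan midpoints between consecutive sorted unique values and
    return the (threshold, score) pair with the lowest weighted Gini."""
    uniq = sorted(set(values))
    best = (None, float("inf"))
    for lo, hi in zip(uniq, uniq[1:]):
        t = (lo + hi) / 2
        left = [l for v, l in zip(values, labels) if v <= t]
        right = [l for v, l in zip(values, labels) if v > t]
        n = len(labels)
        score = len(left) / n * gini(left) + len(right) / n * gini(right)
        if score < best[1]:
            best = (t, score)
    return best

# Hypothetical loan amounts and default labels
amounts = [100, 150, 180, 220, 260, 300]
labels = ["no", "no", "no", "yes", "yes", "yes"]
print(best_threshold(amounts, labels))  # (200.0, 0.0)
```

With n distinct values there are only n - 1 candidate thresholds per feature, which is why the search stays tractable even on large data.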
I have one question Aman: at the root node, is the Gini or entropy high or low?
Aman, Can you please explain entropy also with an example like you did for Gini Index
Yes Prasanth, I will try to cover that topic in one of the upcoming video.
Thank you Aman
Just to make it clear, the Gini index ranges from 0 to 0.5 and not 0 to 1. Jump to the video at 7:10.
Yes, this is a common comment from many users. You are right Abhishek.
Which one to choose, like how by seeing the data I can assume, what we can use gini or IG?
Can't decide in advance; it's more trial and error (there are some guidelines though).
Good explanation, but a correction is needed. Gini oscillates between 0 and 0.5. The worst split could be half positive, half negative; the Gini impurity for that wing is 0.5, and the overall weighted Gini would also be 0.5.
It is entropy that oscillates between 0 and 1.
You are right Anil. This feedback is coming from other viewers as well; maybe I explained this part wrong in the video. I am pinning your comment to the top for everyone's benefit. Thanks again.
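The worst binary case described above can be verified directly:

```python
import math

p = 0.5  # worst binary case: half one class, half the other
worst_gini = 1 - (p ** 2 + (1 - p) ** 2)                          # 0.5
worst_entropy = -(p * math.log2(p) + (1 - p) * math.log2(1 - p))  # 1.0
```

So for a two-class problem, Gini peaks at 0.5 while entropy peaks at 1.0, both at p = 0.5.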
Can you show one numerical example using entropy? when the formula starts with a negative sign, how can the value be positive? Just curious.
Because log(x) is negative for 0 < x < 1, the leading minus sign makes the value positive.
Indeed the math is quite interesting. Thanks for sharing.
You are welcome Abhishek.
I'm a bit confused between Gini and Entropy. I mean is it necessary to use both methods while analyzing or we can go for any one of them?
We have to use only one of them. Which one to choose depends on data.
Depends on the case; both are not to be used together.
sir how you choose the loan amnt as root node ?we have to find gini for all columns and then select the root node?
Hi Amna, this is a good question. Thanks for asking. Yes, for all columns, and then select the optimal split.
Can you help to explain intuitively the Entropy equation
But if we have datasets with more columns than in this example, then how do we decide which input column should be split?
Answered.
Do we have to calculate both Gini and entropy to figure out which is best for the dataset??
Only one at a time.
Very informative video sir. I would like to know whether we have to calculate the Gini index/entropy manually if we build a decision tree using RStudio? I basically want to know what to do after getting the decision tree in RStudio: should I stop there and report the decision tree as it is, or prune it? Can you please explain the concept of pruning a regression tree and a classification tree in RStudio using a simple example? It would be of great help 😇 thank you. Kindly revert.
Hi Ruqaiya, very good questions:
1. You do not need to calculate it manually - the tool will calculate it.
2. After getting the tree, your model is fit; you can use it for prediction.
3. You must prune your tree - otherwise it may overfit.
4. I will explain pruning in separate video.
Nice presentation.. Keep going....
Thanks a lot.
So if I am using the C5.0 algorithm, which separation technique will be used?
Entropy, for measuring purity.
Well sir, how does the root node selection criterion work if two splits share the same lowest Gini index value?
Happens very rarely, Geethanjali.
I think we select the split with the highest information gain when using entropy. Please correct me if I'm wrong.
You are right. When an internal node is split, the split is performed in such a way that information gain is maximized.
Thanks Abdo. Yes, maximum IG is considered for the split. I probably missed including that in the video.
@@UnfoldDataScience You are welcome. I also got some new information from your video.
It is a nice tutorial, sir! But how could such a category come true, since you made it greater than or equal to 200, and it should be inclusive in the Gini index?
Yes, that mistake I accepted already 🙂
Firstly sir, as far as I know, the higher the information gain, the better the split.
And I want to know: are any of them for continuous variables?
Higher IG is better
Buddy, Gini does not lie between 0 and 1; it's entropy that lies between 0 and 1.
Gini is at most 0.5, so it always lies between 0 and 0.5.
I think yes, Gini lies between 0 and 1. Please help me with more details if you disagree.
I calculated the Gini index for a (4, 2) split, and it came out as 4/9. Shouldn't it come close to 1, since it is the worst-case scenario?
You need to check with the data and calculate; however, it is not always the case that it will be close to 1.
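For reference, the arithmetic for a (4, 2) split does give 4/9, and the binary worst case, a (3, 3) split, tops out at 0.5 rather than 1:

```python
def gini(counts):
    """Gini impurity from class counts: 1 - sum of squared proportions."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

print(gini([4, 2]))  # 0.444... = 4/9
print(gini([3, 3]))  # 0.5, the two-class maximum
```

So 4/9 is already close to the worst a two-class node can get; values near 1 only appear with many classes.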
which model has less bias and high variance-logistic, decision tree or random forest? can you please help
Decision tree - high variance, low bias.
Logistic regression - high bias, low variance.
Random forest - tries to reduce the high variance of decision trees; bias is low.
@@UnfoldDataScience Thank you very much. Can you also share the reasoning behind this, or a link where I can understand it?
Sir, I am confused regarding the selection criterion for the root node. Somewhere I have studied that the feature whose I.G. value is maximum will be selected as the root node, but here you have said that the one whose I.G. is less will be selected as the root node... I am confused.
That's a good question.
Entropy and IG are related. Understand it like this: entropy should be less and IG should be more.
IG from a split = entropy of parent node - entropy of child nodes created.
Here, the decision tree will try to split in such a way that IG is maximum; in other words, entropy is reduced to the maximum extent. Hope it's clear now.
@@UnfoldDataScience Thanks for this response, but is it true that the feature whose gain value is maximum will be selected as the root node, and after that splitting takes place based on that root node (feature)? If we get a pure split then no further splitting takes place, but if we get an impure split then splitting will take place based on the feature whose gain is second highest among those features. Is that how it works?
@@UnfoldDataScience And if possible, please give me your email id. I have a few more questions; I need to send images of some points so that you can help me out.
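A quick worked example of the parent-minus-children relationship discussed above, with made-up counts:

```python
import math

def entropy(counts):
    """Entropy from class counts: -sum(p * log2(p)) over non-empty classes."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c)

# Parent node: 5 yes / 5 no, split into (4 yes, 1 no) and (1 yes, 4 no)
parent = entropy([5, 5])                                  # 1.0
children = 0.5 * entropy([4, 1]) + 0.5 * entropy([1, 4])  # ~0.722
info_gain = parent - children                             # ~0.278
```

The split reduces entropy from 1.0 to about 0.722, so the information gain is about 0.278; the tree prefers whichever split makes this number largest.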
How does the Gini index range from 0 to 1? For the best case it is 0 and for the worst case it is 0.5, so how is that possible? Please explain.
The coefficient ranges from 0 (or 0%) to 1 (or 100%).
At 10:38, where the information gain is high, is that where we try to split the node, right??
That is a good question. The formula you see at 10:38 is for the entropy of a node.
Information gain for a split = entropy of node - entropy of child nodes after the split.
The decision tree splits at the place where the information gain is highest. In other words, the decision tree splits where entropy is reduced to the largest extent.
Sir, is the range of the Gini index from 0 to 1 or 0 to 0.5? I am confused.
See previous comments; we have discussed it.