Xgboost Classification Indepth Maths Intuition- Machine Learning Algorithms🔥🔥🔥🔥

Krish Naik

Published 5 Feb 2025
XGBoost is a decision-tree-based ensemble Machine Learning algorithm that uses a gradient boosting framework. In prediction problems involving unstructured data (images, text, etc.) artificial neural networks tend to outperform all other algorithms or frameworks.
#xgboostclassifier
#xgboost

COMMENTS • 181

@krishnaik06 4 years ago ⁺⁸¹
We are near 250k. Please subscribe to my channel and share it with all your friends. :)
@_curiosity...8731 4 years ago ⁺¹
Krish Naik, please make a video on decision tree pruning with mathematical details
@ArunKumar-sg6jf 4 years ago ⁺¹
LGBM is missing
@yashkhandelwal3877 4 years ago
@@tamildramaclips8548 Depends on your college. Which college with these branches are you talking about?
@yashkhandelwal3877 4 years ago ⁺¹
@@tamildramaclips8548 You should definitely go with ECE. Since AI DS is a very new branch, there is no guarantee how your college would groom the students in this branch. Also, your college is not a national-level college, so you shouldn't take any risk. That's my suggestion.
@hirdhaymodi 4 years ago
Sir, could you make a video on a roadmap for machine learning engineers?
@animeshsharma7332 4 years ago ⁺⁹⁹
Man, this guy is now coming in my dreams. Who else has been binge-watching his channel for months?
@gauravpatil2926 4 years ago
😂😂
@thepresistence5935 3 years ago
I am learning data science from him
@geekyprogrammer4831 3 years ago ⁺³
Same here 😂😂😂 But this man should be given a Nobel Prize for inspiring the present and future generations!
@gandhalijoshi9242 3 years ago ⁺¹
I have started following his machine learning series, and it's very nice.
I am also doing a data science course simultaneously. His videos are helping a lot.
@shaelanderchauhan1963 2 years ago ⁺²
HAHAHAHA! You are being haunted by Ghost Naik
@bhavikdudhrejiya852 3 years ago ⁺⁷⁶
Great video. Understood in depth.
I have jotted down the processing steps from this video:
1. We have the data
2. Construct the base learner
3. The base learner predicts a probability of 0.5; compute the residuals
4. Construct the decision tree as follows
Computing Similarity Weight = (∑ Residual)^2 / (∑ P(1-P) + lambda)
- Compute the similarity weight of the root node
- Compute the similarity weight of the left-side decision node and its leaf nodes
- Compute the similarity weight of the right-side decision node and its leaf nodes
Computing Gain = Leaf1 Similarity Weight + Leaf2 Similarity Weight - Root Node Similarity Weight
- Compute the gain of the root node and the left-side decision node and its leaf nodes
- Compute the gain of the root node and the right-side decision node and its leaf nodes
- Compute the gain for the other feature combinations of decision node and leaf nodes
- Select the root node, decision node and leaf nodes that have the highest gain
5. Predicted probability = Sigmoid(log(odds) of the base learner's prediction + learning rate × (prediction of the decision tree))
6. New residual = Actual value - Predicted probability
7. Repeat steps 2 to 6; by the end of the iterations the residuals will be minimal.
8. Predict on the test data using the model from the iteration with minimal residuals
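The similarity weight and gain from step 4 above can be sketched in a few lines of Python. This is only an illustrative sketch, not the XGBoost library's implementation: the function names, the residual values, and the assumption that every row starts at a previous probability of 0.5 are all hypothetical.

```python
def similarity(residuals, probs, lam=0.0):
    """Similarity weight: (sum of residuals)^2 / (sum of p*(1-p) + lambda)."""
    numerator = sum(residuals) ** 2
    denominator = sum(p * (1 - p) for p in probs) + lam
    return numerator / denominator


def split_gain(left_res, right_res, prev_prob=0.5, lam=0.0):
    """Gain of a split: left similarity + right similarity - root similarity."""
    left_p = [prev_prob] * len(left_res)
    right_p = [prev_prob] * len(right_res)
    root = similarity(left_res + right_res, left_p + right_p, lam)
    return (similarity(left_res, left_p, lam)
            + similarity(right_res, right_p, lam)
            - root)


# Hypothetical residuals (actual label minus 0.5) falling into each leaf.
left = [-0.5, 0.5, 0.5, -0.5]
right = [0.5, 0.5, -0.5]

gain = split_gain(left, right)
print(round(gain, 4))  # 0.1905, i.e. 0 + 0.3333 - 0.1429
```

Passing a larger `lam` grows the denominators, which shrinks the similarity weights and, in this example, the gain as well.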
@manojsamal7248 3 years ago
What if the output has several classes (0, 1, 2, 3)? The average will be 1.5, which is more than 1, so it can't be a probability like the 0.5 given to the base learner. What should we do in that case?
@pawanthakur-df2yk 3 years ago
Thank you🙏
@manojrangera 2 years ago ⁺¹
@@manojsamal7248 Yes bro, same question. Did you get the answer? Please let me know.
@manojsamal7248 2 years ago
@@manojrangera not yet bro
@manojrangera 2 years ago ⁺⁵
@@manojsamal7248 I was thinking that if there are 4 classes then the probability will be 1/4 = 0.25, and if there are 5 then 1/5 = 0.20, because we are calculating a probability. I will confirm this, but I think this is right.
@johnnyfry2 4 years ago ⁺¹³
Great work Krish. Don't ever lose your passion for teaching, you're a natural. I appreciate how you simplify the details.
@yashkhandelwal3877 4 years ago ⁺¹³
Hats off to you Krish for doing so much hard work so that we can learn each and every concept of ML and data science!
@nareshjadhav4962 4 years ago ⁺⁷
I have been desperately waiting for this for the last 7 months... now I will complete the machine learning playlist💥
Thank you Krish.. God bless you😀
@joeljoseph26 3 years ago ⁺⁸
Guys, please watch out for the mistake. There is a mistake made at 16:10, i.e. for credit >50, (G,B) = {-0.5, 0.5}: it's not three, there are only two. The information gain for the right side is 0.67. However, you chose the right node.
By the way, your teaching is very simple and understandable. Keep doing more videos. Love your content.
@moindalvs 2 years ago ⁺⁶
Thanks a lot for everything you do. You turned off the fan so that it wouldn't interrupt the audio, and you were sweating and breathing heavily. With all this trouble and hardship, you deserve more. I wish you success and a healthy and prosperous life.
@sandipansarkar9211 4 years ago ⁺¹
Very, very important for cracking interviews at product-based companies. Great explanation too. Thanks
@amitsahoo1989 4 years ago ⁺¹¹
Hi Krish, I have been watching your videos for the last few months and they have helped me a lot in my interviews. A special thanks from my end. In this video, at 10:54, 0.33 - 0.14 should be 0.19.
@gshan994 4 years ago
Yes indeed. By the way, were you a fresher when you went for the interview?
@mrzaidivlogs 4 years ago ⁺⁴⁰
How do you stay so focused and strong, and learn everything so efficiently?
@yasharya1066 1 year ago ⁺¹
Nation wants to know🙃
@HistoryUnlocked-fi3er 2 months ago
Willpower
@yashkhant5874 4 years ago ⁺⁴
Great explanation, sir... keep contributing to the community. We love your videos, and most importantly, the best thing is that you are sharing your experience.
@mohitjoshi4209 4 years ago ⁺²
So much to learn from a single video, hats off to you sir
@dhruven19 4 years ago ⁺⁷
Just what I was waiting for 🔥
@marijatosic217 4 years ago ⁺³
This was amazing, I literally feel like I'm sitting in your class at a uni.
@felixzhao9070 3 years ago ⁺²
This is pure gold! Thanks for the tutorial!
@sajidchoudhary1165 4 years ago ⁺²
I am the happiest person to see these videos, thank you
@shashwattiwari4346 3 years ago ⁺²
"Day 1 or 1 Day your Choice" Thanks a lot Krish!
@islamicinterestofficial 3 years ago
What does this mean?
@narendradamodardasmodi3286 4 years ago ⁺⁸
Thanks, Krish, for leading the nation on its AI journey.
@ajayrana4296 4 years ago
Idiot, give us jobs too
@abhishek_maity 4 years ago ⁺³
Great.... Clear explanation !! Thanks a lot 😄
@ShahnawazKhan-xl6ij 4 years ago ⁺³
Great
@frozen1860 4 years ago
Sir, the way you teach us is better than any varsity class. Please do a practical implementation of XGBoost; it will be very helpful for us...
@ishitachakraborty1362 4 years ago ⁺¹³
Please do an in-depth maths intuition video on CatBoost
@BatBallBites 4 years ago
agree
@thisismuchbetter2194 4 years ago ⁺¹
I don't know why people don't talk about CatBoost and LightGBM much..
@stabgan 4 years ago ⁺¹
Congratulations on your new job at E&Y. Checked you on LinkedIn. Very impressive profile.
@antonym9744 4 years ago ⁺²
Amazing !!!
@nitinahlawat2479 4 years ago
Really the Bhishma Pitamah of data science🙏 Respect you a lot👍
@nukulkhadse5253 4 years ago ⁺²
Hey Krish, you should also have a video about Similarity Based Modelling (SBM) and the Multivariate State Estimation Technique (MSET). They have been widely used in industry since the 90s, and there are many research papers to validate that. They also calculate similarity weights and residuals.
@Kiddzzvideos 1 year ago ⁺³
Hi, I have one doubt: for P(1-P) + lambda in the denominator of the similarity weight, if the residual is -0.5, should it be 0.5(1-(-0.5)) = 0.75? Or does the negative sign not matter?
@sheikhshah2593 1 year ago
Great sir🔥🔥
@amitupadhyay6511 3 years ago
It's tough to understand on the first attempt, but thanks for giving the outline so clearly. I will watch it until I understand it and can implement it from scratch.
@MrPetarap 6 months ago
Lovely explanation!
@Mazree152 4 months ago ⁺¹
16:33 In my opinion there is a mistake in the calculations.
It should be computed for (>50K) but G & B are also included from
@adityagupta8901 1 month ago
I also noticed that; I guess that is a mistake
@mohamedgaal5340 1 year ago
Thank You, Krish. Well explained!
@raneshmitra8156 4 years ago ⁺¹
Super explanation
@mihirjha1486 1 year ago
Loved It. Thank You!
@vishnukv6537 3 years ago
Sir, you are so pleasant and amazing at teaching
@Amansingh-tr1cf 4 years ago
The most awaited video
@muhammadsaqib2961 3 years ago
Quite amazing and clear explanation
@ppersia18 4 years ago ⁺⁶
1st view 1st like krish sir op
@govind1706 4 years ago ⁺²
Finally !!!!
@ayanmullick9202 2 years ago
You are a legend, sir.
@pulakdas3216 6 months ago
It started well but I got lost by the end of the video. Can you please prepare something simpler, as you did for AdaBoost and gradient boosting?
@arshaachu6351 11 months ago ⁺¹
Are there any detailed videos about the AdaBoost regressor and the gradient boosting classifier? Please help me
@nothing8919 4 years ago
Thank you a lot, sir, you are my best teacher
@jamalnuman 1 year ago
great
@modhua4497 3 years ago ⁺²
Good! Could you make a video explaining the difference between XGBoost and gradient boosting? Thanks
@datakube3053 4 years ago ⁺³
thank you so much
@saptarshisanyal4869 2 years ago
StatQuest Light !!!!
Fantastic effort though.
@mohittahilramani9956 2 years ago
Seriously, thank you so much
@bayazjafarli3867 2 years ago ⁺¹
Hi, thank you very much for this explanation! Great video! But I have one question. At 19:39 you first wrote 0, which is the probability of the first row, then you added learning rate × similarity weight. Instead of 0, shouldn't we write 0.5, which is the average probability from the first (base) model, i.e. 0.5 + learning rate × similarity? Please correct me if I am wrong.
@rutvikvatsa767 1 year ago
The base model's value comes from putting the first probability (0.5) through log(odds), shown at the bottom right corner. Hence it is 0.
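A quick numeric check of the point above, as a sketch only: the learning rate and leaf value here are hypothetical numbers, not values taken from the video.

```python
import math


def log_odds(p):
    """Convert a probability into log(odds)."""
    return math.log(p / (1 - p))


def sigmoid(z):
    """Convert log(odds) back into a probability."""
    return 1 / (1 + math.exp(-z))


base_prob = 0.5
base_log_odds = log_odds(base_prob)  # log(0.5 / 0.5) = log(1) = 0
print(base_log_odds)                 # 0.0

learning_rate = 0.3  # hypothetical
leaf_value = 1.0     # hypothetical output of the leaf this row lands in

# The update happens on the log(odds) scale, then is squashed back to a probability.
new_prob = sigmoid(base_log_odds + learning_rate * leaf_value)
print(round(new_prob, 4))            # 0.5744
```

So the 0 in the sum is the log(odds) of 0.5 rather than a probability; adding 0.5 directly would mix the two scales.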
@alokranjanthakur5746 4 years ago ⁺⁴
Sir, can you suggest some NLP projects using Python? I mean with live implementation
@IamGaneshSingh 3 years ago
This video is "pretty much important!"
@REHAN-ANSARI- 2 years ago
XG-Boost is the secret of my energy
@brunojosebertora7935 3 years ago
Krish, I have a question:
when you compute the output value you are using the similarity weight. I think that is incorrect for classification, isn't it?
To compute the output you shouldn't square the residuals.
THANKS for the video!!
@RahulKumar-hb8cl 4 years ago ⁺²
Sir, how will the probability value (0.5 for the base tree) be updated in each tree?
@jainitafulwadwa8181 3 years ago
The similarity score is not the output value; there is a different formula for calculating the output from the residuals. You just have to remove the square in the numerator of the similarity score formula.
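To make the distinction above concrete, here is a small sketch that puts the two formulas side by side. The function names and residuals are hypothetical, and each row is assumed to have a previous probability of 0.5.

```python
def leaf_similarity(residuals, prev_probs, lam=0.0):
    """Used for choosing splits: (sum of residuals)^2 / (sum of p*(1-p) + lambda)."""
    return sum(residuals) ** 2 / (sum(p * (1 - p) for p in prev_probs) + lam)


def leaf_output(residuals, prev_probs, lam=0.0):
    """Used for prediction: same denominator, but the numerator is not squared."""
    return sum(residuals) / (sum(p * (1 - p) for p in prev_probs) + lam)


prev_probs = [0.5, 0.5, 0.5]
mostly_positive = [0.5, 0.5, -0.5]
mostly_negative = [-0.5, -0.5, 0.5]

print(round(leaf_similarity(mostly_positive, prev_probs), 4))  # 0.3333
print(round(leaf_similarity(mostly_negative, prev_probs), 4))  # 0.3333
print(round(leaf_output(mostly_positive, prev_probs), 4))      # 0.6667
print(round(leaf_output(mostly_negative, prev_probs), 4))      # -0.6667
```

Squaring discards the sign, which is fine for scoring a split but not for an output that has to push the log(odds) up or down.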
@SRAVANAM_KEERTHANAM_SMARANAM 4 years ago ⁺¹
Dear Krish, we have a course on machine learning. Around 40000 people subscribe to this course, but since they don't understand it, many of them drop out in the middle. Why don't you start creating videos parallel to what is taught in the class and make a playlist for it? That way you can easily get many views in one shot. Are you interested in this?
@saimanohar3363 3 years ago
Great teacher. Just a doubt: can't we take credit as the first node?
@ashwinkrishnan4285 4 years ago ⁺¹
Hi Krish,
I have a doubt here. Here all the input features (salary, credit) are categorical, so we can build the decision tree easily from the categories. Suppose we get the salary feature as continuous, like 30k, 50k, and not like 50k: how will the decision tree split be done?
@shubhambavishi5982 4 years ago
Check out the decision tree algorithm video in the ML playlist. In it, he explains how to handle numerical features.
@vishaldas6346 4 years ago
Hi Ashwin, for numerical features you have to set a threshold between values by taking the average of adjacent values. For example, for 30k - 40k you take (30+40)/2, i.e. 35k, and create a decision tree by setting value less than 35k i.e
@vishaldas6346 4 years ago ⁺¹
Hi Krish, I have a doubt: can you please confirm whether XGBoost is an ensemble technique or not? When importing it we import it separately, not from the sklearn library.
@krishnaik06 4 years ago ⁺²
It is a separate library
@vishaldas6346 4 years ago ⁺¹
@@krishnaik06 but is it an ensemble technique?
@gshan994 4 years ago ⁺¹
@@vishaldas6346 What is XGBoost and where does it fit in the world of ML? Gradient Boosting Machines fit into a category of ML called Ensemble Learning, which is a branch of ML methods that train and predict with many models at once to produce a single superior output.
@satwikram2479 4 years ago ⁺²
Finally❤
@belxismarquez4447 2 years ago ⁺¹
Please subtitle the videos in Spanish. There is a community that speaks Spanish and listens to your videos
@edwinokwaro9944 1 year ago
Is the formula for the similarity score of the root node correct, since this is a classification problem?
@nandangupta727 4 years ago
Thank you so much for such a step-by-step explanation. But I have a quick question: what would we do if we have continuous variables rather than categorical ones? Would we proceed as we do in a decision tree for continuous features, or is it not recommended to use XGBoost with continuous features?
@thepresistence5935 3 years ago
I think we should try all the models and choose by comparing the results; I think that would be better.
@subratakar4392 2 years ago
For continuous data, like salary, it first sorts that column in ascending order, then takes the average of each pair of consecutive values. Each average is then tried as a splitting condition, and the one with the highest gain is used for the split. For example, suppose you have 5 salaries: 10, 20, 30, 40, 50. The first split would be on salary
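The threshold search described in the replies above could look roughly like this in Python. It is a simplified sketch, not how the XGBoost library actually enumerates splits; the residuals are made up and every row is assumed to have a previous probability of 0.5.

```python
def candidate_thresholds(values):
    """Midpoints between consecutive distinct sorted values."""
    ordered = sorted(set(values))
    return [(a + b) / 2 for a, b in zip(ordered, ordered[1:])]


def similarity(residuals, prev_prob=0.5, lam=0.0):
    """(sum of residuals)^2 / (n * p * (1 - p) + lambda), same p for every row."""
    return sum(residuals) ** 2 / (len(residuals) * prev_prob * (1 - prev_prob) + lam)


def best_threshold(values, residuals):
    """Try every candidate threshold and keep the one with the highest gain."""
    root = similarity(residuals)
    best = None
    for t in candidate_thresholds(values):
        left = [r for v, r in zip(values, residuals) if v <= t]
        right = [r for v, r in zip(values, residuals) if v > t]
        gain = similarity(left) + similarity(right) - root
        if best is None or gain > best[1]:
            best = (t, gain)
    return best


salaries = [10, 20, 30, 40, 50]
residuals = [-0.5, -0.5, 0.5, 0.5, 0.5]  # hypothetical: actual label minus 0.5

print(candidate_thresholds(salaries))  # [15.0, 25.0, 35.0, 45.0]
threshold, gain = best_threshold(salaries, residuals)
print(threshold, round(gain, 2))       # 25.0 4.8
```

With these numbers the split at 25 wins because it separates the negative residuals from the positive ones cleanly.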
@gardeninglessons3949 4 years ago
Sir, please make a video on the differences between all the boosting techniques. They are elaborate and I couldn't find the exact differences.
@accentureprep1092 2 years ago
Hi @krish
First of all, kudos to you. Great video.
Can you tell me how XGBoost is different from the Apriori algorithm? Does it cover every combination while creating the tree, as Apriori would for the same problem statement?
Thanks and love your work
Keep rocking
@adityarajora7219 2 years ago ⁺¹
How is Pr going to change? Please explain!
@davidd2702 3 years ago
Thank you for your fabulous video! I enjoy it and understand well!
Could you tell me if the output from the XGB classifier gives 'confidence' in a specific output (allowing you to assign a class)? Or is this functionally equivalent to the statistical probability of an event occurring?
@ajiths1689 4 years ago
What should be the new probability value when we are considering the second decision tree?
@ArunKumar-sg6jf 4 years ago ⁺¹
How do you determine the value of Pr in the base model?
@RishikeshGangaDarshan 4 years ago
On the training data we first calculate the residuals and create the decision tree, but here we can't see how it classifies a point. The video talks about what happens when a new data point comes in; I am confused about this.
@durjoybhattacharya250 1 year ago
How do you decide on the learning rate parameter?
@dulangikanchana8237 3 years ago
Can you do a video on the difference between statistical models and machine learning models?
@tarabalam9962 1 year ago
Please upload a video on LightGBM.
@ajayrana4296 4 years ago
What is the similarity weight? Why do we use it, what is its advantage, and what is the intuition behind it?
@deepsarkar2003 4 years ago ⁺²
Can anyone explain the video at 21:38? (0 - 0.6) = -0.6, not 0.4, right? Or did I get it wrong? Please advise.
@sudiptodas6272 4 years ago
I have the same question.
@Jaydonj 2 months ago
yeaaa me toooooooooooo....helpppwwwww meeee!! arghhh
@seniorprog9144 4 years ago
Sir Krish, do you have code that deals with more than one target (y1, y2, ...), i.e. Y is 2 or 3 columns (two targets, three targets)?
@ManoharKumar-cw3ed 4 years ago
Thank you sir! I have a question: how do we predict the probability value at the beginning, from 0-1?
@hemantsharma7986 4 years ago ⁺¹
Aren't gradient boosting and XGBoost the same, with minor differences?
@KOTESWARARAOMAKKENAPHD 1 year ago
Can any value other than 0 be used as a hyperparameter in the XGBoost algorithm?
@Acumentutorial 2 years ago
What is the role of lambda in the similarity weight here?
@sachinjaisar5776 2 years ago
Shouldn't your similarity weight be 1? Residuals must be squared first before adding up.
@VinodRS01 4 years ago ⁺²
Sir, how does the model choose which similarity weight should be multiplied by the learning rate? Thank you sir, you are doing great by helping us🙂
@vishaldas6346 4 years ago
It's not the similarity weight that is multiplied; it's the output of the leaf node. The similarity weight is used to calculate the gain for splitting the nodes of the decision tree.
@datakube3053 4 years ago ⁺⁴
250k coming soon
@KOTESWARARAOMAKKENAPHD 1 year ago
What is the need for the log(odds) function?
@mainakray6452 3 years ago
Is the max_depth in XGBoost for each tree 2? Please answer.
@titangamezone4379 4 years ago
Sir, please make a video on gradient boosting for classification problems
@Shakhaiiuccse 1 year ago
You didn't add the lambda. Why?
@stabgan 4 years ago
23:00 that's lambda not alpha, please correct that
@sohinimitra7559 4 years ago ⁺³
Can you please do a video on feature selection approaches? Especially the use of Mutual Information. Thanks. Great videos!!
@amitshende5161 4 years ago
It's lambda that is the hyperparameter, which you mentioned as alpha...
@swethanandyala 2 years ago
Hi sir @Krish Naik. What will the initial probability be when there are multiple classes? If anyone knows the answer, please share.
@adireddy694 4 years ago
How did you calculate the probability? How did you get 0.5?
@naveenvinayak1088 4 years ago
Krish, how do you stay so focused?
@SwethaSubramanian-l7v 3 months ago
Can someone please clear up the log of odds part? Similarity weight = 1 means that's the output, but to compute that we calculate the base model output with respect to a probability of 0.5. Why?
@subhodipgiri2924 4 years ago
How can we subtract the probability of a value from that value? Suppose I take approvals in terms of Y and N: their probability still remains 0.5, but we cannot subtract 0.5 from Y or N. I did not get your concept of subtracting the probability from the value.
@KOTESWARARAOMAKKENAPHD 1 year ago
Why do we put G,N into one split rather than separately?
@mohana4179 2 years ago
Please post a mathematical explanation of LGBM, sir
@pratikbhansali4086 4 years ago
You didn't upload the gradient boosting classification videos, i.e. part 3 and part 4 of gradient boosting
@mayurpardeshi395 3 years ago
How is Krish calculating the gain?
@chiranjivikumar3690 3 years ago
What is the use?