Different Types of Feature Engineering Encoding Techniques

Krish Naik

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 19 гру 2024

КОМЕНТАРІ • 689

@krishnaik06 4 роки тому ⁺⁸⁷
Dear All, if you are looking for feature engineering materials, please check my feature engineering playlist, all videos are available. Happy Learning!
@yash20december 4 роки тому ⁺⁵
if you don't mind will u reopen the link or provide your writen codes on github with link
@krishnaik06 4 роки тому ⁺¹⁷
@@yash20december all materials are available in feature engineering playlist
@ulysses_grant 4 роки тому
Thank you sir.
@mamtarajput9846 4 роки тому ⁺²
is there something more you provide for the paid ones. please let me know.
@matrix4776 4 роки тому
Sir, Can you please send me the all feature engineering technique file. it will be very helpful to me, if you send them. My email id is
ara007kumar@gmail.com
@prateshtamhankar3568 4 роки тому ⁺¹⁹
What a coincedence, today is also an Independence day, this really suprised me, I was following your youtube videos and suddenly you greeted, for a movement it got a smile on my face. Happy Independence day.
@josealjndro 4 роки тому ⁺⁴
you are the best, greetings from an ecuadorian studying in Portugal.
@viveksrivastavasc 5 років тому ⁺⁸
There was doubt from so long about this that when there are more than 100 types of value then how to do encoding which is clear today thank you sir 🙏🙏
@jeevanraj1789 2 роки тому
Hi
@saimanikanta1365 Місяць тому
bro can you send me the material
@azharafridi9619 Рік тому
thank you so much respected sir. Alot love for you from pakistan. this video was very helpfull. we are looking foreword to see others playlist like these from you. once again thanks
@anirvansen6591 5 років тому ⁺¹
Just started watching your videos. You explain the concepts in a simple manner.Thanks
@pankajgoikar4158 2 роки тому
No Words for education. Many Thanks and wishes for futures.
@cenobit0815 4 роки тому
you saved my day with mean encoding
@sandhyas6972 4 роки тому
Hi Krish, It's the best video I have ever seen. Crystal clear.
@manishsahu3181 5 років тому ⁺⁴
Sir, please share the link once again. I saw your video and it's a very helpful for the student's like me. I want to know more about the feature engineering.
Thank you for making such an amazing lecture. Waiting for the feature engineering link.
@jeevanraj1789 2 роки тому
Hi
@akshaygupta6321 5 років тому ⁺³
Krish your way of explanation is just amazing....Thanks for these amazing videos and yes please share zip file
@jeevanraj1789 2 роки тому
Hi
@jeevanraj1789 2 роки тому
Hi
@samriddhlakhmani284 3 місяці тому
Still the best video out there. I think other content dont know what a practitioner of DS needs at 2:30 am .... :p
@sonalisrivastava9981 3 роки тому
We need mentor like you... Great job👍
@pravallikak7637 3 роки тому
Krish Sir the way you explain is easy to understand. Please reopen the form. Thanks 🙂
@sam71839 4 роки тому
Wao thank you soo much, sir you explained soo well. whenever I face any doubts your video saves my day.. God bless u .. Happy Learning
@gajjukumar8 5 років тому
Hi krish, nice way to collect the data free of cost.
@sanyamsinghal7992 5 років тому
thanks a lot, this thing can't be explained better than how you explained it.
I just became Fan of your ML knowledge.
@jeevanraj1789 2 роки тому
Hi
@bharathjc4700 5 років тому ⁺⁵⁰
Please re-open the form for feature engineering techniques. Thank you.
@streamingtamilan8421 4 місяці тому
Yes sir please re-open the form
Excited to learn the coding part too Sir.
@AkshayPatel-eq7uy 5 років тому ⁺¹
Thank you for putting the time and efforts to create this video, also all other videos. Very helpful.!
@deepeshkumarsharma6514 5 років тому ⁺¹
thanks sir for listening to my request to create a video on mean encoding , i am really enjoying your videos , and i have learned a lot from that. Please continue to create such awesome videos.
@jeevanraj1789 2 роки тому
Hi
@harshitgupta2515 5 років тому ⁺¹
Sir u r doing really great and I think under your guidance I will become a good data scientist soon...please help me sir
@floyd7835 3 роки тому ⁺¹
you are doing a wonderful job Kris...👏👏
@mahindrarao4565 3 роки тому
I liked the Mean Encoding technique and Target-guided encoding. We are preserving the normality of the data as well as not increasing the dimensions.
@shubhammishra8686 5 років тому ⁺³
Thanks Krish Bhai..I have learned a lot from your videos
@baneledludlu7983 Рік тому
Thanks man! Great content The Lord bless you with more understanding and help you to know Him better and better
@anirudhr.huilgol.9449 4 роки тому
Very useful information provided by u sir. Thank you.
@wordguinho 2 роки тому
Thank you so much for sharing your knowledge with us
@manojrana009 4 роки тому
Hi Krish, it is just a suggestion if u start same channel in Hindi language. It will more helpful to those Indian students who are living in small cities and not much familiar with English lecture. Hope u understand my request. I'm your regular viewer and respect ur effort and knowledge. God luck.
@vishalshukla2happy 5 років тому ⁺¹
Great help Krish... Thanks for your video man
@dollyshukla4821 5 років тому
Vishal Shukla. could you please share this docs with me on dolly.shukla7860@gmail.com
@jeevanraj1789 2 роки тому
Hi
@hokapokas 5 років тому ⁺²
Hey krish, nice video as usual... Filled the form and thanks for making motivational and additional support videos for encouragement. Kudos
@SanthoshKumar-dk8vs 5 років тому
Hi bro, could you please send me featuring document pls?
@hokapokas 5 років тому
@@SanthoshKumar-dk8vs you can fork it from either mine or krish's GitHub account. Check Krish's video description for his GitHub link and you find all there
@shivamkaushal4479 4 роки тому
Hello bro, can you share zip file, bcz I watched it today so not able to fill form as you know.
Kaushalshivam2018@gmail.com
@sarath20994 4 роки тому
hi bro this is sarath..
I am a data scientist aspirant can you share me feature engineering notes..
mail id : sarath20994@gmail.com
@mushirahunt 4 роки тому ⁺¹
Hello sir...your way of teaching is really incredible.
I am studying through your lecture for past 1week and that's why unable to fill the form to get the materials which you have prepared for the same...
So if possible please enable the form link again...
@gordongogah5376 4 роки тому ⁺²⁹
I came across this video today and i like to learn more feature engineering
"if you don't mind will u reopen the link sir"
@nemesisanims7401 4 роки тому ⁺¹
Yes pleaseeee
@karthikeyans1646 4 роки тому ⁺¹
Yes please sir reopen
@siddhantpathak6289 4 роки тому ⁺¹
Yes, it's very much needed now
@sarveshkhetan4241 4 роки тому
yes please sir
@priyabratapanda1216 4 роки тому
It's on the GitHub
@zeeshankhanyousafzai5229 3 роки тому
Sir, I am working in Data Science for a long time but want to your all playlist as I already have covered some of them. I need your notes on Feature Engineering so can you provide me it now. I shall be very thankful to you for this kindness.
Best wishes more love for you from Pakistan.
@jeevanraj1789 2 роки тому
Hi
@prakashsaravanan6613 3 роки тому
Excellent Explanation Sir, Thanks a lot
@salikmalik7631 3 роки тому
You are the best sir.
@anithjoseph8730 5 років тому ⁺¹⁹
Hi sir,
I want the feature engineering doc. Can you please open the link for the form?
Waiting for your response
@MM-vx8go 4 роки тому
Hello I want the feature engineering document👏👋👋👋👋👋 Just came across this video please
@akhil9869 4 роки тому
@@MM-vx8go its available on his github
@MM-vx8go 4 роки тому
Akhil Kasare where please
@MM-vx8go 4 роки тому
Akhil Kasare this is my email.. mmaxwell265@gmail.com
@MM-vx8go 4 роки тому
What's the github username
@tech_charli Рік тому
Amazing explanation sir 🙏🙏
@menardtchatchou5647 4 роки тому ⁺¹
so happy I found your channel...wooh amazing lecture
Please send me the zip file with respect to feature engineering
thank you sir
will definitely join your channel.
@thetensordude 4 роки тому
Thanks sir for all these free contents! :p
@raakupgaming 4 роки тому
Can you send the zip file to me.
arifmollick8578@gmail.com
@manikosuru5712 5 років тому ⁺¹
Thank U so much Sir for such Huge help....
@robertkumar7768 4 роки тому
The video is quite informative and easy to understand. I really loved the video :)
@VARISHROCKS 3 роки тому
Sir , Thankyou for this wonderful lecture , please share the study material
@shivaaryaprakash 4 роки тому
Thanks alot for sharing such a absolutely amazing knowledgeable video...
@taranilakshmi9680 5 років тому ⁺¹
Nice information about feature engineering. Thanks a lot
@abhaysharma3171 5 років тому
can you plz send it to me
@arvindsaini6678 4 роки тому ⁺²
Hi @Krish, can you please share the Feature engineering materials if possible. Your videos are really impressive.
@mr.foysalhossain2142 3 роки тому
excellent job Boss. really helpful
@Itachiuchiha-de9cj 4 роки тому
Guys plz if u don't like his videos then leave it, but don't do dislike 🙏
@tahamansoor599 4 роки тому
Great Video
plz give demo also
@ArindamSinha-v7b Рік тому
Hi Krish , I really liked the way you are teaching, could you please share the feature engineering study material?
@rojaroja9913 4 роки тому ⁺¹
Thank You Sir!!things we can understand easily by your Videos.Sir could you pleasee reopen the link where we could get the Feature engg materials that could be more great
@rahulranjan8682 2 роки тому
by introducing a higher number to the categories on the basis of a higher no. of occurrence in a given class ( say here 1) are you not introducing bias in the dataset? ( target guided ordinal encoding)
@svishaliyer2254 5 років тому ⁺⁹
Hi Krish, I am not able to fill the form. Its removed. Can you please upload that
@amarjeetkushwaha4258 5 років тому
Same here
@mohankumar-cw5lw 5 років тому
same here
@sravanijammula573 5 років тому
Where did krish upload the form... Can u share the link related to it
@svishaliyer2254 5 років тому
@@sravanijammula573 Krish uploaded the form when he uploaded the video. Now it's old so I think he removed that. I am also not able to fill the form as I saw video very late
@sravanijammula573 5 років тому
Thanks vishal for the update... If u are aware of it jus post it here...
@aakashprasad3126 3 роки тому
Clearly Explained, Thankyou!
@jackdairies2live400 5 років тому ⁺⁷
Hi krish,
I started seeing your videos now and want the feature engg doc. Can you please open the link for the form?
Waiting for your response.
@mohitchatterjee5517 4 роки тому
22:38 sir... how we find Output Columns and how it assigns as 0 or 1
@dionricky 3 роки тому ⁺¹
Hello, this may be late but i'll try to answer. Basically the output column is the target column. The example used on the video is binary classification, so there are only two class which is assigned to 0 and 1. If your target is not in numeric value, you can convert it yourself using pandas with assign function if i'm not mistaken.
@mohitchatterjee5517 3 роки тому ⁺¹
@@dionricky thanks buddy
@biggusmaximus1651 3 роки тому
thank you sir from tamil
@sandyjust 5 років тому
@16:18 position you are saying to use 'one-hot encoding with multi-category' for an ensemble technique. But the beginning of the video you had explained ensemble techniques does not require feature scaling. Can you please clarify?
@madunishant6052 5 років тому ⁺¹
Hey Krish! I have few questions based on encoding which are
1. Let’s suppose I have a feature which has 1000 different categories which I need to convert to either an integer/float how should I do that?Here I can’t go for one hot encoding as it might create 999 columns.And also it has only 1100 record /rows by which even though going by the “one hot encoding with multiple categories” method the most repeated categories will be extremely less how do we handle it in such cases?
2. In Ordinal encoding why are the ranks need to be assigned to a categorical label instead we can give some random unique number to the categorical values without ranking them for example PhD as 1 , BE as 2, Masters as 4 and Stats as 3.
3. Also regarding “Label Encoding” how are the ranks decided say if PhD needs to be given higher rank let’s suppose ‘4’ how can a library know that it should be given a higher rank? Or is it something else that in the library code we need to manually set it?
@pradeepkumar-qo8lu 5 років тому ⁺¹
Do you have any ideas how to tackle this issue ?
@krishnaik06 5 років тому ⁺⁴
For the first point u should not apply one hot encoding instead we can go ahead with Mean encoding.
Label encoding for ordinal categorical will be assigned with ranks. In this case PhD should have a highest rank or label. This will help us to specify the ML algorithm where in we are providing higher importance to phd
@madunishant6052 5 років тому ⁺¹
Krish Naik Thanks a lot Krish👍🏻😊 And thanks a ton for your awesome content learning new things. Waiting for the part - 2 of the series😊
@thakuraditi5 4 роки тому
Really good one Krish
@divyagayathritalla3255 2 роки тому ⁺¹
In mean encoding,If the feature values are replaced by the mean values ,the no of data values in the pincode column are still the same right??Then whats the point of doing mean encoding?
@ranjan4495 5 років тому ⁺³
1st to view, 2nd to like, 1st to comment.
@sofiarao7144 4 роки тому ⁺⁴
can someone share the feature engineering doc of krish pls? i missed filling the form.
@RahulKumar-lv9yz 3 роки тому
Did you get the material? If yes, can you share it?
@karthikeyanc7124 4 роки тому
Can anyone clarify at 18:10 , how to find the mean here? are we adding all '0' and '1' corresponding to A and dividing the total by number of occurences of A?
@vijay9-w1y 4 роки тому
yes .but am to not sure of it
@shivashisswain2682 3 роки тому ⁺¹
Hi @krish Naik, how can get the zip file of all feature engineering techniques? kindly help
@juliussilaa8998 2 роки тому
Please share with me too. Thanks
@Itachiuchiha-de9cj 4 роки тому
You are awesome sir 🙏
@jeevanraj1789 2 роки тому
Hi
@HarpreetKaur-qq8rx 4 роки тому ⁺⁸
Hi Krish, I am confused with your explanation. My doubts are: You said for target guided ordinal categories you are assigning the rank based on mean values then how does it matter if the category is nominal or ordinal since the ranks are assigned based on mean values and not the inherent rank/order of the variable itself. Also for label encoding the number actually mean anything since it isn't like price/sales where the number holds significance so won't the result be junk/unusable and if they are usable then how do you interpret the result
@benvelloor 4 роки тому ⁺⁴
If the categorical feature is ordinal then we can assign labels to it. Here the category which obtains highest target mean will be assigned the highest label value. If the categorical feature is nominal then we cannot assign labels.
This is because in label encoding the number does mean something. Different numbers teach the model to make different predictions. For example, a value of 4 (PHD) in the salary prediction example results in a prediction of higher salary whereas a value of 1 (Bcom) results in prediction of a lower salary! This happens because, in the training data, entries with PHD as educational attribute will have a higher salary in the target column. This is generally useful when we do not know the exact ranking!! We find the correlation on the categorical feature to the target and then rank categories according to the mean of the values observed in the target!
If the categorical feature is nominal, we do not want the algorithm to learn more from some categories compared to the others. Hence using one hot encoding, we set the values to be 1 and 0.
Now as for Mean encoding of nominal categorical features, we essentially map a relationship between the categories and the target. When applied to the example of pin code numbers, some pin codes may result in higher salaries (assuming we are trying to predict salary again). Hence the mean of that pin code will be higher. We simply map in the mean value in place of that category!! So now when the model learns, it will know that a data point with high value in the pin code feature should predict higher salary!
Regards!
@sivakrishna4396 5 років тому ⁺¹⁰
Could you please upload the forum again . ?
Thanks in advance :)
@saumyagupta4019 4 роки тому ⁺¹
Sir please open the form enteries to get zip file for feature engineering
@anirudhr.huilgol.9449 4 роки тому
Hi sir how is is going to be of target guided encoded and mean encoding in case of regression problem.?
@pallabsaha4098 5 років тому ⁺²
How would we calculate the mean if the output is a multi class classification. In that case shall we take 0,1,2 as output?
In the eg you have taken 0&1. Here we can do the calculation. What if there are more than 2 classification outputs.
If you could attach the notebooks in a link it would be easier i guess instead of sending personal emails. Just a suggestion.
@jeevanraj1789 2 роки тому
Hi
@firta_banjara 4 роки тому
Hi Krish,
At 20:23 the Label for A - 0 and A - 1 will be different based on mean right ?
for example the mean will be calculated this way right ?
A - 1 => 0.73
B - 1 => 0.6
C - 1 => 0.4
A - 0 => 0.5
B - 0 => 0.35
C - 0 => 0.36
Then the ordering of feature will be as below right ?
A - 1 >> B - 1 >> A - 0 >> C -1 >> C - 0 >> B - 0
@vikasdixit1166 3 роки тому
@krish Naik I have a doubt. FOr mean encoding and target guided encoding we need labels for encoding but how would we encoded the data at test time. ?
@leadership_guru 2 роки тому
Hi krish, thanks so much for shedding light on this topic of Feature Engineering. I'm at Beginner Level of learning DS/ML and I really fell in love with your way of teaching these techniques. I would really love to get that document on FE you mentioned about in this video. I tried to drop my details via the google form but I see it's closed. Kindly assist please. Thanks in advance!
@AVINASHPARASHAR-yb7cb 4 роки тому ⁺³
Hi Krish,
We cannot perform Mean or Target encoding on test data because we don't have target column in test data. So how can we deal with such a situation where we have variable with multiple level in it?
I am talking in respect with Hackathon where we generally don't have target variable, this is something which we have to predict.
Would appreciate your help.
@kentakeshi8716 Рік тому
You already got the ordinal number or the float number for each category class from the training data . So you dont need to do it again in test data. You will simply use it.
You might already know this.
But I am answering if someone else has this doubt.
@venkateshwaranp9517 4 роки тому
Hi everyone , does deleting one dummy variable column is automatically done by onehotcoding ? Or it should be done mannually
@vkasrajpurohit1614 2 роки тому ⁺¹
Hi krish.. google response link not active. how can I get the material
@sindhuranimmaraju7392 5 років тому
Hi
I already joined as a member
@javeda 3 роки тому
Please do a session work on the dython package and setting categories in it
@deviprasadmishra805 5 років тому ⁺²
Sir please could you please tell us why the theory of computation is actually used and what are the application of these subjects please Sir make a video on that
@Avyukt-AN 3 роки тому
31 dislike for what? Teaching you free of cost with market standard!!
One should provide the link of better videos,if they dislike anything. 🥇
@chayanikaporel3858 5 років тому ⁺²
SIr,I want the feature engineering doc. Can you please open the link again?
@AbhishekKumar-jv3fe 5 років тому ⁺¹
Hi Krish ..... I had recently started to follow up your video & it was very helpful. could you please provide me the materials related to feature Engineering......thanks in advance
@Anastasia-wy1uj 4 роки тому
Sorry for the stupid question: what´s the output? I mean you encode before you apply any ML algorithm and to that point you just have the dataset, what kind of output do you mean here? Thanx
@shrikarnarwade4424 2 роки тому
In Target guided encoding if mean of two variables are same then how to assign numbers? As both has same mean how to decide for which has to give more numbers.?
@ghurahuchaurasiya2444 4 роки тому
It's very helpful, sir please reopen the form link...
@mohitrock100 4 роки тому
Dear Sir - In one hot encoding with multiple categories - We are only taking the top 10 categories and applying One Hot Encoding on the same. What about the other categories, as we are dissolving the column completely to apply One-Hot-Encoding.
@jeevanraj1789 2 роки тому
Hi
@adhipathis12 3 роки тому
Thanks Krish!!!! :)
@arunporky 4 роки тому
Hi sir.
I have started to learn ML from your channel only. Thank you for your knowledge that you are sharing with us.
I also have one request for you can i get feature engineering zip file now. I am really interested in ML.
@meharajbeguma7493 4 роки тому ⁺¹
sir, pincode is non-categorical variable. Then why do we go for encoding?
@af121x 4 роки тому
Thanks a lot for the clear explanation. Can you Please reopen the google form again?
@mukul98m 5 років тому ⁺¹
how to perform mean and target encoding for Regression problem?
@deepeshkumarsharma6514 5 років тому
use groupby function to create a data frame consisting of mean value of a feature wrt to the id , or name( of which you want to know the value , then use merge function to add this mean value data frame to your original data frame( which you will use for training you model)
@gerardoacaba4573 2 роки тому
Dear Krish,
I can not find tutorial on One Hot Encoding using Postgresql. Can you show a psql syntax of it?
@rohitchan007 Рік тому
I read on scikit learn documentation that label encoding should be applied only to the output column.
@jjayeshpawar 3 роки тому
Krish what to do when its classification problem and we have pin code column in our dataset?
@nadellagayathri 4 роки тому
Hi Krish, I have started liking your channel so much. Hats off for the great service you are doing for the aspiring and already experienced Datascientists. The form url which you have shared is no more available. Could yuo please share the material via google drive or reactivate the form.
@rajashekarnagalikar7788 4 роки тому ⁺¹
Hey Krish, can you please do share the ZIP file which you have mentioned in the video about the Feature engineering, as I am unable to open the Google url link. it will be more helpful if you help me with the file.
@NazrulIslam-dw6bf 8 місяців тому
For Ordinal Data: How Label Encoding works ? I have some confusion here because in case of ordinal data value with weight matters ? Can you pls explain bit in detail pls.
@meharajbeguma7493 4 роки тому
For one hot encoding with multiple catergories, it will create columns for top most categories.Then what happened for the records with remaining categories. Do they have 0 for all columns.
@sagardesai1253 4 роки тому
Thanks for sharing, the video is helpful!
@tejas8211 4 роки тому ⁺²
My doubt is with the mean encoding
What if two values in one feature get the same mean ?
@shubhamkakade3681 3 роки тому
in one hot encoding for mult. category if top 20 category has same count then what should be done

Наступне

Автоматичне відтворення

Why Do We Need to Perform Feature Scaling?