Handle Categorical features using Python
- Published Feb 10, 2025
- Here is a video which provides a detailed explanation of how we can handle categorical features using Python. We will basically be applying the get_dummies() function from the pandas library.
#HandlingCategoricalfeatures
Github url: github.com/kri...
You can buy my book on Finance from Amazon
amazon url: www.amazon.in/...
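For reference, a minimal sketch of the get_dummies() approach shown in the video; the column name and values here are made up for illustration, not the dataset used in the video:

```python
import pandas as pd

# Toy dataframe with a single categorical column (made-up example data)
df = pd.DataFrame({"State": ["New York", "Florida", "California", "Florida"]})

# One-hot encode the column; drop_first=True avoids the dummy-variable trap
# by dropping one category (it is implied when all the other dummies are 0)
dummies = pd.get_dummies(df["State"], drop_first=True)

# Attach the dummy columns and drop the original categorical column
df = pd.concat([df.drop("State", axis=1), dummies], axis=1)
print(df)
```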
This is the simplest way of encoding the categorical features. Thanks man!!
Well explained🎉🎉
you explain like pro bro.....
Exactly what I was looking for! Thank you
Thank you so much, sir! You are the best teacher
Thank you so much... It was so easy...
Thanks Krish
thank you so much, this is actually clearer than the stupid class I enrolled earlier
Which is better, the Hark method you mentioned at the end or feature hashing? If feature hashing is efficient, could you explain it in another video?
Thanks krish for this video ..
I have a doubt about the last part of the video: while converting the categorical feature to a numerical feature, the pincode 2001 is represented as 1 in one instance and as 0 in another. On what basis do we represent it like this?
Thank you so much for the video
Thank you Krish.
What happens if all of our independent variables are categorical, e.g. a movies dataset with country of origin, movie_type, and director? If I now want to predict the missing IMDb values, how can I handle those categorical variables?
One question, sir. I was working on a classification dataset where the output variable is also categorical. I applied OHE, but later when I looked at the heatmap it made no sense because the columns were a bit blank. Correct me where I am wrong here.
Thanks for the video Krish.
When I ran "df" after concatenating, why do all the values for Florida & New York come up as "NaN"?
How can we save the count obtained for a particular category so it can be used later in any calculation?
Krish, but what if we have a regression problem? Then we would not have the output as 0/1, so how do we encode categorical features like pincode? Do we use frequency/count encoding there?
Hi Krish, could you please guide me on how to handle a text column for a regression problem? It's not about encoding categorical features. What I am looking for is extracting some meaningful information from an existing text column using string manipulation methods and regex... Please recommend an effective way of doing this.
Hi Krish,
can you show how to convert a categorical variable to a numeric variable through code?
At 17:08 you made it clear that for 2001 where the output is 1 the encoded value will be 0.6; what about 2001 where the output is 0?
Actually, the idea is taking the mean of the output for 2001. Suppose 2001 appears with outputs 1 and 0; the mean might be 0.5, and that same value replaces both rows.
Sir, please upload a video on how to perform mean encoding !!
I want to create a box plot for a categorical variable (like subscribed: yes/no).
First I wrote the code: train = pd.get_dummies(train['subscribed'], drop_first=True)
And then, for creating the box plot: train['subscribed'].plot.box()
But this shows an error: KeyError: 'subscribed'
Please let me know my mistake.
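Reply with a sketch: the KeyError appears because `train` was overwritten with only the dummy columns, so 'subscribed' no longer exists. A minimal example of one way around it, assuming a made-up numeric 'duration' column (not from your dataset), keeping the original column and plotting a numeric variable grouped by the category:

```python
import pandas as pd
import matplotlib.pyplot as plt

# Hypothetical stand-in for the train dataframe
train = pd.DataFrame({
    "subscribed": ["yes", "no", "yes", "no", "yes"],
    "duration":   [120, 45, 300, 60, 210],
})

# Keep the dummies in a separate variable instead of overwriting train,
# so the original 'subscribed' column is still available
dummies = pd.get_dummies(train["subscribed"], drop_first=True)
train = pd.concat([train, dummies], axis=1)

# A box plot of a 0/1 dummy is rarely useful; plotting a numeric column
# grouped by the category is usually what is wanted
train.boxplot(column="duration", by="subscribed")
plt.show()
```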
Please make a video on visualisation using matplotlib.
If we have more than 5 categorical-feature columns, what should we do? For example, country, age group, and so on?
Hi Krish,
suppose we have 8 category values; will it generate 8 new columns?
yes
How do we apply one-hot encoding if we have categorical data in Y (the dependent column)?
I have a doubt...
When dealing with categorical values having many classes, you took all the 2001s and found the probability that the output is 1.
Suppose that comes out to 0.6 (as in the video). Now you are replacing all 2001s with 0.6, no matter whether the output is 0 or 1... WHY?
Should we not replace 2001 with 0.6 only if the output is 1, and with 0.4 otherwise?
Thanks for the video btw!
Where does the O/P column come from? Can you explain that to me?
Same doubt here
@Premjith Augustine what if the output is not a classification variable? The target variable can be continuous, like price, fees, or profit.
That 0.6 is the mean over the rows with output 0 as well as those with output 1.
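For anyone with this doubt, a minimal sketch of that mean (target) encoding step, using made-up pincode/output numbers (not the exact ones from the video); it shows that the per-pincode mean replaces every row of that pincode, regardless of that row's own output:

```python
import pandas as pd

# Made-up data: pincode is the high-cardinality feature, output is the 0/1 target
df = pd.DataFrame({
    "pincode": [2001, 2001, 2001, 2001, 2001, 2002, 2002],
    "output":  [1,    1,    1,    0,    0,    1,    0],
})

# Mean of the target per pincode: here 2001 -> 3/5 = 0.6, 2002 -> 0.5
mean_map = df.groupby("pincode")["output"].mean()

# Every row with the same pincode gets the same encoded value,
# whether its own output is 0 or 1
df["pincode_mean_enc"] = df["pincode"].map(mean_map)
print(df)
```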
If we have a large number of categorical variables, say 21, then using get_dummies will give us a large number of columns, so how do we deal with such a case?
Were you able to find out how to handle your problem, i.e. the 21 categorical variables?
I think we can use feature selection, like SelectKBest, to get the top k features (any number up to the total columns), i.e. the new features that have the strongest relationship with the target.
@aakashsinghrawat3313 yes, your approach is good. I also found one more approach: replace categorical values with their counts. E.g., we have 29 states and 10K records; if Delhi appears 500 times and Karnataka 900 times, I replace Delhi with 500 and Karnataka with 900.
@gurjotsingh752 isn't it inefficient to label a categorical feature with 500 and 900? This method might be better suited to ordinal features.
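For anyone following this thread, a minimal sketch of that count (frequency) encoding idea, with made-up state data; whether it helps is model-dependent, and as noted it imposes an ordering the categories may not actually have:

```python
import pandas as pd

# Made-up data: each state appears a different number of times
df = pd.DataFrame({"state": ["Delhi", "Karnataka", "Delhi", "Karnataka",
                             "Karnataka", "Goa"]})

# Replace each category by how often it occurs in the data
counts = df["state"].value_counts()
df["state_count_enc"] = df["state"].map(counts)
print(df)
```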
Hi Krish.. I wrote exactly the same code simultaneously to practice, but my score came out to be -5.667 (I got a negative value), whereas you got 0.9304. I am not able to understand why I am not getting the same value. Please explain.
Just set the seed once
Thanks bro
Getting a KeyError for the column I used for the categorical data, please help.
what to do if there is mixed data, continuous and categorical?
Can someone please give me a link to a solved example using target encoding / mean encoding like the above?
What is the last encoding type called?
mean encoding