Thanks for watching! 🙌 If you're new to OneHotEncoder, you may want to watch this video as well: ua-cam.com/video/0w78CHM_ubM/v-deo.html
Does that mean I should also not use drop='if_binary' or pass a drop array? Thanks very much!!!
Very useful tip! Thank you! 👍
You're very welcome!
Why would I use a StandardScaler on a categorical column? Also, if I use a StandardScaler on the numerical columns but not on the columns to which I applied OneHotEncoder, can I then drop a column?
This is mostly an issue with Logistic Regression, Linear Regression, and Linear SVM. So using drop must be important in those cases? How does scikit-learn prevent an error there?
Also, does not dropping columns affect the interpretability of the model? I don't know whether it does, I'm just asking what it would mean.
Thanks a lot for this and your other videos! But what's the right way to deal with this issue when using unregularized regression? I need to drop one category because of multicollinearity, but I don't want my unknown category to be encoded the same way as my base category. Please help me out. Thank you
If you set handle_unknown to 'error', then this won't be a problem. Hope that helps!
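Here's a minimal sketch of that idea with made-up toy data (the colors are just for illustration, not from the video): with handle_unknown='error', an unseen category raises an error at transform time instead of silently being encoded as all zeros like the dropped base category.

```python
import numpy as np
from sklearn.preprocessing import OneHotEncoder

X_train = np.array([['red'], ['green'], ['blue']])
X_new = np.array([['purple']])  # category not seen during fit

# drop='first' removes the base category; handle_unknown='error' (the default)
# makes transform raise an error on unseen categories rather than encoding
# them as all zeros, which would look identical to the base category
ohe = OneHotEncoder(drop='first', handle_unknown='error')
ohe.fit(X_train)

print(ohe.transform(X_train).toarray())
# ohe.transform(X_new)  # raises ValueError: Found unknown categories
```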
I think after watching your video on effective machine learning methods, I now know why not. As you discussed there, I usually let my grid search decide it.....
Glad to hear, Ganesh!
Which video is it?
Does this also apply to a regular logistic regression (not regularized)? I don't think the model would converge with perfectly correlated dummy variables. How does sklearn handle this?
In scikit-learn, logistic regression is regularized by default.
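A quick sketch with toy data (not from the video) shows the default settings: LogisticRegression applies an L2 penalty out of the box, which is what keeps the fit stable even with redundant dummy columns.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=100, random_state=0)

# L2 regularization is on by default (penalty='l2', C=1.0), which keeps the
# coefficients identifiable even when the dummy variables are collinear
clf = LogisticRegression(max_iter=1000)
clf.fit(X, y)
print(clf.penalty, clf.C)  # 'l2' 1.0

# an unregularized fit would be LogisticRegression(penalty=None) in recent
# scikit-learn versions (or penalty='none' in older ones)
```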
Thank you!
You're welcome!