Want to watch all 50 scikit-learn tips? Enroll in my FREE online course: courses.dataschool.io/scikit-learn-tips
This is the last scikit-learn tip I'll be posting... thank you SO MUCH for watching! 🙌
I recently learned that you can use handle_unknown for OrdinalEncoder, but this requires scikit-learn 0.24 or later.
What do you think about using OneHotEncoder for only the 5 or 10 most common values?
Regarding handle_unknown with OrdinalEncoder, that's correct! I was excited to see that option released.
Regarding OneHotEncoder with a frequency cut-off, that can be a useful strategy. It's not currently easy to do in scikit-learn, but it will be possible in a future version.
Thanks for your comment!
THE BEST TIP SO FAR!
You are so kind, thank you! 🙏
Hey Kevin, very helpful videos! In this video,
num_cols = make_column_selector(dtype_include='number')
-> Does 'num_cols' here also include the dependent/target column? (Assuming it is a numerical column)
If yes, say we are scaling the other independent features using RobustScaler() because of the presence of a lot of outliers, but the target column does not have many outliers. Will it affect the regression output?
What is the way out (I want to scale all numerical columns except the target column)?
Excellent question! No, num_cols does not include the target column, because the preprocessor is only applied to the columns in X. Hope that helps!
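A quick way to verify this yourself (the toy DataFrame and column names below are invented): split off the target first, so the selector is only ever evaluated against X.

```python
import pandas as pd
from sklearn.compose import make_column_selector, make_column_transformer
from sklearn.preprocessing import OneHotEncoder, RobustScaler

df = pd.DataFrame({'age': [25, 30, 99],
                   'city': ['SF', 'NYC', 'SF'],
                   'price': [100.0, 200.0, 150.0]})  # 'price' is the target

# separate the target BEFORE preprocessing, so the selector never sees it
X = df.drop(columns='price')
y = df['price']

num_cols = make_column_selector(dtype_include='number')
cat_cols = make_column_selector(dtype_exclude='number')

preprocessor = make_column_transformer(
    (RobustScaler(), num_cols),
    (OneHotEncoder(), cat_cols))

# the selector runs against X at fit time, so only 'age' is scaled
print(preprocessor.fit_transform(X))
print(num_cols(X))
```

make_column_selector returns a callable, so you can call num_cols(X) directly to see exactly which columns it picks up, and the numeric target never appears because it was dropped from X.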
Thank you for all the tips, I learned so much. I just wondered: are we able to (or is it proper to) use FeatureUnion instead of make_pipeline when combining transformer objects, passing them as featureunion1 and featureunion2 with these numerical/non-numerical constraints?
Great videos ... one question: let's say the number of categories in a column is large. What would be the ideal encoding? One-hot encoding isn't really ideal, as it will create too many dummy columns.
Glad you like the videos! As for your question, there are a lot of factors that influence the optimal encoding, but you can certainly try OrdinalEncoder instead. However, you will find that it's often not a problem to create thousands of dummy columns, and that feature can still improve the performance of your model. Hope that helps!
@@dataschool I thought of ordinal encoding, but ordinal encoding inherently introduces rank, like 1 > 2 > 3 and so on. In my case the categories have no order; all have equal weight. I've chosen binary encoding because it at least reduces the columns to log N, where N is the count of distinct categories. My only doubt is: does it introduce order, or is it unordered?
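One data point on the "too many dummy columns" worry from this thread: OneHotEncoder returns a sparse matrix by default, so even thousands of categories cost only one stored value per row. A rough sketch with synthetic data:

```python
import numpy as np
from sklearn.preprocessing import OneHotEncoder

rng = np.random.default_rng(0)
# one column with up to 5,000 distinct categories, 100,000 rows
X = rng.integers(0, 5000, size=(100_000, 1))

ohe = OneHotEncoder()    # sparse output by default
Xt = ohe.fit_transform(X)
print(Xt.shape)          # one dummy column per distinct category
print(Xt.nnz)            # only one stored value per row
```

Despite thousands of dummy columns, the stored data is just one nonzero entry per row, which is why high-cardinality one-hot encoding is often more practical than it first appears.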
Great video. I have been learning from your videos since the end of 2018.
Thank you so much and God bless you, Kevin. From India
That's great to hear! 🙏
Can you explain how we can perform EDA in NLP?
Awesome!
Thanks!