How to do Deep Learning with Categorical Data

  • Published 8 Jul 2024
  • If you’re like me, you don’t really need to train self-driving-car algorithms or build cat-image detectors. Instead, you're likely dealing with practical problems and normal-looking data.
    The focus of this series is to help the practitioner develop intuition about when and how to use Deep Learning (DL) models in normal situations with normal data, e.g. structured data (i.e. something you can read into pandas). I will teach you the fundamentals: the building blocks of DL.
    There are many courses that teach DL for computer vision, NLP, etc. This is not that. This series is about teaching the practitioner how to transform normal machine learning (ML) models into DL models, and I have a lot of experience doing just that.
    Some existing DL courses are overly theoretical (not useful to practitioners), overly simplistic (understating the sophistication involved), or even overly practical (giving the practitioner a false sense of security). DL is hard. Real data science is hard. We want to steer you away from the most common mistakes.
    By starting with tabular data, we can introduce you to the DL toolbox in a more intuitive way. Note, this series is not about the underlying math for neural networks or the like.
    This series is aimed most directly at intermediate-level users.
    Helpful links:
    Link to Deep Learning Building Blocks Series:
    • Python Keras - Deep Le...
    Link to GitHub repo including categorical data lesson:
    github.com/knathanieltucker/d...

COMMENTS • 29

  • @DataTalks
    @DataTalks  4 years ago +5

    Quick note on the embedding layer! The input length being set to 5 in the embedding layer means that you have the same base categories (like words or tags) for each of the inputs. If you have 5 different types of categories you'll need to use 5 different embedding layers!

    • @jasonclement6305
      @jasonclement6305 4 years ago

      So if I had a categorical for say... zipcode and one for race... I'd separate them into multiple embedding layers, correct?

    • @DataTalks
      @DataTalks  4 years ago +4

      @@jasonclement6305 exactly!

    • @sifar1857
      @sifar1857 2 years ago

      Do you have a sample of how this is done?

    • @herrylau7381
      @herrylau7381 2 years ago

      If I have two different categorical inputs, do I have to embed them separately and then concatenate the three together?

    • @DataTalks
      @DataTalks  2 years ago +1

      @@herrylau7381 If those inputs are from different categories (e.g. color and size) then yes!
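The advice in this thread (one embedding layer per distinct categorical column, then concatenate with the numeric features) can be sketched in plain numpy. The column names, vocabulary sizes, and values below are invented for illustration; in Keras, each lookup table would live inside its own Embedding layer.

```python
import numpy as np

rng = np.random.default_rng(0)

# Made-up data: two unrelated categorical columns plus one numeric column.
color = np.array([0, 2, 1, 0])                      # 3 color categories, integer-encoded
size = np.array([1, 0, 1, 1])                       # 2 size categories, integer-encoded
price = np.array([[9.99], [4.50], [7.25], [3.10]])  # numeric feature, shape (4, 1)

# Each categorical column gets its OWN lookup table (in Keras, the
# weight matrix inside a separate Embedding layer per column).
color_table = rng.normal(size=(3, 4))  # 3 categories -> 4-dim vectors
size_table = rng.normal(size=(2, 4))   # 2 categories -> 4-dim vectors

# An embedding lookup is just row indexing into the table.
color_vecs = color_table[color]  # shape (4, 4)
size_vecs = size_table[size]     # shape (4, 4)

# Concatenate the embedded categoricals with the numeric feature,
# as a Concatenate layer would before the Dense layers.
features = np.concatenate([color_vecs, size_vecs, price], axis=1)
print(features.shape)  # (4, 9)
```

Reusing one table for both columns would wrongly force, say, color code 1 and size code 1 to share a vector; separate tables keep the categories independent.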

  • @soupizcool
    @soupizcool 3 years ago +4

    You do not have enough views. This is fantastic. I have been working on a project and have been absolutely baffled about how to get the embedding layers to work properly. The Keras API docs and many other sources/videos do not clearly address that you must first separate the categorical variables from the numerical ones. In most examples I have seen, people work with datasets that are entirely categorical, not a mixture. I was so confused about why the embedding layers would not know which features to embed, but your video made it so clear. Thanks again, keep making videos.

  • @stackexchange7353
    @stackexchange7353 3 years ago

    Your videos are amazing. Thanks for making these concepts so easy to understand.

  • @MasterBen007
    @MasterBen007 1 year ago

    Tysm bro, this is my first time doing ML and I've been pulling my hair out trying to figure out how to use zip codes in my data, and somehow found this perfect video

  • @ph0b056
    @ph0b056 2 years ago

    Great tutorial! One small ask, though: how do I get the classification report for this? Thanks in advance

  • @semidevilz
    @semidevilz 3 years ago +1

    Thank you! Wanted to clarify: does the “embedding” happen at “embedding_layer = ...”, or does it happen during model training?
    Also, how do I go about extracting the embedded vectors? I.e., I want to use these embeddings to train another ML model.

    • @DataTalks
      @DataTalks  3 years ago +1

      The embedding always happens during training! You can use the below to get the weights:
      layer.get_weights(): returns the weights of the layer as a list of NumPy arrays.

    • @semidevilz
      @semidevilz 3 years ago

      @@DataTalks thank you!
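Following up on get_weights(): a minimal numpy sketch of what “extracting the embedded vectors” amounts to. The matrix below is a stand-in for the (vocab_size × embedding_dim) array a trained Keras Embedding layer would return as layer.get_weights()[0]; its values and the integer codes are made up for illustration.

```python
import numpy as np

# Stand-in for the learned weight matrix of an Embedding layer,
# i.e. what layer.get_weights()[0] would return after training:
# one row per category, one column per embedding dimension.
learned = np.arange(12, dtype=float).reshape(4, 3)  # 4 categories, 3-dim vectors

# "Extracting the embedded vectors" for a column is just indexing that
# matrix with the column's integer codes; the result is an ordinary
# numeric feature matrix you can feed to any other ML model.
codes = np.array([2, 0, 3])
vectors = learned[codes]
print(vectors.shape)  # (3, 3)
```

In other words, the embedding vectors only become meaningful once training has updated the weight matrix; pulling them out afterward is a plain row lookup.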

  • @feifei989
    @feifei989 2 years ago

    Never appreciated any video like this before.

  • @SayanRay0rayzallnight
    @SayanRay0rayzallnight 2 years ago +1

    Thanks a lot, great explanation. A question: what if you have multiple categorical variables in your dataset? Do you have to use an embedding layer for each variable? Also, if your target variable is also categorical, do you need to embed it too?

    • @DataTalks
      @DataTalks  2 years ago +2

      If your target is categorical you'll most likely need to change the loss function to categorical cross-entropy! If you've got multiple categorical variables you'll need to embed them separately :)

    • @SayanRay0rayzallnight
      @SayanRay0rayzallnight 2 years ago

      @@DataTalks thanks!

    • @anticopss
      @anticopss 2 years ago

      @@DataTalks Hi, and thank you for your great video :) If I understood your comment correctly, you mean that we would have to have multiple cat_inputs, one for each category? So basically, supposing that our 5 categorical variables are all different, we would have to perform the embedding 5 times, each with input_length = 1?
      Thanks in advance for your answer!

    • @DataTalks
      @DataTalks  2 years ago +1

      @@anticopss that is exactly right!

    • @anticopss
      @anticopss 2 years ago

      @@DataTalks thank you very much!
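The scheme agreed on in this thread, five different categorical columns, each integer-encoded and fed through its own embedding (the input_length = 1 case), can be sketched in plain numpy. The vocabulary sizes, embedding dimension, and data below are all invented for illustration; in Keras this would be five Input/Embedding pairs feeding a Concatenate layer.

```python
import numpy as np

rng = np.random.default_rng(42)

# Five DIFFERENT categorical columns, one integer code per sample each
# (the input_length = 1 case). Vocabulary sizes are invented.
vocab_sizes = [3, 5, 2, 4, 6]
n_samples = 8
X = np.column_stack([rng.integers(0, v, size=n_samples) for v in vocab_sizes])

# One lookup table per column: five separate "embedding layers".
dim = 4
tables = [rng.normal(size=(v, dim)) for v in vocab_sizes]

# Embed each column with its own table, then concatenate the results
# into one (n_samples, 5 * dim) feature matrix for the Dense layers.
embedded = np.concatenate([tables[j][X[:, j]] for j in range(5)], axis=1)
print(embedded.shape)  # (8, 20)
```

Each table is sized to its own column's vocabulary, which is exactly why a single shared embedding layer cannot serve five unrelated variables.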