One Hot Encoding with Python | Handling Categorical Data
Вставка
- Опубліковано 8 лип 2024
- In this tutorial you can see how one hot encoding is applied in order to handle categorical data, step-by-step, in a real world data problem environment.
You can check out the whole project from A to Z on my GitHub page:
github.com/danbochman/FARS_LE...
If you have any questions, feel free to ask in the comments!
Please let me know if there are any specific machine learning tutorials you wish to see, and I'll be happy to do them.
Music: www.bensound.com
I can't thank you enough! you saved my life i spent 12 hours trying to solve an error and what you explained helped me in 15 mins THANK YOU SO MUCH
Very happy to hear this video helped you!
Good luck!
you deserve trophy
Thank you so much Dan
Nice explanation Dan
Explained it very well. Thanks a ton...SUBSCRIBED!!!
Thank you, very helpful
Great video. Keep it up!
Amazing!
Hey Dan very well explained and thank you for uploading and helping others. I had a question - How do you extend the threshold to the test set or production input since the columns could be different in train set and the one we get in production for high cardinallity categories
Goood work man..Keep it up
Saved my ass, man. Thanks!
wonder what is difference between encoding and mapping. For example if STATE_CD goes from 1 to 50 say, now its numeric - can it be used in AI learning without resorting to one hot encoding?
If you map states to numbers 1 to 50, it can be used in ML, but you inserted an internal relationship that doesn’t exist (state 1 is more similar to state 2, very far from state 50)
@@danbochman thanks for the reply. understood
Can you make same video using pyspark libraries instead of pandas
Hey Gourav,
Have you tried Dask? It has many of the same abilities of PySpark, but it's native to Python and the Pandas syntax.
I made a tutorial video about it:
ua-cam.com/video/Alwgx_1qsj4/v-deo.html
One Hot Encoding with Dask:
dask-ml.readthedocs.io/en/latest/modules/generated/dask_ml.preprocessing.DummyEncoder.html