Feature Engineering Techniques For Machine Learning in Python

Поділитися
Вставка
  • Опубліковано 21 лип 2024
  • Thank you for watching the video! Here is the Colab Notebook: colab.research.google.com/dri...
    California Housing Kaggle Dataset: www.kaggle.com/camnugent/cali...
    Learn Python, SQL, & Data Science for free at mlnow.ai/ :)
    Subscribe if you enjoyed the video!
    Best Courses for Analytics:
    ---------------------------------------------------------------------------------------------------------
    + IBM Data Science (Python): bit.ly/3Rn00ZA
    + Google Analytics (R): bit.ly/3cPikLQ
    + SQL Basics: bit.ly/3Bd9nFu
    Best Courses for Programming:
    ---------------------------------------------------------------------------------------------------------
    + Data Science in R: bit.ly/3RhvfFp
    + Python for Everybody: bit.ly/3ARQ1Ei
    + Data Structures & Algorithms: bit.ly/3CYR6wR
    Best Courses for Machine Learning:
    ---------------------------------------------------------------------------------------------------------
    + Math Prerequisites: bit.ly/3ASUtTi
    + Machine Learning: bit.ly/3d1QATT
    + Deep Learning: bit.ly/3KPfint
    + ML Ops: bit.ly/3AWRrxE
    Best Courses for Statistics:
    ---------------------------------------------------------------------------------------------------------
    + Introduction to Statistics: bit.ly/3QkEgvM
    + Statistics with Python: bit.ly/3BfwejF
    + Statistics with R: bit.ly/3QkicBJ
    Best Courses for Big Data:
    ---------------------------------------------------------------------------------------------------------
    + Google Cloud Data Engineering: bit.ly/3RjHJw6
    + AWS Data Science: bit.ly/3TKnoBS
    + Big Data Specialization: bit.ly/3ANqSut
    More Courses:
    ---------------------------------------------------------------------------------------------------------
    + Tableau: bit.ly/3q966AN
    + Excel: bit.ly/3RBxind
    + Computer Vision: bit.ly/3esxVS5
    + Natural Language Processing: bit.ly/3edXAgW
    + IBM Dev Ops: bit.ly/3RlVKt2
    + IBM Full Stack Cloud: bit.ly/3x0pOm6
    + Object Oriented Programming (Java): bit.ly/3Bfjn0K
    + TensorFlow Advanced Techniques: bit.ly/3BePQV2
    + TensorFlow Data and Deployment: bit.ly/3BbC5Xb
    + Generative Adversarial Networks / GANs (PyTorch): bit.ly/3RHQiRj
    Timeline:
    00:00 Introduction
    1:44 Initial Setup
    10:00 Dimensionality Reduction (PCA)
    16:22 Preprocessing / Scaling
    26:08 Categorical Encoding (Dummy / One-Hot)
    33:09 Binning (Grouping / Aggregating)
    37:56 Clustering (K-Means)
    44:08 Feature Selection

КОМЕНТАРІ • 64

  • @GregHogg
    @GregHogg  11 місяців тому

    Take my courses at mlnow.ai/!

  • @roblee3667
    @roblee3667 2 роки тому +13

    Run a heat map for all columns when viewing correlations before running PCA, there are way more opportunities for dimensionality reduction.

    • @GregHogg
      @GregHogg  2 роки тому +1

      Oh interesting I should look into this

  • @e_hossam96
    @e_hossam96 2 роки тому +6

    for info: if you delete the column island then you should delete the rows containing value 1 as well or you will have the other encoded columns equals zero in all.

  • @nokroman
    @nokroman 9 місяців тому +1

    i've been lost in feature engineering chapter on the book that i am currently ready for machine learning right now, and straight ahead i found your video, and with the whole 47 minutes i have learned 2-3 things from you and i understand the whole process lot more better now, this all thanks to you Greg! keep up making these types of videos bcs WE NEED YOU!!

    • @GregHogg
      @GregHogg  9 місяців тому

      Awe this is so nice. So glad to have brought a bit of value. Thank you so much for the encouragement and support, it means a lot. Happy learning 😄

  • @arsheyajain7055
    @arsheyajain7055 2 роки тому +4

    Been waiting for this one!! Amazing video! Thanks Greg

    • @GregHogg
      @GregHogg  2 роки тому +1

      Glad to hear it! No problem 👍😊👍

  • @Eizengoldt
    @Eizengoldt 9 місяців тому +1

    Greg, your videos are absolutely lovely and reinforce everything I’m learning in my classes, thank you so much

    • @GregHogg
      @GregHogg  9 місяців тому

      Oh I'm so glad to hear that! Thank you for the support and for the kind words, I really appreciate it :)

  • @mikekertser5384
    @mikekertser5384 2 роки тому +2

    I've waited for this video! Many Thanks! :)

    • @GregHogg
      @GregHogg  2 роки тому +1

      Yup, I know you have been. No problem Mike and thank YOU 😄

  • @jacobsquires
    @jacobsquires 8 місяців тому

    You explain everything in such an easy-to-follow way! Thanks for the amazing video!

    • @GregHogg
      @GregHogg  8 місяців тому

      Ghost to hear it! Thank you so much, and you're very welcome!

  • @ShadowD2C
    @ShadowD2C 7 місяців тому

    really great comprehensive video, would be great if you did one on how to select features to get the best results for this problem itself

  • @findoc9282
    @findoc9282 Рік тому

    this kind of way is what we need outside the uni class. Enough for PCA knowledge in Uni, let's code!

  • @crepantherx
    @crepantherx 2 роки тому +2

    Buddy, I have subscribed you. Please keep uploading more video that helps a lot

    • @GregHogg
      @GregHogg  2 роки тому

      Thanks so much, and I absolutely will!

  • @ashleyb5849
    @ashleyb5849 2 роки тому +2

    Amazing video as always! Super helpful

    • @GregHogg
      @GregHogg  2 роки тому

      Glad you thought so!

  • @darshanprabhu3674
    @darshanprabhu3674 Рік тому

    Thanks a lot greg , you have helped me a lot through this videos.

  • @user-dv4cd3bi3o
    @user-dv4cd3bi3o 10 місяців тому

    Thank you so much. Your efforts are really appreciated.

  • @hsoley
    @hsoley 2 роки тому +1

    Best of the best!
    Thank you greg, you are the man!

    • @GregHogg
      @GregHogg  2 роки тому

      Thanks so much! I really appreciate that.

  • @rafik1968
    @rafik1968 7 місяців тому

    Thank you a lot for these helpful experiments .. it gave me a lot of ideas in data preprocessing !

    • @GregHogg
      @GregHogg  7 місяців тому

      Super glad to hear it!

  • @kashishrajput4934
    @kashishrajput4934 Рік тому

    Thank you for posting this, i like all of your videos :)

  • @gpsvlogs7853
    @gpsvlogs7853 2 роки тому

    You're Gold !! Keep up the good work.

    • @GregHogg
      @GregHogg  2 роки тому

      Thanks so much really appreciate that

  • @ernestoflores3873
    @ernestoflores3873 3 місяці тому

    Thanks! Very usefull!

  • @prasadchavan3562
    @prasadchavan3562 Рік тому

    great🙌🏻, very helpful keep making more such videos

    • @GregHogg
      @GregHogg  Рік тому

      Thanks so much, will do!

  • @shivanshusharma8154
    @shivanshusharma8154 5 місяців тому

    This is very informative....

  • @runbama1013
    @runbama1013 8 місяців тому

    great job thank you

    • @GregHogg
      @GregHogg  8 місяців тому

      You're very welcome!

  • @Asoville18
    @Asoville18 5 місяців тому

    Thanks man

  • @rajpulapakura001
    @rajpulapakura001 Рік тому

    amazing tutorial

  • @lazytocook
    @lazytocook Рік тому

    do u have a tutorial on catBoostencoder

  • @seemunchan1664
    @seemunchan1664 Рік тому

    Superb

  • @agoodfriend365
    @agoodfriend365 Рік тому

    thank you bro

  • @ahmedaj2000
    @ahmedaj2000 2 роки тому

    thanks!

  • @cipher4811
    @cipher4811 2 роки тому

    well explained thanks

    • @GregHogg
      @GregHogg  2 роки тому

      You're very welcome!

  • @nhinvalerabayugao1418
    @nhinvalerabayugao1418 3 місяці тому

    many thanks

    • @GregHogg
      @GregHogg  3 місяці тому

      Many welcomes!

  • @trongnhantran3358
    @trongnhantran3358 Рік тому

    Love it

  • @hosseiniphysics8346
    @hosseiniphysics8346 4 місяці тому

    tnx

  • @mouadet-tali4089
    @mouadet-tali4089 2 роки тому

    thank you so much

    • @GregHogg
      @GregHogg  2 роки тому +1

      You are very welcome!!

  • @imveryhungry112
    @imveryhungry112 Рік тому

    Thank you I wish i was as smart as you ughhh but at least i learned some from this

  • @russelllapua4904
    @russelllapua4904 4 місяці тому

    Wouldn't this be called Transformation techniques of preprocessing instead? I thought Dimensionality Reduction would be separate from Feature Engineering, with Feature Scaling making up the 3rd subtopic. So something like:
    Dimensionality Reduction (removing features)
    PCA
    Clustering
    Feature Engineering (creating/transforming features)
    One-Hot
    Binning
    Feature Scaling (normalisation/standardisation of features)
    Your scaling

  • @alanjohnstone8766
    @alanjohnstone8766 3 місяці тому

    Well explained., but why do you keep turning everything into numpy arrays? I do not think it is necessary.

    • @GregHogg
      @GregHogg  2 місяці тому

      I guess it's just a habit... Probably not necessary yeah

  • @zsnowboardern9332
    @zsnowboardern9332 8 місяців тому

    great video. pkease talk slowe for the beginners

  • @adilbhatti6299
    @adilbhatti6299 Рік тому

    Wish I could give more likes!

  • @beauclark2199
    @beauclark2199 2 роки тому

    Great vid, but slow down your speaking so lesser experienced people can follow along!

    • @GregHogg
      @GregHogg  2 роки тому

      Thank you. If it is too fast, there is a slowdown option :)

    • @beauclark2199
      @beauclark2199 2 роки тому

      @@GregHogg Yes, i can absolutely slow down the speed of the tutorial. Have a great day

    • @GregHogg
      @GregHogg  2 роки тому

      @@beauclark2199 great! You too!!

  • @spider279
    @spider279 5 місяців тому

    Too long and boring this guy is not straighforward

    • @GregHogg
      @GregHogg  5 місяців тому

      Ikr Screw this guy