How is data prepared for machine learning?

Поділитися
Вставка
  • Опубліковано 8 лип 2024
  • Data is one of the main factors determining whether machine learning projects will succeed or fail. That's why it is necessary to prepare data in the most digestible form for a future ML model.
    Watch our video to find out more about data preparation for machine learning:
    00:00 Intro
    01:19 Dataset size
    03:07 Dataset quality
    04:35 Labeling
    06:35 Data reduction and cleansing
    09:56 Data wrangling
    12:14 Feature engineering
    Sources:
    [1] www.reuters.com/article/us-am...
    [2] www.sciencedirect.com/science...
    Read more in our article: www.altexsoft.com/blog/datasc...
    Music by Epidemicsound.com
    Learn more about AltexSoft: www.altexsoft.com
    Follow us on LinkedIn: / altexsoft
    Follow us on Facebook: / altexsoft
    Follow us on Twitter: / altexsoft
    Follow us on Instagram: / altexsoftcom
  • Наука та технологія

КОМЕНТАРІ • 24

  • @anirbansarkar6306
    @anirbansarkar6306 2 роки тому +36

    This channel has got one of the best videos and visuals which gives indepth understanding. Thank you so much for such immense hardwork

  • @mukundyadav6913
    @mukundyadav6913 Рік тому +6

    The analogy goes like this: modelling, tuning the hyperparameterss and using different algorithms to get better results is like working out, whereas data is like the food you put in your body. No matter how much you workout, how good and intense and your workout sessions are, if you dont eat good, balanced, high quality food then you are not gonna be healthy and get good results. Similarly, if you dont use good and high quality, accurate data then your model will not produce efficient results(data is literally the food for the model). So, when trying to improve your model's results, always try to improve your data quality first instead of changing your parameters and algorithms!

  • @goldenmilktea4992
    @goldenmilktea4992 2 роки тому +4

    Great content as usual. Now I have a better understanding of what machine learning is. Thank you!

  • @mp9305
    @mp9305 2 роки тому +5

    Excellent production quality as usual. You guys deserve more subs :)

  • @attribute-4677
    @attribute-4677 Рік тому +1

    This video deserves more love. Thank you!!

  • @dhruvgaikwad9088
    @dhruvgaikwad9088 9 місяців тому +1

    This is what I was looking for!!! Best Explanation!! Keep it up.

  • @sakshammishra9232
    @sakshammishra9232 11 місяців тому +1

    Excellent use of graphics....just love it.

  • @brendamartinez6955
    @brendamartinez6955 2 роки тому

    Thank you so much

  • @amitpatil518
    @amitpatil518 2 роки тому

    Very informative video. Covers all the topics.

  • @arnavsinha834
    @arnavsinha834 Місяць тому

    Thanks, learning something new!

  • @victornavorskie
    @victornavorskie 2 роки тому +2

    Just Wow Instantly liked the Video and subscribed to the Channel. Thanks for your hard work

  • @kaasboyzz
    @kaasboyzz 9 місяців тому

    Awesome video. Thanks!

  • @srimant101
    @srimant101 8 місяців тому

    Very useful. Thanks a lot

  • @kennedywee
    @kennedywee 2 роки тому +1

    Great Content!

  • @sidlist
    @sidlist 2 роки тому

    Great Content in very simple language. Can you please make a detailed video on ''Feature Engineering'' - Handcrafted features and Derived features?

  • @st10689
    @st10689 Рік тому

    What a video! Amazing really.

  • @mohdriyazpm
    @mohdriyazpm 4 місяці тому

    Very well explained

  • @philipphortnagl2486
    @philipphortnagl2486 5 місяців тому

    great videos!

  • @ahmedshalaby9343
    @ahmedshalaby9343 Рік тому

    omg i wish if ur my professor in the college

  • @blip666
    @blip666 Рік тому

    it was working fine why did they discontinued it

  • @somarble
    @somarble 4 місяці тому

    wonder why this video was suggested when I searched for "serialized data"

  • @marwanfudo998
    @marwanfudo998 2 роки тому +3

    CHAD AI

  • @Seiven2077
    @Seiven2077 2 місяці тому +2

    AI was correct. there is nothing sexist about being statistically right. Men are statistically better achievers, they work more, prefer more technical majors and manage stress better -> get hired more for such positions because they fit better. If you want to have a balanced race and gender for your data set you will end up with average at best candidates, because some talent would be excluded as a result of Quota sampling. And this is not what you want as a HR. Gender factor is important, that is why you don't hire creepy men to work in kindergarten, nursing or daycare. Stereotypes + less success in this field statistically among men.