Training Data Vs Test Data Vs Validation Data| Krish Naik

Поділитися
Вставка
  • Опубліковано 25 вер 2024
  • github for materials and notes: github.com/kri...
    Training set: A set of examples used for learning, that is to fit the parameters of the classifier.
    Validation set: A set of examples used to tune the parameters of a classifier, for example to choose the number of hidden units in a neural network.
    Test set: A set of examples used only to assess the performance of a fully-specified classifier.
    Subscribe @Krish Naik Channel For Data Science Videos In English.
    channel link: bit.ly/3aeve4r
    ML playlist in hindi: bit.ly/3NaEjJX
    Stats Playlist In Hindi:bit.ly/3tw6k7d
    Python Playlist In Hindi:bit.ly/3azScTI
    Now I will be uplaoding videos of Data Science In Hindi.
    #KrishNaik #trainingdata #machinelearning #validationdata
    Connect with me here:
    Twitter: / krishnaik06
    Facebook: / krishnaik06
    instagram: / krishnaik06

КОМЕНТАРІ • 43

  • @netviz8673
    @netviz8673 Місяць тому +3

    when data set is given to create a model then train dataset is on which the model is trained and it usually uses 70-80% of the data set. Test data uses 20% of the data set and is used for model performance analysis, prediction is made and performance metrics is checked.
    Now validation data is used for model hyperparameter tuning. We take the data slightly from the training data set. Using cross validation the validation data is obtained from training data set. CV=5 means 5 experiments. If total training data is 8000 then 8000/5 = 1600 means for every cross validation the starting 1600 data points will be used as validation data and the rest of 6400 data points will be used as train data and the accuracy is obtained for the first experiment. In second experiment when cross validation is 2 then then next 1600 data points will be used as validation data and first 1600data points + next 4800 data points = 6400 data points will be used for train data set and so on. So with every exp the validation data and train data set vary a little.

  • @hunterworrier6769
    @hunterworrier6769 2 роки тому +6

    Krish bro... U should provide simple code of this all video too

  • @zaafirc369
    @zaafirc369 2 роки тому +6

    Thank you
    Clearly explained as always !
    Is it possible to implement an example of cross validation and hyper parameter tuning on python pls? 🙏

  • @MuhammadJunaid-yr8jd
    @MuhammadJunaid-yr8jd 4 місяці тому

    Perfectly explained sir understood very well
    Thanks a lot sir

  • @sukruthraghavendra7305
    @sukruthraghavendra7305 10 місяців тому +1

    Perfectly explained sir understood very well

  • @anjalipes
    @anjalipes Місяць тому

    Thank you for the lovely explanation sir.
    I have a doubt in the validation data, once you have taken the mean or Avg of the CV, what do we do next with that mean? how does it help in the performance of your model? Does this impact testing data?

  • @sketchytv1321
    @sketchytv1321 2 роки тому +1

    خوش رہو بہت اعلیٰ کام کر رہے ہیں ۔

  • @riya-tb6sj
    @riya-tb6sj Рік тому

    Hello sir i m 12 Ai frst tym i watched ur video and frst tym understood concept plz make more videos of 12 syllabus cbse

  • @milansood8240
    @milansood8240 Рік тому

    thank you sir,
    I'm enrolled in upgrad Data Science program and not satisfied with it as the content is not much special like yours.

  • @iamalien8327
    @iamalien8327 3 місяці тому

    sir thanks for creating the videos
    sir kindly guide which book to refer for making the notes

  • @murthy33
    @murthy33 Рік тому

    Please upload English videos. It is very helpful

  • @bivasbisht1244
    @bivasbisht1244 2 роки тому +1

    Why hyperparameter tuning is done after training the model ? Shouldn’t it be done before traning the model , and then train the model based on those parameters ?

    • @fran9426
      @fran9426 Рік тому

      You do both. Once you have done hyperparameter tuning you can then train the final model with the tunes values on all the training data

  • @musicallystatus5368
    @musicallystatus5368 2 роки тому

    @9:52 why you divide for cross velidation= 5

  • @muazamazam7115
    @muazamazam7115 Рік тому

    Thank you for making this piece of sweet pie ❤️🌹

  • @mahantachauhan4461
    @mahantachauhan4461 10 місяців тому

    Thanks for the such an informative video 👏👏

  • @123vijit
    @123vijit 2 роки тому +1

    What is hyperparameter?

  • @richadhiman3300
    @richadhiman3300 Рік тому

    Beautifully explained sir...

  • @jyotikapatil2354
    @jyotikapatil2354 Рік тому

    This was cross-validation. How hyperparameter tuning is done?

  • @zaafirc369
    @zaafirc369 2 роки тому +1

    ROC AUC explanation also pls

  • @omdodmani3205
    @omdodmani3205 Рік тому

    if test data is used for accuracy , then which accuracy is shown by the cross validation method ?

  • @rds9815
    @rds9815 6 місяців тому

    Hi krish
    Please let me know what is x_train ,y_train and x_test ,y_test and how this process works on the data set including the mathematical equation I have a lot of confusion on this. Please make a separate video on this Both languages of English and Hindi

    • @sumitlasiwa7152
      @sumitlasiwa7152 2 дні тому +1

      train and test data is further divided into independent and dependent features. x-train is for independent and y-train for dependent of train dataset. This is done to let model know the difference between independent and dependent features. Similarly, same thing for test dataset. In test dataset this is done to compare the model prediction and actual value(y-test) to know the accuracy.

    • @rds9815
      @rds9815 23 години тому

      @@sumitlasiwa7152 Thank you doubt cleared

  • @jaiprakashsingh756
    @jaiprakashsingh756 Рік тому

    sir, could you please share some notes so that we can revice all the bulletin points.

  • @divyadewangan6123
    @divyadewangan6123 2 роки тому +1

    you are the best person to explain any topic .Thanx🥰

  • @theunscripted643
    @theunscripted643 Місяць тому

    Ur so amazing sir ❤

  • @neal_indian555
    @neal_indian555 Рік тому

    Sir you are great Akshay Kumar is not Boss You are the BOSS

  • @AlAmin-xy5ff
    @AlAmin-xy5ff 2 роки тому +1

    excellent

  • @rohitjana7461
    @rohitjana7461 2 місяці тому

    Thank you sir ☺

  • @akshaymondal1372
    @akshaymondal1372 2 роки тому

    Sir also make a video on data warehouse data lakes

  • @maths_impact
    @maths_impact Рік тому

    Outstanding sir

  • @vasimshaikh9857
    @vasimshaikh9857 Рік тому

    Thank you sir 💖💖💖💖💖

  • @PS-rh8vy
    @PS-rh8vy Рік тому

    Superb ❤

  • @vaibhvkushwah
    @vaibhvkushwah 4 місяці тому

    kafi kya bahut jyda acha tha

  • @hunterworrier6769
    @hunterworrier6769 2 роки тому

    But very helpful

  • @neal_indian555
    @neal_indian555 Рік тому

    Hukum ka ekka BOSS

  • @swagatsanketpriyadarsan6231
    @swagatsanketpriyadarsan6231 9 місяців тому

    I just gave the 1000th like. :P

  • @abhinavrajsaxena789
    @abhinavrajsaxena789 11 місяців тому

    6400

  • @AdityaSharma-yc8yt
    @AdityaSharma-yc8yt 9 місяців тому

    Didn't liked it !

  • @vibhu613
    @vibhu613 Рік тому

  • @TheNeu2ron
    @TheNeu2ron 6 місяців тому