Training Data Vs Test Data Vs Validation Data| Krish Naik
Вставка
- Опубліковано 25 вер 2024
- github for materials and notes: github.com/kri...
Training set: A set of examples used for learning, that is to fit the parameters of the classifier.
Validation set: A set of examples used to tune the parameters of a classifier, for example to choose the number of hidden units in a neural network.
Test set: A set of examples used only to assess the performance of a fully-specified classifier.
Subscribe @Krish Naik Channel For Data Science Videos In English.
channel link: bit.ly/3aeve4r
ML playlist in hindi: bit.ly/3NaEjJX
Stats Playlist In Hindi:bit.ly/3tw6k7d
Python Playlist In Hindi:bit.ly/3azScTI
Now I will be uplaoding videos of Data Science In Hindi.
#KrishNaik #trainingdata #machinelearning #validationdata
Connect with me here:
Twitter: / krishnaik06
Facebook: / krishnaik06
instagram: / krishnaik06
when data set is given to create a model then train dataset is on which the model is trained and it usually uses 70-80% of the data set. Test data uses 20% of the data set and is used for model performance analysis, prediction is made and performance metrics is checked.
Now validation data is used for model hyperparameter tuning. We take the data slightly from the training data set. Using cross validation the validation data is obtained from training data set. CV=5 means 5 experiments. If total training data is 8000 then 8000/5 = 1600 means for every cross validation the starting 1600 data points will be used as validation data and the rest of 6400 data points will be used as train data and the accuracy is obtained for the first experiment. In second experiment when cross validation is 2 then then next 1600 data points will be used as validation data and first 1600data points + next 4800 data points = 6400 data points will be used for train data set and so on. So with every exp the validation data and train data set vary a little.
Krish bro... U should provide simple code of this all video too
Thank you
Clearly explained as always !
Is it possible to implement an example of cross validation and hyper parameter tuning on python pls? 🙏
Perfectly explained sir understood very well
Thanks a lot sir
Perfectly explained sir understood very well
Thank you for the lovely explanation sir.
I have a doubt in the validation data, once you have taken the mean or Avg of the CV, what do we do next with that mean? how does it help in the performance of your model? Does this impact testing data?
خوش رہو بہت اعلیٰ کام کر رہے ہیں ۔
Hello sir i m 12 Ai frst tym i watched ur video and frst tym understood concept plz make more videos of 12 syllabus cbse
thank you sir,
I'm enrolled in upgrad Data Science program and not satisfied with it as the content is not much special like yours.
sir thanks for creating the videos
sir kindly guide which book to refer for making the notes
Please upload English videos. It is very helpful
Why hyperparameter tuning is done after training the model ? Shouldn’t it be done before traning the model , and then train the model based on those parameters ?
You do both. Once you have done hyperparameter tuning you can then train the final model with the tunes values on all the training data
@9:52 why you divide for cross velidation= 5
Thank you for making this piece of sweet pie ❤️🌹
Thanks for the such an informative video 👏👏
What is hyperparameter?
Beautifully explained sir...
This was cross-validation. How hyperparameter tuning is done?
ROC AUC explanation also pls
if test data is used for accuracy , then which accuracy is shown by the cross validation method ?
Hi krish
Please let me know what is x_train ,y_train and x_test ,y_test and how this process works on the data set including the mathematical equation I have a lot of confusion on this. Please make a separate video on this Both languages of English and Hindi
train and test data is further divided into independent and dependent features. x-train is for independent and y-train for dependent of train dataset. This is done to let model know the difference between independent and dependent features. Similarly, same thing for test dataset. In test dataset this is done to compare the model prediction and actual value(y-test) to know the accuracy.
@@sumitlasiwa7152 Thank you doubt cleared
sir, could you please share some notes so that we can revice all the bulletin points.
you are the best person to explain any topic .Thanx🥰
Ur so amazing sir ❤
Sir you are great Akshay Kumar is not Boss You are the BOSS
excellent
Thank you sir ☺
Sir also make a video on data warehouse data lakes
Outstanding sir
Thank you sir 💖💖💖💖💖
Superb ❤
kafi kya bahut jyda acha tha
But very helpful
Hukum ka ekka BOSS
I just gave the 1000th like. :P
6400
Didn't liked it !