Multinomial Logistic Regression with R: Categorical Response Variable at Three Levels

  • Published 25 Nov 2024

COMMENTS • 187

  • @seeonie
    @seeonie 2 years ago +5

    This is excellent. It simplified a complex process for me. And to cap it, the equation derivation aspect was well-explained. You deserve huge kudos.

    • @bkrai
      @bkrai 2 years ago

      Thanks for the comments!

  • @Abouelela1
    @Abouelela1 4 years ago +1

    I watch a lot of videos and read a lot of books and online articles, but your videos are the best on the internet.

    • @bkrai
      @bkrai 4 years ago

      Thanks for your comments!

  • @samarthshrivastava2258
    @samarthshrivastava2258 4 years ago +1

    You are doing an amazing job by uploading videos on techniques that can be applied in R. I have watched 7-8 of your videos and I am feeling quite confident about executing these techniques. Kudos to your channel ✨🥂

    • @bkrai
      @bkrai 4 years ago

      Thanks a ton!

  • @flamboyantperson5936
    @flamboyantperson5936 6 years ago +1

    This was a really helpful and great tutorial. You have so many awesome videos that I spend my holidays learning from them. You have enlightened me through your teachings. Thank you so much, Sir. Remember, our blessings are always with you, and may you always be happy in your life.

    • @bkrai
      @bkrai 6 years ago

      Thanks for feedback!

  • @2009saurabhrishu
    @2009saurabhrishu 8 years ago

    Very descriptive video. Each step is explained in detail. Request you to cover other modeling techniques as well. Thanks.

    • @bkrai
      @bkrai 8 years ago

      +saurabh gupta Thanks for your feedback! I'll be adding more techniques from time to time including based on specific requests. Meanwhile here is a playlist with many other methodologies.
      ua-cam.com/play/PL34t5iLfZddv8tJkZboegN6tmyh2-zr_T.html

  • @nasamumusa5044
    @nasamumusa5044 8 years ago

    Your explanation is easy to follow. You've done a great job.
    How I wish I could find a tutorial like this on Ordinal Logistic Regression.
    Thanks!

    • @bkrai
      @bkrai 8 years ago +1

      Thanks for the feedback! Here is the link to Ordinal Logistic Regression:
      ua-cam.com/video/qkivJzjyHoA/v-deo.html

  • @parthdave2501
    @parthdave2501 8 years ago

    Bro, you are great... my whole assignment is done.

    • @bkrai
      @bkrai 8 years ago

      +Parth Dave 👍

    • @parthdave2501
      @parthdave2501 8 years ago

      Hey, can you please tell me how to find the beta coefficients for each of the 3 levels? Here it took level 1 as the reference and found coefficient values relative to it. What if I need the actual values of the betas at all 3 levels? If I take the 2nd class as the reference, the betas change. Any help is much appreciated.

  • @kalyanasundaramsp8267
    @kalyanasundaramsp8267 6 years ago +1

    I salute you for sharing the knowledge; you are a great human being. Please continue to share.

    • @bkrai
      @bkrai 6 years ago

      Thanks for your comments!

  • @LoizidesGeorge
    @LoizidesGeorge 5 years ago +2

    Superb, excellent presentation thanks!
    Γ
    [Dr. Bharatendra, whenever you are in Marathasa mountains, Cyprus contact me! I owe you many beers and meals for the time you saved me]

    • @bkrai
      @bkrai 5 years ago +1

      Thanks for feedback and comments!

  • @dragon171998
    @dragon171998 4 years ago +1

    this video was a life saver for my project thank you!

    • @bkrai
      @bkrai 4 years ago

      You're welcome!

  • @chrishanni2779
    @chrishanni2779 5 years ago +1

    I have followed many of your videos, thank you!

    • @bkrai
      @bkrai 5 years ago

      Thanks for comments!

  • @federicopascucci9402
    @federicopascucci9402 7 years ago

    You have excellent didactic skills. Nice tutorial, well done!

    • @bkrai
      @bkrai 7 years ago

      +Federico Pascucci Thanks 👍

  • @samanabraham7379
    @samanabraham7379 4 years ago +1

    Great explanation. Very clear and helpful.

    • @bkrai
      @bkrai 4 years ago +1

      Thanks for comments!

    • @bkrai
      @bkrai 4 years ago +1

      Here is a link to more recent one:
      ua-cam.com/play/PL34t5iLfZddvv-L5iFFpd_P1jy_7ElWMG.html

    • @samanabraham7379
      @samanabraham7379 4 years ago +1

      @@bkrai Thank you. I am sure they will be very helpful.

    • @bkrai
      @bkrai 4 years ago

      Welcome!

  • @vijaymore1239
    @vijaymore1239 8 years ago

    Thank you so much, this really helped! You explained it very well. One suggestion: adding a few graphs along with the ROC curve would be helpful. Thanks again!

    • @bkrai
      @bkrai 8 years ago

      +Vijay More Thanks for the feedback 👍

    • @bkrai
      @bkrai 7 years ago

      You can now find the ROC curve video here:
      ua-cam.com/video/ypO1DPEKYFo/v-deo.html

  • @michellelu7336
    @michellelu7336 7 years ago +2

    Thank you so much for this! You just saved my life!

  • @tanvirrajput3906
    @tanvirrajput3906 5 years ago +1

    Sir, thank you very much for your efforts

    • @bkrai
      @bkrai 5 years ago

      Thanks for comments !

  • @jformoso615
    @jformoso615 6 years ago +1

    Very clear! Thanks

  • @devawratvidhate9093
    @devawratvidhate9093 6 years ago +3

    Thank you for a simple, detailed lecture that is always to the point.
    Sir, do you have any interview preparation videos for Data Analyst roles?

    • @bkrai
      @bkrai 6 years ago +2

      Following link has several short videos on various topics related to data analyst jobs. It can help with interview preparation.
      ua-cam.com/play/PL34t5iLfZddv9M5HQNbJkB39Gx7t83rft.html

    • @devawratvidhate9093
      @devawratvidhate9093 6 years ago +1

      @@bkrai Thank you so much

  • @new.challenges
    @new.challenges 4 years ago +1

    Dr. Bharatendra, is it possible to apply the k fold cross-validation method to the regression model, where some of the independent variables are categorical?

    • @bkrai
      @bkrai 4 years ago

      Independent variables being categorical should not matter.

  • @shivamg1000
    @shivamg1000 6 years ago +1

    very nicely explained...

    • @bkrai
      @bkrai 6 years ago

      thanks!

  • @santoshbala9690
    @santoshbala9690 4 years ago +2

    Hi Dr. Rai,
    We are not getting the coefficient of "1", the reference level. How do we get that? It is essential that we get the interpretation for that level.

    • @bkrai
      @bkrai 4 years ago

      '1' is used as reference level and the 2 equations are relative to that. So '1' is baked into two equations that we get.

  • @amanmehta6562
    @amanmehta6562 7 years ago +1

    Hello, how do I do this if the independent variables are also factors with more than 2 unordered categories (e.g. work experience: 0-6 months, 7-12 months, 12-24 months, 24+ months; or educational background: commerce, science, arts, others)?

    • @bkrai
      @bkrai 7 years ago

      Once you specify that the variable is a factor, R will handle it automatically.

  • @micpax1
    @micpax1 8 years ago +2

    Very good video, thank you

    • @bkrai
      @bkrai 8 years ago

      +micpax1 thanks for the feedback!

  • @cmsaboridoify
    @cmsaboridoify 8 years ago

    Thank you! Very simple and useful!!!

    • @bkrai
      @bkrai 8 years ago

      +Carlos Martin 👍

  • @tewodrosgebregergs8230
    @tewodrosgebregergs8230 8 years ago +2

    This is a really helpful video, thank you.

    • @bkrai
      @bkrai 8 years ago

      +Tewodros Gebregergs Good to hear that!

  • @roopashastri9908
    @roopashastri9908 7 years ago +1

    Awesome explanation of the two-tailed z-test! Could you please elaborate on the AIC metric?

    • @bkrai
      @bkrai 7 years ago

      You can get some info on AIC from this document:
      avesbiodiv.mncn.csic.es/estadistica/ejemploaic.pdf

  • @sunilbobb
    @sunilbobb 6 years ago +1

    really awesome sir

    • @bkrai
      @bkrai 6 years ago

      Thanks!

  • @TheVectorzer0
    @TheVectorzer0 8 years ago +1

    thanks it helped me a lot!

    • @bkrai
      @bkrai 8 years ago

      +vectorzer0 good to hear that!

  • @shuxrat2600
    @shuxrat2600 4 years ago +1

    Big thanks for this tutorial Dr. Bharatendra! When I run confusion matrix in my data: cm

    • @bkrai
      @bkrai 4 years ago +1

      You have 'ske' somewhere in your code that it cannot find.

    • @shuxrat2600
      @shuxrat2600 4 years ago +1

      @@bkrai Thank you for the response. Yes, I have ske as an independent variable in my code and data. But why can't it be found, do you have any idea?

    • @bkrai
      @bkrai 4 years ago

      Check if Canopy_structure_merged$out should be Canopy_structure_merged$ske

  • @ben88hunt
    @ben88hunt 10 months ago +1

    Thank you

    • @bkrai
      @bkrai 10 months ago

      You're welcome!

    • @ben88hunt
      @ben88hunt 10 months ago

      Thanks again, the video is very helpful. I am just wondering if you know whether it is possible to obtain a p value to indicate significant overall model fit, chi square value, and a KMO/ Bartlett's test of sphericity using the "multinom" function or can this only be done when the outcome has 2 levels? @@bkrai

  • @hellpumpking
    @hellpumpking 7 years ago

    Do you have an example of this model with random effects?
    Many thanks.

  • @guerschommugisho5569
    @guerschommugisho5569 4 years ago +1

    Nice video

    • @bkrai
      @bkrai 4 years ago

      Thanks for comments!

  • @annazaiceva5861
    @annazaiceva5861 3 years ago +1

    Is there a video on how to calculate those three probabilities?

    • @bkrai
      @bkrai 3 years ago

      See the video at the 4:15 mark. It shows the related equations.
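
The calculation behind those equations can be sketched in R roughly as follows. This is a minimal illustration on simulated data rather than the video's dataset: the data frame `d`, the predictors `x1` and `x2`, and the simulation coefficients are all assumptions. With level 1 as the reference, each fitted row gives the log-odds of level 2 or level 3 against level 1, and the three probabilities follow from those two linear predictors.

```r
library(nnet)  # provides multinom()

# Simulated stand-in data (an assumption; not the video's dataset)
set.seed(1)
n  <- 300
x1 <- rnorm(n)
x2 <- rnorm(n)
e2 <- exp(0.5 + x1)    # exp of the true level-2 linear predictor
e3 <- exp(-0.5 + x2)   # exp of the true level-3 linear predictor
p  <- cbind(1, e2, e3) / (1 + e2 + e3)
y  <- factor(apply(p, 1, function(pr) sample(1:3, 1, prob = pr)))
d  <- data.frame(y, x1, x2)

fit <- multinom(y ~ x1 + x2, data = d, trace = FALSE)

# coef() has one row per non-reference level (here "2" and "3")
B   <- coef(fit)
X   <- cbind(1, d$x1, d$x2)     # intercept column plus predictors
eta <- X %*% t(B)               # n x 2 matrix of log-odds versus level 1
den <- 1 + rowSums(exp(eta))
p_manual <- cbind(1 / den, exp(eta) / den)   # P(1), P(2), P(3)

# Compare the hand-computed probabilities with predict()
max_diff <- max(abs(p_manual - predict(fit, d, type = "probs")))
max_diff
```

If the manual calculation matches the model, `max_diff` should be close to zero, which is one way to see that the reference level is already contained in the two fitted equations.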

  • @marinarondon8056
    @marinarondon8056 4 years ago +1

    Dr. Bharatendra, I have a categorical dependent variable (3 levels) and two categorical independent variables (14 levels and 4 levels). Can I analyze them the same way?
    I noticed that I will not have estimates for all levels, since one level of each independent variable is used as a reference.

    • @bkrai
      @bkrai 4 years ago +1

      Yes, it will work fine. You may also try this link:
      ua-cam.com/play/PL34t5iLfZddvv-L5iFFpd_P1jy_7ElWMG.html

    • @marinarondon8056
      @marinarondon8056 4 years ago +1

      @@bkrai thank you!!! Great explanations!

    • @bkrai
      @bkrai 4 years ago

      Thanks for comments!

  • @OwolabiEbenezerselectedsongs
    @OwolabiEbenezerselectedsongs 6 years ago +1

    I can't download the dataset. The link only covers the code.

    • @bkrai
      @bkrai 6 years ago

      Here is the link:
      drive.google.com/open?id=0B5W8CO0Gb2GGMzJGbkdGUGREYjA

  • @secilgulbudak1277
    @secilgulbudak1277 8 years ago +6

    Thank you very much for this very good explanation; I learned the multinom function just by watching your video.
    I want to ask a question about logistic regression in R. Which link function (logit, normit (probit), gompit (complementary log-log)) is used by the multinom function in R? I guess it is the logit function, but I am not sure. Can we use the other link functions? Can we specify the link function when calling multinom in R?

    • @RL-vu8jt
      @RL-vu8jt 6 years ago +1

      I'm sure it's the logit link function.

    • @bkrai
      @bkrai 3 years ago +1

      Saw this today. Hope it is resolved by now.

  • @nassermohammed5844
    @nassermohammed5844 5 years ago +1

    Your explanations are very helpful, thank you for posting it! I do have a question though: for the error table at 3:18, why does the function stop iterating after 3 iterations? Also, what is the error being measured?

    • @bkrai
      @bkrai 4 years ago +1

      It reports the negative log-likelihood, and the iterations stop when the change between successive iterations becomes too small.

  • @hanivlog774
    @hanivlog774 4 years ago +1

    Thanks for the information. You have used three class labels, but the two-tailed z-test table shows values for only two class labels, i.e. 2 and 3. Please explain this. Thanks again.

    • @bkrai
      @bkrai 4 years ago +1

      With 3 classes we will obtain two equations and level-1 is being used as a reference level.

  • @takheru
    @takheru 5 years ago +1

    Hi sir, may I know where I can get the appropriate citation for this particular script?

    • @bkrai
      @bkrai 5 years ago

      You should be able to google it.

  • @rohittuteja4489
    @rohittuteja4489 5 years ago +1

    Is there any method to choose significant variables, or do we have to take all variables in the model?

    • @bkrai
      @bkrai 5 years ago

      Variables that are not statistically significant can be dropped.

  • @ashokchaudhary8204
    @ashokchaudhary8204 7 years ago +1

    I appreciate your tutorial. Could you add some other videos?

    • @bkrai
      @bkrai 7 years ago

      You can find all my videos here:
      ua-cam.com/users/westlandindiafeatured
      You can also subscribe to the channel to get an update when a new lecture video is posted.

  • @rajkothari6007
    @rajkothari6007 6 years ago +1

    Nice. How did you select the 3 variables initially from the dataset?

    • @bkrai
      @bkrai 6 years ago

      I used the first three variables to keep it simple enough for writing the probability equations. For developing a complete model, you must try all variables.

  • @Monkeyhandle101
    @Monkeyhandle101 7 years ago +2

    Thank you for the video, very helpful. I do have one question and I'm hoping you can answer it. In the predict part of the video, you use predict(my_model, mydata). What if I wanted to test on new data points? Would that be predict(my_model, testdata)?
    Bharatendra Rai

    • @bkrai
      @bkrai 7 years ago

      That's correct!

    • @ashrafuzzamanshahriar7701
      @ashrafuzzamanshahriar7701 6 years ago

      But in the test set we always omit the classification variable. So if I omit the classification variable, it shows an error :(

  • @themillitarycampaignsofpes3311
    @themillitarycampaignsofpes3311 4 years ago +1

    Excellent video! Just one query. If I want to test mymodel on a different and smaller data set, how can I do that?

    • @bkrai
      @bkrai 4 years ago

      You can use it the way we do it after partitioning the data set. Here is an example:
      ua-cam.com/video/aS1O8EiGLdg/v-deo.html

    • @themillitarycampaignsofpes3311
      @themillitarycampaignsofpes3311 4 years ago

      @@bkrai But when I try that, this is the error I get:
      predict

  • @galan8115
    @galan8115 4 years ago +1

    Good evening doctor, thank you again for your tutorials. If we want to drop variables from our multinom model, when can we do that? I have some that are not significant for either of my "binomial within multinomial" models (I have 3 classes, so I have 2 contrasts). Can I eliminate them from my model since they "do not contribute significantly"? And if so, can I also do that with variables that contribute to only one of my models?

    • @bkrai
      @bkrai 4 years ago +1

      You can remove variables by putting a minus sign before them in the model formula. You may also refer to this more recent one:
      ua-cam.com/play/PL34t5iLfZddvv-L5iFFpd_P1jy_7ElWMG.html

    • @galan8115
      @galan8115 4 years ago +1

      @@bkrai Thank you. Yes, I just saw that video :D

    • @bkrai
      @bkrai 4 years ago

      Thanks

  • @giorgiogentile8566
    @giorgiogentile8566 4 years ago +1

    How can we get the coefficients of the reference level?

    • @bkrai
      @bkrai 4 years ago

      It's not required.

  • @yogitasolanki4638
    @yogitasolanki4638 5 years ago

    Sir, please make a video on polynomial regression.

  • @aradhyamudigonda4513
    @aradhyamudigonda4513 3 years ago +1

    Very informative, Sir. Thanks for this. I have a query, though. I had built a classifier with 11 classes and I want to calculate probabilities for the test data in the Excel file as my test data will be coming in from a new survey in excel and I want to run the data through excel, get the probability in real-time. Will that be possible?

    • @bkrai
      @bkrai 3 years ago +1

      Yes, you can do that. Also, it will be better to use csv format.

  • @jenniferamorim9850
    @jenniferamorim9850 6 years ago +1

    Thank you for this amazing explanation.
    What method is used to estimate the parameters in this package?

    • @bkrai
      @bkrai 6 years ago

      It uses multinomial logistic regression method. It could be that I misunderstood your question.

    • @jenniferamorim9850
      @jenniferamorim9850 6 years ago

      @@bkrai In fact, I would like to know whether the function uses the maximum likelihood method to estimate the parameters, because when I used the function from the VGAM package the estimates were different.

  • @ghostshiphost5744
    @ghostshiphost5744 8 years ago

    Is it possible to plot sigmoid curves for the 3 levels in one chart in R? Appreciate your sharing :)

  • @jaysethi3736
    @jaysethi3736 5 years ago

    Hello Sir,
    Your video was beneficial. I am running my model on the training dataset, and right now I am getting an accuracy of 67% with an error of 33%. I wanted to know what I should do to reduce the error. The data was imbalanced, and I balanced it using the ROSE package. I haven't done normalization; should I? I read somewhere that normalization won't affect much. Any help on this would be great.

  • @RafiTAHasani
    @RafiTAHasani 6 years ago

    Sir,
    We are trying to run a multinomial logistic regression with many categorical independent variables, for example age and religion (4 categories) and education (6 categories).
    We are unable to get the coefficients and p-values corresponding to the category of the dependent variable that is kept as the reference level.
    You have replied to queries on this that R will take care of it once we specify the variable as a factor. We have some doubts:
    1. Can we also choose reference levels for the independent variables?
    2. Which coefficients and p-values should be reported for the different categories of the dependent variable?
    3. How do we identify the coefficients and p-values corresponding to the category of the dependent variable that is kept as the reference level?
    4. How can we run a multinomial probit model in R?

  • @soumyendupaul9556
    @soumyendupaul9556 5 years ago +1

    What's the advantage of using a reference level?

    • @bkrai
      @bkrai 5 years ago

      That's needed when you do the probability calculations.

  • @Shinjiandjun-122
    @Shinjiandjun-122 6 years ago +1

    Thank you very much, good explanation. I have a question. In this case LB has a significant p-value for 2 and a non-significant one for 3. So should we remove LB? If a particular variable has a non-significant p-value for 50% or more of the levels, can we remove it? Also, in logistic regression, when we remove a variable we watch the R-squared value; in this case should we monitor the AIC value?

    • @bkrai
      @bkrai 6 years ago

      Since LB is significant for 2, we should keep it in the model. Because if you remove it, it will be missing from both 2 and 3. Also AIC can be used here for making model related decisions. Lower value of AIC indicates better model. It helps with goodness of fit and also helps to avoid model over-fitting.
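
A rough sketch of that AIC comparison, using simulated data in place of the video's dataset (the names `d`, `x1`, `x2`, `full` and `reduced` are hypothetical). In the simulation `x2` only affects level 3, so it mirrors a predictor that matters for one equation but not the other. The minus sign in the formula drops a variable from both equations at once.

```r
library(nnet)  # provides multinom()

# Simulated stand-in data (an assumption; not the video's dataset)
set.seed(1)
n  <- 300
x1 <- rnorm(n)
x2 <- rnorm(n)
e2 <- exp(0.5 + x1)    # x1 drives level 2 only
e3 <- exp(-0.5 + x2)   # x2 drives level 3 only
p  <- cbind(1, e2, e3) / (1 + e2 + e3)
y  <- factor(apply(p, 1, function(pr) sample(1:3, 1, prob = pr)))
d  <- data.frame(y, x1, x2)

full    <- multinom(y ~ x1 + x2, data = d, trace = FALSE)
reduced <- update(full, . ~ . - x2)   # minus sign removes x2 everywhere

# Lower AIC indicates the preferred model
c(full = AIC(full), reduced = AIC(reduced))
```

Here the full model would be expected to have the lower AIC, since removing `x2` also removes it from the level-3 equation where it carries signal.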

  • @balajiks6716
    @balajiks6716 6 years ago +1

    Sir, your videos are awesome and I have a doubt. I am using the neuralnet package for classification. How do I develop a neural network model with multiple inputs and multiple outputs (I mean with different formulae)?

    • @bkrai
      @bkrai 6 years ago

      You can use this latest link for multiple inputs and outputs:
      ua-cam.com/video/hd81EH1g1bE/v-deo.html

  • @abhishekbansal5182
    @abhishekbansal5182 4 years ago +1

    @Dr. Bharatendra Rai Sir, please show the mathematical functions for finding the coefficients.
    Please help.

    • @bkrai
      @bkrai 4 years ago +1

      For that you will have to refer to some book. This video only shows how to do it with R.

    • @abhishekbansal5182
      @abhishekbansal5182 4 years ago

      @@bkrai
      Sir, please suggest a book.
      Please.

  • @amolamahadik1
    @amolamahadik1 2 years ago

    Hello sir, thank you so much for this video. However, my train accuracy is 90% but my test accuracy is only 20%. I think mymodel has overfitted. Can you please tell me how to handle this situation? If you could make a video on cross-validation for multinomial regression, it would be a great help. Thank you in anticipation.

  • @soumyendupaul9556
    @soumyendupaul9556 5 years ago

    Can you explain how you calculated the z-score? I didn't get it.

  • @carolinaquiros4192
    @carolinaquiros4192 5 years ago +1

    Thank you for the explanation, Prof. Rai. Can I just ask how do we choose the reference level? Is it the level with the most frequencies? Many thanks!

    • @bkrai
      @bkrai 5 years ago +1

      It doesn't really matter which category is chosen as reference as far as developing a good classification model is concerned. It can be chosen as per the convenience of interpreting the coefficients.
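
A small sketch of switching the reference level with base R's `relevel()`, again on simulated data (the data frame `d` and the column `y2` are hypothetical names). The coefficient rows change because they are now expressed relative to level 2, while the fitted class probabilities stay essentially the same.

```r
library(nnet)  # provides multinom()

# Simulated stand-in data (an assumption; not the video's dataset)
set.seed(1)
n  <- 300
x1 <- rnorm(n)
x2 <- rnorm(n)
e2 <- exp(0.5 + x1)
e3 <- exp(-0.5 + x2)
p  <- cbind(1, e2, e3) / (1 + e2 + e3)
y  <- factor(apply(p, 1, function(pr) sample(1:3, 1, prob = pr)))
d  <- data.frame(y, x1, x2)

fit1 <- multinom(y ~ x1 + x2, data = d, trace = FALSE)   # reference: level 1

d$y2 <- relevel(d$y, ref = "2")                          # make level 2 the reference
fit2 <- multinom(y2 ~ x1 + x2, data = d, trace = FALSE)

rownames(coef(fit1))   # equations for levels 2 and 3 versus 1
rownames(coef(fit2))   # equations for levels 1 and 3 versus 2

# Fitted probabilities, with columns put in the same order before comparing
lv <- c("1", "2", "3")
max_gap <- max(abs(predict(fit1, type = "probs")[, lv] -
                   predict(fit2, type = "probs")[, lv]))
max_gap
```

Any remaining gap between the two sets of probabilities comes from the numerical optimiser stopping at slightly different points, not from the choice of reference.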

    • @carolinaquiros4192
      @carolinaquiros4192 5 years ago

      @@bkrai Thank you so much!

    • @carlfrio8024
      @carlfrio8024 3 years ago

      For 4 dependent variables, do we also use only 1 reference level? Is the process similar to the one in the video? Thanks.

  • @HamJeong
    @HamJeong 6 years ago

    Thanks for the videos, these are so great and useful!
    A question: is there any difference between the model you specify here and a proportional odds logistic regression? It seems like there is none, since you calculate the probabilities per category at 10:10 but I wasn't sure. Thanks!

  • @YatiChoudhary
    @YatiChoudhary 4 years ago +1

    Sir, I have gone through many videos on youtube regarding the logistic regression. But the way you have explained is commendable. I am in the final stage of my PhD and these days writing a paper for publication, I am struggling to build my logistic regression model, if possible would you kindly guide me? How may I contact you to discuss it? I shall be highly obliged by your early response and guidance.

    • @bkrai
      @bkrai 4 years ago

      Here is my email: seemabharat@gmail.com

    • @YatiChoudhary
      @YatiChoudhary 4 years ago

      @@bkrai Sir, I have emailed you the details from my official email ID under the name Pooja Chhikara (Yati is my pet name, which I use on informal social media platforms). Kindly check and reply.
      Thank you

  • @ranjithnair2659
    @ranjithnair2659 8 years ago

    Very well-crafted and detailed video. I still have a doubt: if I am using 64 variables and 25,000 line items, how do I determine the best variables after running the model? With the LM and GLM commands, summary highlights the best possible variables with ***. What should I do in NNET to find the same?

    • @bkrai
      @bkrai 8 years ago +1

      You can do z-test and look at p-values to see how significant each variable is. This example shows these calculations too.
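
The z-test mentioned here can be sketched as below: divide each coefficient by its standard error and convert the result to a two-tailed p-value. The data are simulated stand-ins and the object names (`d`, `fit`, `z`, `pval`) are assumptions for illustration.

```r
library(nnet)  # provides multinom()

# Simulated stand-in data (an assumption; not the video's dataset)
set.seed(1)
n  <- 300
x1 <- rnorm(n)
x2 <- rnorm(n)
e2 <- exp(0.5 + x1)
e3 <- exp(-0.5 + x2)
p  <- cbind(1, e2, e3) / (1 + e2 + e3)
y  <- factor(apply(p, 1, function(pr) sample(1:3, 1, prob = pr)))
d  <- data.frame(y, x1, x2)

fit <- multinom(y ~ x1 + x2, data = d, trace = FALSE)

# summary() returns matching matrices of coefficients and standard errors
s    <- summary(fit)
z    <- s$coefficients / s$standard.errors
pval <- 2 * (1 - pnorm(abs(z)))   # two-tailed test against a standard normal

round(pval, 4)   # rows: levels 2 and 3; columns: intercept, x1, x2
```

Each row of `pval` belongs to one non-reference level, so a variable can look significant in one row and not in the other.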

  • @basic-statisticsjain6693
    @basic-statisticsjain6693 4 years ago +1

    Does the method work for four levels of outcome?

  • @carlfrio8024
    @carlfrio8024 3 years ago +1

    Sir, is this process same when dealing with 4 dependent variables?

    • @bkrai
      @bkrai 3 years ago

      That's different.

    • @carlfrio8024
      @carlfrio8024 3 years ago

      @@bkrai how is it different sir?

  • @sailingintosunshine
    @sailingintosunshine 8 years ago

    Thank you!

    • @bkrai
      @bkrai 8 years ago

      +Markus Franke 👍

  • @willianleal5495
    @willianleal5495 8 years ago +1

    Very good video!
    I used your example to run a model, but when I run the z-test
    the following error appears in R:
    > z

    • @bkrai
      @bkrai 8 years ago +1

      +Willian Leal I notice 'd' missing in standard towards the end. Also 'erros' should be 'errors'.

    • @willianleal5495
      @willianleal5495 8 years ago

      +Bharatendra Rai Thanks! My mistake.

  • @Protect_the
    @Protect_the 8 years ago

    Very helpful. I have one question however.....How to increase iterations? What if the model is not converged as per the R-output message after running "multinom" function?

    • @bkrai
      @bkrai 8 years ago

      +Chandan Kumar you can get more information about the function using "??multinom". I'm yet to come across a situation where it doesn't converge.

    • @jaycalvert7911
      @jaycalvert7911 8 years ago +1

      +Chandan Kumar You can pass additional arguments through to the nnet function. Set maxit = 1000 for 1000 iterations, and so on.
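
As a sketch of that suggestion: `multinom()` forwards extra arguments to `nnet()`, so `maxit` can be raised directly in the call, and the `convergence` component of the fitted object reports whether the iteration limit was hit. The data and object names below are simulated stand-ins, not the video's.

```r
library(nnet)  # provides multinom()

# Simulated stand-in data (an assumption; not the video's dataset)
set.seed(1)
n  <- 300
x1 <- rnorm(n)
x2 <- rnorm(n)
e2 <- exp(0.5 + x1)
e3 <- exp(-0.5 + x2)
p  <- cbind(1, e2, e3) / (1 + e2 + e3)
y  <- factor(apply(p, 1, function(pr) sample(1:3, 1, prob = pr)))
d  <- data.frame(y, x1, x2)

# maxit is passed on to nnet(); the default limit is 100 iterations
fit <- multinom(y ~ x1 + x2, data = d, maxit = 1000, trace = FALSE)

# 0 means the optimiser stopped before reaching maxit; 1 means the limit was reached
fit$convergence
```

Leaving `trace` at its default prints the iteration log, which ends with "converged" or a note that the iteration limit was reached.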

  • @winnumber101
    @winnumber101 7 years ago

    My R has begun to freeze whenever I run the multinom command. It didn't always act this way, but it hasn't stopped doing this for a while even after re-installing the program.

    • @bkrai
      @bkrai 7 years ago

      You probably have too many things open. Try clearing any open datasets or graphs in the workspace.

  • @swatisrivastava682
    @swatisrivastava682 5 years ago +1

    Is it possible to build this with more than 50 response variables?

    • @bkrai
      @bkrai 5 years ago +1

      That should work fine.

    • @swatisrivastava682
      @swatisrivastava682 5 years ago +1

      @@bkrai Thanks a lot sir !! I will try to implement today and will update whether I was successful or not.

    • @bkrai
      @bkrai 5 years ago

      great!

  • @PramodKumar-cf3tr
    @PramodKumar-cf3tr 8 years ago

    I appreciate your tutorial. I ran the test on my data and, to my surprise, the model predicted 100%. Earlier, when I ran the test using regular logistic regression (i.e. glm), the predictive performance was very poor. Even the decision tree and random forest were not good.
    Could you make a tutorial on data cleaning, especially for huge datasets, or provide any references?

    • @bkrai
      @bkrai 8 years ago

      Thanks for the feedback! I'll keep it in mind when planning next set of tutorials.

  • @lakshmigurram9911
    @lakshmigurram9911 6 years ago

    Can you explain ROC curves for the multinomial case?

    • @bkrai
      @bkrai 6 years ago

      You can only do two classes at a time. For a multi-class situation, you can try one class versus all others.

  • @tay9001
    @tay9001 4 years ago +1

    Sir, in this example why does there not need to be a split of the data into train and test?

    • @bkrai
      @bkrai 4 years ago +1

      You can try this more recent one:
      ua-cam.com/play/PL34t5iLfZddvv-L5iFFpd_P1jy_7ElWMG.html

    • @tay9001
      @tay9001 4 years ago +1

      @@bkrai Thank you! Would it be fine to perform the multinomial logistic regression either way? One way being not dividing the data and the other way to divide the data into train and test?
      Love your videos!

    • @bkrai
      @bkrai 4 years ago +1

      It is always good to partition the data.
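
One way such a partition might look, as a minimal sketch on simulated data: the 70/30 split, the seed, and the names `train`, `test` and `pred` are arbitrary choices for illustration. It also shows that `predict()` only needs the predictor columns of the new data, not the response.

```r
library(nnet)  # provides multinom()

# Simulated stand-in data (an assumption; not the video's dataset)
set.seed(1)
n  <- 300
x1 <- rnorm(n)
x2 <- rnorm(n)
e2 <- exp(0.5 + x1)
e3 <- exp(-0.5 + x2)
p  <- cbind(1, e2, e3) / (1 + e2 + e3)
y  <- factor(apply(p, 1, function(pr) sample(1:3, 1, prob = pr)))
d  <- data.frame(y, x1, x2)

# Random 70/30 split into training and test rows
set.seed(2)
idx   <- sample(seq_len(n), size = round(0.7 * n))
train <- d[idx, ]
test  <- d[-idx, ]

fit <- multinom(y ~ x1 + x2, data = train, trace = FALSE)

# Predict classes for unseen rows; the response column is not required
pred <- predict(fit, newdata = test[, c("x1", "x2")])

# Confusion matrix and accuracy on the held-out data
cm  <- table(Predicted = pred, Actual = test$y)
acc <- sum(diag(cm)) / sum(cm)
cm
acc
```

Comparing `acc` with the same calculation on `train` gives a quick indication of whether the model is overfitting.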

  • @hyunjungariuka1686
    @hyunjungariuka1686 4 years ago +1

    I don't get it: what do the numbers 2 and 3 mean in the regression? I mean 2 0.0000....... and, in the next row, 3 0.728...... What do these numbers 2 and 3 tell us, and why are they needed?

    • @bkrai
      @bkrai 4 years ago

      This link should help to provide more details:
      ua-cam.com/play/PL34t5iLfZddvv-L5iFFpd_P1jy_7ElWMG.html

  • @Monkeyhandle101
    @Monkeyhandle101 7 years ago

    Bharatendra Rai
    Do you have an email I can contact? I have some questions about the data set I'm using. I would love to pick your brain!

  • @abhishekbansal5182
    @abhishekbansal5182 4 years ago

    Sir, please suggest a book through which we can learn the core concepts and mathematics of machine learning and statistics,
    so we can write our own neural network.
    Please help me, sir.

  • @murtazaburhani4022
    @murtazaburhani4022 3 years ago +1

    Can you make a multinomial logistic regression in r from scratch, please🙏

  • @vishnukowndinya
    @vishnukowndinya 7 years ago

    Hi Sir,
    How can we reduce the misclassification error for a model? One way I learned is by adjusting the cutoff value (p = 0.5 or 0.45, ...).
    I have built a logit model, found the appropriate cutoff to be p = 0.46, and used it for prediction instead of p = 0.5.
    (pred >0.46

    • @bkrai
      @bkrai 7 years ago

      When original data has class imbalance, following link has some ideas for improving the prediction model:
      ua-cam.com/video/Ho2Klvzjegg/v-deo.html

    • @vishnukowndinya
      @vishnukowndinya 7 years ago +1

      That's a good explanation. Thank you, sir :)

  • @abhishekbansal5182
    @abhishekbansal5182 4 years ago +1

    Sir, how do we get 2 different equations
    for categories 2 and 3?
    Both categories have different coefficients.
    Please explain, sir.

    • @bkrai
      @bkrai 4 years ago +1

      I have explained that around 4 minutes and 10 seconds

    • @abhishekbansal5182
      @abhishekbansal5182 4 years ago

      Sir, I am actually looking for the mathematical approach to obtaining the coefficients.

    • @abhishekbansal5182
      @abhishekbansal5182 4 years ago

      @@bkrai Sir, please show the mathematical functions for finding the coefficients.
      Please help.

  • @neroksi5595
    @neroksi5595 7 years ago

    This saves my fucking asses ! Thanks dude

  • @Manas2
    @Manas2 7 years ago

    Thanks for the nice explanation. I am getting the following error from the link you shared (dl.dropboxusercontent.com/u/50119580/Logit-binary.R):
    File not found
    Sorry, that file doesn’t live here anymore. It might have been moved or made private

    • @bkrai
      @bkrai 7 years ago +1

      Dropbox has now discontinued the public folder, hence this error. You can now find the files at the following links:
      drive.google.com/open?id=0B5W8CO0Gb2GGN21ab0ZiZmo2UWs
      drive.google.com/open?id=0B5W8CO0Gb2GGMzJGbkdGUGREYjA

    • @Manas2
      @Manas2 7 years ago

      Thank you very much.

  • @sunilbobb
    @sunilbobb 6 years ago +1

    really awesome sir