Logistic Regression for Classification | Working with a real-world dataset from Kaggle

Поділитися
Вставка
  • Опубліковано 14 жов 2024

КОМЕНТАРІ • 75

  • @anuragthakur5787
    @anuragthakur5787 3 роки тому +7

    That was intense!!!
    This is probably the first time I have watched a tutorial this long without any break
    You are Awesome sir

  • @kizzavincent
    @kizzavincent 3 роки тому +26

    Thanks a lot Aakash for the fabulous explanations and infectious passion to empower others. These tutorials are simply unmatched! Bravo!

    • @jovianhq
      @jovianhq  3 роки тому +2

      Thanks for the feedback, help us spread the word :)

    • @bongogappo38
      @bongogappo38 Рік тому

      @@jovianhq sir what can we do if there is a column of string type values like disease name and symptoms

  • @SillyLittleMe
    @SillyLittleMe 2 роки тому +1

    This video is still one of the best. A literal game changer!

  • @TheAnugupta
    @TheAnugupta 3 роки тому +1

    Nicely explained Akash and Jovian Team..this was probably the most thorough and clearly explained tutorial I came across

  • @hemangdhanani9434
    @hemangdhanani9434 3 роки тому +4

    great explanation with reasonable depth for this topic, such a great video...

    • @jovianhq
      @jovianhq  3 роки тому

      Thanks for the feedback, help us spread the word :)

  • @rlm3574
    @rlm3574 3 роки тому +5

    Really, a lecture full of knowledge

    • @jovianhq
      @jovianhq  3 роки тому

      Thanks for the feedback, help us spread the word :)

  • @gurjeet333
    @gurjeet333 3 роки тому +4

    Nice Video....Really appreciated. Can we also include the topic of setting up data pre processing pipelines in future sessions.

    • @jovianhq
      @jovianhq  3 роки тому

      Thanks for the feedback and suggestion!

  • @sahilmalhotra7295
    @sahilmalhotra7295 Рік тому

    Thank you, this was very beginner friendly and it helped me understand a lot of practical topics.

    • @jovianhq
      @jovianhq  Рік тому

      You're very welcome! Glad it was helpful.

  • @ektakumari4496
    @ektakumari4496 2 роки тому +1

    Great content Aakash sir , that too free...really amazed and impressed by jovian !

    • @jovianhq
      @jovianhq  2 роки тому +1

      Glad you liked it!

  • @mayankraj4763
    @mayankraj4763 5 місяців тому

    Hello. I have a question. Should we scale the features after the imputation or before because here you imputed the raw_df dataframe which is not imputed? Thanks

  • @georgevavolil7005
    @georgevavolil7005 2 роки тому +1

    I have a doubt. When we do imputation, we take mean to replace the missing values. We take the mean from each columns of the entire data.
    The mean of data in each columns of the entire data should be different from means taken from train_df, val_df and test_df separately. It will create some discrepancy in the final result. What's your position on this ? Whether we should conduct imputation based on the entire dataframe or from its subsets

    • @jovianhq
      @jovianhq  2 роки тому +1

      A sample of the data should represent the entire dataset. Also, the validation, and training set should be independent of the training set. So imputation can be done differently in validation set and training set.

  • @tapomayeebasu3047
    @tapomayeebasu3047 2 роки тому

    Thank you for such a detailed lecture. Very very helpful. Would love to know about more.

    • @jovianhq
      @jovianhq  2 роки тому

      Glad it was helpful! Go to zerotogbms.com for more lectures on Machine Learning

  • @mdalamgirhossain6192
    @mdalamgirhossain6192 2 роки тому +1

    Salute Boss. This is wholesome 💝💝

  • @parastooaghr
    @parastooaghr 3 роки тому +1

    Great video! I learned a lot! Thank you!

  • @lion87563
    @lion87563 2 роки тому +1

    So higher the weight more important column is (but only if numerical columns are scaled)? If data is not scaled we cannot derive this conclusion?

    • @jovianhq
      @jovianhq  2 роки тому

      True! Also, not just higher, the more negative the weight the more important it has i.e. The weight that are closest to 0 have minimum importance

  • @UsmanKhan-tc4sk
    @UsmanKhan-tc4sk 2 роки тому

    I was working on a mini data science project in which test.csv and train.csv datasets given to me. I trained my model using training data. Now if i want to find accuracy score of my model on testing data what i will do? If i write model.predict(test_data) then how i will compare the predicted tesing values to the true values? Because there is no target values in the testing dataset

  • @foodforthought8415
    @foodforthought8415 3 роки тому +1

    Very good tutorial.elaborate and detailed .thanks

    • @jovianhq
      @jovianhq  3 роки тому

      Thanks for the feedback, help us spread the word :)

  • @siddharthsahu5048
    @siddharthsahu5048 2 роки тому

    (1:53:40) when you plot the weights the negative weight would not be considered.
    And the negative weights also affect the model just in opposite direction.
    What are your thoughts should the negative weights be considered??

    • @jovianhq
      @jovianhq  2 роки тому

      Yes, the negative weights should be considered. In fact, you can try and ignore the columns which has very less weights i.e. whose weights are closer to 0. Both negative and positive weights effect the model in some way.

  • @anuphp3432
    @anuphp3432 2 роки тому

    hey, also isn't it a common practice to scale the test data that is transform the test data or validation data by fitting it only on training datasets?

  • @asifsaad5827
    @asifsaad5827 3 роки тому +1

    would you mind switching to dark mode?
    TIA

  • @thakurprathiksinghrajput7135
    @thakurprathiksinghrajput7135 7 місяців тому

    1:45:00 whilst you fitted the transformed cols in to your model, I am still getting a type error

  • @sandipansarkar9211
    @sandipansarkar9211 2 роки тому +1

    finished watching

  • @datahistory2411
    @datahistory2411 Рік тому

    thnks sir...but how to deploy on the website?

  • @sarimahsan6341
    @sarimahsan6341 7 місяців тому +1

    At 1:35:35 ,encoder transform, i am getting an error that columns must be the same as length key.please tell me how to reolve it

    • @andme-tech102
      @andme-tech102 Місяць тому

      Facing the same problem too

    • @praveenranolia4952
      @praveenranolia4952 5 днів тому

      @@andme-tech102 try to use the "sparse_output =False" in OneHotEncoder function

  • @rlm3574
    @rlm3574 3 роки тому +2

    3 hrs worth watching

    • @jovianhq
      @jovianhq  3 роки тому

      Thanks for watching!

  • @truptpatel2597
    @truptpatel2597 2 роки тому

    Information Leakage
    timestamp: 1:25:10 , He fitted the scaler on the whole numerical dataset and transform it to train, validation and test sets. But isn't it the Information leakage because the scaler knew the test
    or validation while fitting?

    • @jovianhq
      @jovianhq  2 роки тому +1

      Well, if you have access to the validation dataset, you can do scaling on the training and validation both. Generally, you won't be able to touch the test dataset so we shouldn't fit scaler/encoder on the test dataset.

  • @dataninjaa
    @dataninjaa Рік тому

    Thanks a Lot Bro its nice dataset and you covered very nice from start to end

  • @anuphp3432
    @anuphp3432 2 роки тому

    excellent brother!

  • @danielm5729
    @danielm5729 2 роки тому

    Thank you very much.🙏

  • @harshvardhansalve8537
    @harshvardhansalve8537 7 місяців тому

    Nice lecture

  • @NehaSingh-fb8kj
    @NehaSingh-fb8kj 7 місяців тому

    Great content

  • @krupamehta8705
    @krupamehta8705 2 роки тому

    1:26:54 can't understand why is max value in some columns not 1, it should be 1....

  • @shantanusingh2198
    @shantanusingh2198 2 роки тому +1

    Thank you🙂

  • @sandipansarkar9211
    @sandipansarkar9211 2 роки тому +1

    FINISHED CODING FULL

  • @rabbitazteca23
    @rabbitazteca23 2 роки тому

    Hi I noticed that in 1:53:44 you are making a prediction using the train inputs (X_train).... but shouldn't' t you be making a prediction using the validation inputs instead? I don't think you have passed the X_val into any of the logistic regression model prediction.... or am I just confuse ? HAAHHA.

    • @jovianhq
      @jovianhq  2 роки тому

      Please check ua-cam.com/video/sjIzfC4AOI0/v-deo.html, at first we're predicting with the train set, later we are also predicting with the validation and test sets.

    • @rabbitazteca23
      @rabbitazteca23 2 роки тому

      @@jovianhq I am sorry ahahah. you are right. I must have missed this part.

  • @sharkk2979
    @sharkk2979 2 роки тому

    thanks u so good! thanks again

  • @pallapothubhargavramfromib2244
    @pallapothubhargavramfromib2244 3 роки тому

    Sir do you continue this videos

    • @shreeyansjavangula8908
      @shreeyansjavangula8908 3 роки тому

      Yeah, this is a course on ml. The new videos structure is provided on his website. jovian.ai/learn/machine-learning-with-python-zero-to-gbms

  • @白馬義從寶葵
    @白馬義從寶葵 Рік тому

    amazing

  • @kmishy
    @kmishy Рік тому

    bookmark 1:03:15 .. for me imp part start here

  • @fet_hsc2300
    @fet_hsc2300 3 роки тому +1

    0:58:00

  • @arjunbhandari5554
    @arjunbhandari5554 7 місяців тому

    1:39:11

  • @debojitmandal8670
    @debojitmandal8670 3 роки тому

    What's a solver

    • @jovianhq
      @jovianhq  3 роки тому

      Hey, please go through the blog to know more about solvers. -> towardsdatascience.com/dont-sweat-the-solver-stuff-aea7cddc3451

  • @rubayetalam8759
    @rubayetalam8759 3 роки тому +1

    Please add subtitles.

    • @jovianhq
      @jovianhq  3 роки тому +1

      Hey we are in the process of adding subtitles to videos, it will be added soon. Thanks!

    • @rubayetalam8759
      @rubayetalam8759 3 роки тому

      @@jovianhq thanks! you are doing great!

  • @fet_hsc2300
    @fet_hsc2300 3 роки тому

    1:00:55

  • @neurax6688
    @neurax6688 2 роки тому

    waoo

  • @adityabenere6004
    @adityabenere6004 3 роки тому

    1 ;56;49 nicee

  • @imaksinsights7202
    @imaksinsights7202 2 роки тому

    Here is another simplified Logistic Regression tutorial if you are a beginner: ua-cam.com/video/tcjR8JYSb9E/v-deo.html

  • @yskasells4014
    @yskasells4014 2 роки тому

    1:18:01

  • @fet_hsc2300
    @fet_hsc2300 3 роки тому

    1:06:09

  • @fet_hsc2300
    @fet_hsc2300 3 роки тому

    1:08:30