Logistic Regression Indepth Maths Intuition In Hindi

Поділитися
Вставка
  • Опубліковано 17 жов 2024
  • ML Materials github : github.com/kri...
    ML Playlist: • Machine Learning Playlist
    ----------------------------------------------------------------------------------------------------------------------------------------------------------------
    Please donate if you want to support the channel through GPay UPID,
    Gpay: krishnaik06@okicici
    -----------------------------------------------------------------------------------------------------------------------
    Recording Gears That I Use
    shorturl.at/wzI68
    ---------------------------------------------------------------------------------------------------------------------------
    #krish #datasciencebusted #ridgeregression
    Connect with me here:
    Twitter: / krishnaik06​
    Facebook: / krishnaik06​
    instagram: / krishnaik06

КОМЕНТАРІ • 68

  • @parultiwari8734
    @parultiwari8734 8 місяців тому +3

    wish i could add more thousands of likes from my side. such a great explanation!!
    Thank you sir!

  • @sairabano7968
    @sairabano7968 2 роки тому +6

    Thanks, Krish for making videos in Hindi. You always make things easy to understand.

  • @hades840
    @hades840 9 місяців тому +2

    23:22 need to keep in mind ? because i am very bad with logs

  • @chaotic_singer13
    @chaotic_singer13 5 місяців тому +1

    The intuition is good but if you can help us with a proper derivation and also about the thought process i.e. how do we thought the way we though. It will be deep!!!

  • @jannatunferdous103
    @jannatunferdous103 Місяць тому

    Pass=1 and Fail=0 till okay, but what is higher than 1? and how study hour can be less than 0? time can not be less than 0.

  • @kmishy
    @kmishy Рік тому +1

    after 1 year, today I understood why do we have log term in cost function of logistic 22:00

  • @SachinSharma-hv3wm
    @SachinSharma-hv3wm 2 роки тому +1

    Thank u so much krish sir for making videos in hindi.....aapka way of explanation bhut easy hota hai...aap complex chizo ko bhi easy bna dete ho😊😊

  • @SachinKumar-zl6ku
    @SachinKumar-zl6ku 2 роки тому +2

    You are doing amazing work man

  • @UmerFarooq-zv1ky
    @UmerFarooq-zv1ky 29 днів тому

    explanation is good.
    But the Explanation of Nitish sir Campusx is another level.

  • @prakharagarwal9448
    @prakharagarwal9448 2 роки тому +1

    Krish, when will next community session start

  • @Deeprfc12346
    @Deeprfc12346 4 місяці тому

    bhai aap ak video. Text Mining and Sentiment Analysis pe bna dijiye.

  • @HAVINOSH
    @HAVINOSH Рік тому +1

    do we need not need to square the last equation ?

  • @himanshugamer1888
    @himanshugamer1888 9 місяців тому

    Superb Explanation Sir ❤❤

  • @shrutisingh9801
    @shrutisingh9801 6 місяців тому

    y=0, y=1, y is predicted value right?

  • @aradhyakanth8409
    @aradhyakanth8409 2 роки тому +1

    ​sir, for classification we have classifier model. so, why logistic Regression

    • @atulkadam6345
      @atulkadam6345 2 роки тому

      You can use any model whichever gives you best performance wrt training and testing data

    • @RonaldoRewind-cr7
      @RonaldoRewind-cr7 2 роки тому +1

      logestic regression is a classification problem its name is regression but actually it is classifier problm

    • @Ram_jagat
      @Ram_jagat Рік тому

      @@RonaldoRewind-cr7 exactly

    • @prabhatupadhyay7526
      @prabhatupadhyay7526 8 місяців тому

      Because in logistic regression we take sigmoid function and sigmoid return data between o to 1.

  • @GhulamMustafaSherazi
    @GhulamMustafaSherazi Рік тому

    Sir I want to ask how z= theta0 + theta1 X1 converted to z = theta tranpose of x. waiting for your reply.

    • @iamravimeena
      @iamravimeena Рік тому

      Here we have theta = [theta0, theta1] and X = [1, X1], we are transposing theta matrix to get a single value after the multiplication, which is our hypothesis. z = theta0 + theta1 * X1 is another way of writing it. But z = theta transpose * X is a general way (in case if we have multiple features(X.columns > 2)).

  • @osamaosama-vh6vu
    @osamaosama-vh6vu 2 роки тому

    Great explantion thank u dear sir be happy😍

  • @sankhadipbera9271
    @sankhadipbera9271 2 місяці тому

    quality content ❤‍🔥❤‍🔥

  • @ankurgbpuat
    @ankurgbpuat Рік тому

    Please tell us why a log function is used as a cost function(if you know at all)

    • @shaileshkumar-rg9tg
      @shaileshkumar-rg9tg Рік тому

      if you know -we are all ears.

    • @ankurgbpuat
      @ankurgbpuat Рік тому

      @@shaileshkumar-rg9tg Sure thing! It's done to ensure the cost function is convex.

    • @anamitrasingha6362
      @anamitrasingha6362 Рік тому

      there are various flavors of ML algorithms. In logistic regression with the approach of trying to learn a discriminative function that can classify a point into a particular label => a function f:X->Y such that f(datapoint) = class_label(belonging to set Y). Since these class labels are discrete if you try to use a mean_squared_error loss function you will get an expression of the loss function which will not be a convex function, I have attempted a proof of it but it involves a bit of intricate mathematics. You can do that by showing that the hessian of the loss function is neither positive semi-definite nor negative semi-definite hence it's neither convex nor concave. When you use a loss function which is a logistic loss function you get a concave function and you basically would need to do a gradient ascent to get to the maxima of the concave function. Again these involve concepts from Convex Optimization which you may attempt to read if interested from Boyd.

  • @praneetnayak6757
    @praneetnayak6757 2 роки тому +2

    IN that case what does "Maximum Likelihood" mean?

    • @rehmanahmadchaudhry2548
      @rehmanahmadchaudhry2548 Рік тому

      maximum likelihood is used to simply estimate the parameters i.e. coffcients, these cofficients are further used in odds, log odds

    • @anamitrasingha6362
      @anamitrasingha6362 Рік тому

      Likelihood of parameters means what's the probability of having observed the particular distribution of the dataset that you have with your right now given that I choose a particular set of parameters. What maximum likelihood estimation says is that you want to find that set of parameters that maximises the probability of having observed that distribution of the dataset that you have. You do that by taking the gradient of the likelihood/log-likelihood function with respect to the parameters and equating to 0, then solving for those parameters

  • @shivamgondkar6183
    @shivamgondkar6183 9 місяців тому

    Hello sir
    Can you please provide notes in pdf form?
    Thanks

  • @AmmarAhmedSiddiqui
    @AmmarAhmedSiddiqui 9 місяців тому

    local minima se global nikalne time ap ne dundi mar di !..

  • @poizn5851
    @poizn5851 2 роки тому

    Thank you so much this one clear my whole droughts

  • @programmingcodewithmukesh2138

    thank you sir ..so helpful for me

  • @kiddo7094
    @kiddo7094 2 роки тому

    maja aa gya quick and understable

  • @aradhyakanth8409
    @aradhyakanth8409 2 роки тому +1

    Thank you sir.

  • @rahuldogra7171
    @rahuldogra7171 Рік тому

    Can you explain probabilistic approach for logistic regression?

    • @arshiyakhan7757
      @arshiyakhan7757 Рік тому

      Maximum likehood

    • @anamitrasingha6362
      @anamitrasingha6362 Рік тому

      Let's say you have a 2-class classification problem. You henceforth assume that your random variable Y values come from a Bernoulli distribution, with each label being either 0 or 1. This random variable Y can take on value 1 based on some probability theta(say), since probabilities of a pmf add up to one hence you can also infer that Y takes on value 0 with probability (1-theta). Now you have a dataset with you consisting of features X(n features say) and your target Y and the number of observations(samples) you have is m(say). What you want to learn is a function mapping f such that f: X -> Y. This f can be a probabilistic function as well. You define the probability of having observed a particular datapoint taking on the y value as say 1 given its features x as Pr(y_i=1|x_i). What you want now is to find the probability of having observed the values of Y across the dataset in the particular order(like y_1 takes value 1, y_2 takes value 0, these y values are what you have from the dataset) given the features X across the whole dataset(in the same order) , so basically Pr(Y|X;theta) this is read as the probability of having observed Y given that you have observed X parameterised by theta. You now define your likelihood function as L(theta) which means the likelihood of theta => the probability of having observed this Y given X. Since each of the observations/samples are independent and they are believed to have come from the same bernoulli distibution(with replacement) or in short i.i.d you say that the Pr(Y|X;theta) = product across all i (Pr(y_i = 1|x_i; theta). Why I did this is because of the independence property in probability which says the Pr(A and B) = Pr(A)*Pr(B) if event A and event B are independent. You now take a log on both sides so as to make your calculation easier and it becomes summation across all i (log(Pr(y_i=1|x_i; theta)). This is called your log-likelihood. What you now want to do is find the value of theta for which this expression is maximized which is known as maximum likelihood estimation. I should also add that this theta is assumed to be a function of w^Tx => g(w^Tx) where typically your g is a sigmoid function. So when you take the gradient you also have to substitute this function in the log-likelihood expression and then you take the gradient w.r.t w.

  • @krishj8011
    @krishj8011 4 місяці тому

    great tutorial

  • @arshad1781
    @arshad1781 2 роки тому +1

    Nice 👍

  • @singhramniwassinghsinghram7676
    @singhramniwassinghsinghram7676 2 роки тому

    Very Helpful Video

  • @kulbhushansingh1101
    @kulbhushansingh1101 2 роки тому +1

    Sir you didn't teach here about loss function in logistic regression

    • @kartikeysingh5781
      @kartikeysingh5781 9 місяців тому +1

      Loss function will be the same as regression just you have to replace the hypothesis function by hypothesis function for logistic regression

    • @kulbhushansingh1101
      @kulbhushansingh1101 9 місяців тому +1

      @@kartikeysingh5781 thanks kartikey, I got it that was 1 year ago 😂

  • @sohildoshi2655
    @sohildoshi2655 2 роки тому

    You are legend!!

  • @sureshsingam7291
    @sureshsingam7291 Рік тому

    The maths for logistic regression you upload in ml playlist is completely different from hindi playlist which is correct🙆‍♂️😰😰

    • @tammy4994
      @tammy4994 Рік тому

      Even I had the same confusion, @krishnaik could you please clarify?

  • @abhishektiwari9673
    @abhishektiwari9673 Рік тому

    Sir this ML playlist is enough to learn complete machine learning.

  • @sabbiruddinakash7181
    @sabbiruddinakash7181 5 місяців тому

    Thank you sir

  • @rohitjagtap5228
    @rohitjagtap5228 2 роки тому

    thanks a lot

  • @pkumar0212
    @pkumar0212 3 місяці тому

    👌

  • @SorryMe-o1y
    @SorryMe-o1y 2 місяці тому

  • @beingaiiitian4559
    @beingaiiitian4559 6 місяців тому

    9:48

  • @barwalgayatri4655
    @barwalgayatri4655 12 днів тому

    Bessssssssssssssssttttttttttttttt

  • @cs_soldier5292
    @cs_soldier5292 Рік тому +1

    Kuch samaj mein nhi aaya sir

  • @as8401
    @as8401 Рік тому +4

    Sir sorry but subkuch dimag ke uppar se chala gaya

    • @MovieOk-p8q
      @MovieOk-p8q 7 місяців тому +1

      Shi bola

    • @supriyasaxena5053
      @supriyasaxena5053 Місяць тому +1

      Thanks maine puri vid dekhne se pehle ye comment dekh liya

    • @as8401
      @as8401 Місяць тому

      @@supriyasaxena5053 मुझे खुशी है की आपका टाईम मेने बाचाया

  • @arshad1781
    @arshad1781 2 роки тому +1

    Thanks 👍