How would a Data Scientist analyze Customer Churn?

Поділитися
Вставка
  • Опубліковано 21 вер 2024

КОМЕНТАРІ • 44

  • @jursmin
    @jursmin 3 роки тому +16

    Great video. However, I think that the precision/recall explanation at the end should be the other way around.

    • @CodeEmporium
      @CodeEmporium  3 роки тому +3

      Good eye. I'll clear this up in a comment. Thanks for catching this

    • @Akshaylive
      @Akshaylive 3 роки тому

      Came here to say this!

    • @falak88
      @falak88 3 роки тому

      Aye

    • @bahmanaboulhasanzadeh8512
      @bahmanaboulhasanzadeh8512 2 роки тому

      It would be great if you add a comment on the video itself or correct that section. Many may not read the comment section and thus not realize it.

    • @wuneyeproduction
      @wuneyeproduction 5 місяців тому

      @@bahmanaboulhasanzadeh8512 Exactly!! I almost did not read the comment section.

  • @ipvikas
    @ipvikas 3 роки тому +3

    Precision: Of the users who churned, how many did the model predict would churn
    Precision: छोड़ कर गए लोगों में से कितनों के बारे में model ने बताया था कि ये छोड़ के जा सकते हैं |
    Recall: Of the users the model said would churn, how many users actually churned?
    Recall: Model ने जिनको बताया था कि ये जा सकते हैं , उनमे से कितने वाकई में छोड़ कर चले गए |

  • @19AKS58
    @19AKS58 Місяць тому

    Great video

  • @mohamedgaal5340
    @mohamedgaal5340 2 роки тому

    Thank you for yet another awesome video! I like how you simplify data science concepts.

  • @snjjain
    @snjjain 3 роки тому +6

    I think you got the precision and recall swapped, precision = Tp/ tp + fp , Recall = Tp/ Tp + Fn , The false negatives + true positives in recall stands for the users who actually churned and the true positives + false positives in precisions are the predictions by the model!

    • @CodeEmporium
      @CodeEmporium  3 роки тому +2

      Yep. I saw this in another comment too. Good eye! I'll write a clarifying comment on this soon

    • @migi7787
      @migi7787 Рік тому

      @@CodeEmporium I loved this presentation - you're so good at touching the main points (e.g. data leakage) and all in 13 mins!! Exactly what I was looking for, well done!

  • @deepanshudashora5887
    @deepanshudashora5887 3 роки тому +1

    Another masterpiece by you 🙏

    • @CodeEmporium
      @CodeEmporium  3 роки тому

      Thanks for the compliments! Means a lot!

    • @Storieswithsumith
      @Storieswithsumith 2 роки тому

      @@CodeEmporium you please tell me what is work order

  • @chessplayer0106
    @chessplayer0106 2 роки тому +1

    Wait why can we only train on data up to 3 months in the past? If you have the same features for customers from last year and also see their usage history last year, couldn't you use that historical data to predict churn as well?

  • @YeeLeeHaw
    @YeeLeeHaw 3 роки тому

    Start porting your videos to LBRY's Odysee as well. You can set that to automatically upload it there whenever it uploads to UA-cam and can be activated when you create your Odysee account. You deserve way more exposure, maybe that can help a little since there is far less competition there.

    • @CodeEmporium
      @CodeEmporium  3 роки тому

      Thanks for the suggestion Daniel. I def might consider this

  • @dragomir44
    @dragomir44 3 роки тому +1

    Awesome video! The step at 9:05 went a bit too fast in my opinion. Could you show us, perhaps with made up data, how you verify it using SQL? What goes into the process of defining a "Correct" and an "incorrect"?

    • @rohitsharma0389
      @rohitsharma0389 3 роки тому

      Performing uni-variate analysis of predictor variable's relationship with target variable should answer the "hunches"/hypothesis. For instance, plotting a bar chart of Gender vs Churn will tell us whether males are likely or females are more likely to churn. In this case, we can visualize #WorkOrdersIn6Months against Churn

  • @RidingWithGerdas
    @RidingWithGerdas 3 роки тому +1

    Want to ask about perhpas too bias features. Isn't something like "days from last orders" or "total order count" would be too bias to predict churn. I mean it's common sense that if the customer hasn't ordered for a while or in a long time ordered once or twice, he is SUPER likely to churn when the churn period arrives. Wouldn't such features somehow hide or overshadow other features which perhaps could indicate us which features make negative churn predicitons ?

    • @CodeEmporium
      @CodeEmporium  3 роки тому

      True. Like if the feature is 89 days since last purchase, chances are they will churn. And now that i think about it, it probably won't be useful making that prediction since acting on it is impossible. We could just exclude that as a feature and say "What is the probability the customer churns 3 months from making this prediction". Good eye

    • @RidingWithGerdas
      @RidingWithGerdas 3 роки тому

      @@CodeEmporium Then this would be related to Survival Analysis?

  • @kabeerjaffri4015
    @kabeerjaffri4015 3 роки тому +1

    Great video. Love from pakistan

  • @irfanalfathramadhan4797
    @irfanalfathramadhan4797 10 місяців тому

    Can you provide examples of practices in tools related to churn, for example in Orange Data Maining?
    I'm still learning to churn, thank you

  • @faizaanazam
    @faizaanazam Рік тому

    Love it!

  • @rohitsharma0389
    @rohitsharma0389 3 роки тому

    There are two points that you mentioned in the model training slides
    1. Create snapshots randomly for active users at a given time
    2. Only train on data up to 3 months in the past
    Does that mean that when I want to create a random snapshot, let's say of 18th Mar 2021, I only include customers who have purchased at least once in the last 3 months as anyone whose last purchase was 4 months prior to 18th Mar is already a churned and hence inactive violating the first point.
    Also of those users who are considered active, can we look back at data prior to 3 months or not. I am guessing not because that would violate rule 2 but then you are using a feature #work orders in last 6 months. So I am a little confused here.

  • @cocoarecords
    @cocoarecords 2 роки тому

    amazing

  • @snjjain
    @snjjain 3 роки тому

    how should we treat the test dataset? Does that also have to be randomly sampled? How would you present recall and precision? at the user level or daily observation level? Thanks

  • @EtienneTremblay
    @EtienneTremblay 3 роки тому

    Great video! I don’t think you could use regression though because of right censoring. You don’t know if some of those customers will churn and when. Survival analysis models deal with this kind of censoring.

    • @CodeEmporium
      @CodeEmporium  3 роки тому +1

      Ah. Good call. I think you're right with this

  • @munshir.c6161
    @munshir.c6161 2 роки тому

    Thank you,
    Could u please help me how can make a mathematical equation for churn model?

  • @varunnayyar3138
    @varunnayyar3138 2 роки тому +1

    Precision recall definition seems to be goofed up.

    • @CodeEmporium
      @CodeEmporium  2 роки тому +1

      Yea. My b. Pinned a previous comment that said the same. Thanks!

  • @deadlysk8
    @deadlysk8 3 роки тому

    awesome

  • @SMHasan9
    @SMHasan9 Рік тому

    I don't agree with the 2nd feature.
    still great vdo 👍

  • @Storieswithsumith
    @Storieswithsumith 2 роки тому

    what is work order ???

  • @anjaliudoshi5729
    @anjaliudoshi5729 2 роки тому

    Is this a Hancock 'good job'??