Square Machine Learning Interview: Decision Tree Bank Loan

Поділитися
Вставка

КОМЕНТАРІ • 16

  • @liweiwen5247
    @liweiwen5247 2 роки тому +10

    impressive! He is so organized and thorough. Probably the best candidate I saw on your channel

  • @Clammer999
    @Clammer999 3 місяці тому +1

    One of the best videos I’ve seen where I get a good insight into the thought process of a data scientist in terms of the whole process of applying ML to a business function from model selection (considering the pros and cons) to data collection and deployment considerations. Please do more of such videos!

  • @austinhoag5130
    @austinhoag5130 10 місяців тому +3

    Helpful mock interview! I felt like a reasonable question at the beginning would have been what does the historical data look like? That would have eliminated him having to make a bunch of assumptions that impacted his decisions, such as are the classes imbalanced, are there lots of features/samples.

  • @samw2066
    @samw2066 3 місяці тому

    1 of the best candidates I've seen so far

  • @floriandubost5484
    @floriandubost5484 2 роки тому +4

    AUC is not really a good metric for unbalanced datasets. The x-axis metric depends on the number of negatives. Better use a F-ROC curve or precision-recall curve and the area under the precision-recall curve (also called average precision).

  • @brothermalcolm
    @brothermalcolm 2 роки тому +2

    This seemed like a really real interview!

  • @mitchieyy
    @mitchieyy 2 роки тому +2

    Great interview and answer! Just felt the first 8 min is not really related to the question asked if interview time is limited maybe straight to the point?

    • @jimmygan801
      @jimmygan801 Рік тому +1

      first 8 min is gold, in terms of interview technique.. establishing context, reaffirming the understanding of the request gives yourself time to think about the prob... as well as effectively communicate with the stakeholder (here the interviewer). Particularly important if you are interviewing with a company that wants ds to have client facing skills

  • @luckytraderchow
    @luckytraderchow 2 роки тому +1

    @16:47, during the shadow model, not sure if why it needs to be done, ensure the distribution of our real-time prediction is equal to training distribution? can someone help elaborate that? much appreciated!

    • @edwardpavlovsky588
      @edwardpavlovsky588 2 роки тому +1

      I believe the concern here is data drift, or just generally making sure that your model is predicting for a dataset with a similar distribution to the one that the model was trained on. For example, if the model was trained 5 months ago, the distributions in the underlying dataset that it was trained on have likely changed and so the model's learnings are outdated. This becomes less so of an issue with real time systems where the model is being re-trained regularly, but comparing the distribution of features and the output variable that the model was trained on to the distribution of features and output variable that the model is predicting on is still useful because it gives you confidence that the model "learned" on a similar dataset.

  • @淘宝买的会员
    @淘宝买的会员 2 роки тому +1

    Interpretability nowadays has nothing to do with model selection thanks to some open source packages like Shap and Lime. Even with so called black box model, individual prediction and directional contribution are now both explainable.

    • @austinhoag5130
      @austinhoag5130 10 місяців тому +3

      You make it sound like explainability is solved. I wish it were true!

  • @rohitverma1057
    @rohitverma1057 Рік тому +4

    No cross questioning, no product level metric discussion, no revenue saved simulation for the past if model have been deployed before, no AUC drawbacks discussed, no feature engineering discussion, no different set of input data discussion, no sampling to address imbalance - NOTHING !! Really trivial and basic from both the sides.

    • @samw2066
      @samw2066 3 місяці тому

      To an extent I agree with your concerns, although the candidate did prefer PR AUC over standard AUC. I think Jay should not be so laid back, to simulate a real interview.

  • @baomao139
    @baomao139 4 місяці тому

    from someone who is the interviewer at Square, this is definitely what they want to test for. Totally misleading directions😂

    • @samw2066
      @samw2066 3 місяці тому

      Can you clarify ? Sounds like it is definitely what they do NOT want to test, given the totally misleading directions.