How to use Adversarial Validation to Help Fix Overfitting

  • Published Sep 27, 2024
  • Join my Foundations of GNNs online course (www.graphneura...)! Adversarial Validation is a diagnostic tool to test whether your training and test datasets come from the same distribution. If not, it can help you find which variables are causing the problem.
    Blog link: blog.zakjost.c...
    Code: github.com/zjo...
    Discord server: / discord
    Mailing list: Click "Subscribe" at blog.zakjost.com/
    Patreon: / welcomeaioverlords
    Music Attribution:
    1st track: "Jazzy Frenchy" from Bensong.com.
    Intro track: Elder - Legend
    3rd track: Merry Bay by Ghostrifter Official | / ghostrifter-official
    Music promoted by www.free-stock...
    Creative Commons Attribution-ShareAlike 3.0 Unported
    creativecommon...
    4th track: Pink Cadillac by tubebackr | / tubebackr
    Music promoted by www.free-stock...
    Attribution-NoDerivs 3.0 Unported (CC BY-ND 3.0)
    creativecommon...
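
The diagnostic summarized in the description can be sketched roughly as follows. This is a minimal illustration, not the code from the linked repository: the function name `adversarial_validation`, the choice of a random forest, the impurity-based feature importances, and the synthetic data are all assumptions made for the example. The idea is to label each row by whether it came from the training or the test set, train a classifier to predict that label, and read the out-of-fold ROC AUC as a measure of how distinguishable the two sets are.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import StratifiedKFold, cross_val_predict


def adversarial_validation(X_train, X_test, n_splits=5, seed=0):
    """Return (out-of-fold ROC AUC, feature importances) for train-vs-test.

    Hypothetical helper for illustration; not taken from the video's code.
    """
    # Stack both sets and label each row by origin: 0 = train, 1 = test.
    X = np.vstack([X_train, X_test])
    origin = np.concatenate([
        np.zeros(len(X_train), dtype=int),
        np.ones(len(X_test), dtype=int),
    ])

    clf = RandomForestClassifier(n_estimators=100, random_state=seed)
    cv = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)

    # Score with out-of-fold probabilities so memorization does not inflate the AUC.
    proba = cross_val_predict(clf, X, origin, cv=cv, method="predict_proba")[:, 1]
    auc = roc_auc_score(origin, proba)

    # Refit on all rows to see which features the classifier relied on.
    clf.fit(X, origin)
    return auc, clf.feature_importances_


# Synthetic data (an assumption for the demo): three Gaussian features.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(500, 3))
X_same = rng.normal(size=(500, 3))       # same distribution as train
X_shifted = rng.normal(size=(500, 3))
X_shifted[:, 2] += 2.0                   # only feature 2 drifts in this "test" set

auc_same, _ = adversarial_validation(X_train, X_same)
auc_shifted, importances = adversarial_validation(X_train, X_shifted)

print("AUC, same distribution:", round(auc_same, 3))
print("AUC, shifted feature:  ", round(auc_shifted, 3))
print("Most important feature index:", int(np.argmax(importances)))
```

An AUC near 0.5 means the classifier cannot tell training rows from test rows, which is what you would expect when both come from the same distribution. An AUC well above 0.5 suggests a shift, and the features with the largest importances are the first candidates to inspect.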

COMMENTS • 11

  • @artem_isakow (2 months ago)

    Thank you!

  • @lakeguy65616 (1 year ago)

    At approx. 9:53, you display a chart of "feature importances"... How do you determine feature importance? Thank you, GREAT VIDEO!

  • @arunavamaulik19 (3 years ago)

    Thank you very much for the video, very helpful.
    Subscribed!

  • @jonathanlarkin1112 (2 years ago)

    Thanks for this. Very nice video. I’m wondering about the use of AV in time series problems where you have lag and moving average features. Is any special care needed in that situation?

  • @nicholasliu-sontag1585 (3 years ago)

    Would the 'ideal' roc_auc score be 0.5?

  • @wei_zou (4 years ago)

    I kept wondering whether allowing people to use the feature matrix of the test data (without labels) will inflate the model's performance on that same test data. Yes, people can win a Kaggle contest like this, but it reduces confidence in the test performance and in generalizability beyond the current test data. Any thoughts on this? Thanks

    • @welcomeaioverlords (4 years ago)

      I think people can and do use the test data to improve their scores. I agree this brings up a lot of questions if you were going to use this sort of approach in a real world setting. But it depends.

  • @vikramrs4191 (4 years ago)

    Thank you for the simple, lucid and articulate explanation. Can you share the link to the code on GitHub, as you mentioned in the video? Thanks in advance

    • @welcomeaioverlords (4 years ago) +1

      Hi Vikram--glad you liked the video. All of the links are in the video's description.

  • @staminadaddy (3 years ago)

    Amazing! Bro you have made my learning journey so much easier. Keep up the good videos!! Definitely subscribed