AI Red Teaming and AI Safety - Amanda Minnich - ESW

Поділитися
Вставка
  • Опубліковано 1 гру 2024

КОМЕНТАРІ • 5

  • @CigdemPatlak
    @CigdemPatlak 2 місяці тому +1

    Great insights, especially the scoring methods were interesting to listen to.

    • @SecurityWeekly
      @SecurityWeekly  2 місяці тому

      Glad you enjoyed it! Thanks for the feedback!

    • @AdrianSanabria
      @AdrianSanabria 2 місяці тому

      Thanks for listening! It's always a good episode when I'm learning tons of stuff and taking notes DURING the interview.

  • @JohnV-e6g
    @JohnV-e6g 3 місяці тому +1

    Hacking AI systems and tools is what I do for fun, I'm always trying to beat my top score. Prompt filters and Guardrails are a joke.

    • @AdrianSanabria
      @AdrianSanabria 2 місяці тому

      Agreed, and it's going to be a constant game of leapfrog, just like jailbreaks on smartphones, malware defenses, and webapp firewalls has been.