Buck Shlegeris - AI Control [Alignment Workshop]

Поділитися
Вставка
  • Опубліковано 17 січ 2025

КОМЕНТАРІ • 2

  • @kabirkumar5815
    @kabirkumar5815 Місяць тому +3

    Isn't this inherently going to have a limit and not work for an AGI? Wouldn't a smart enough model just be able to game all of these and any other test that we come up with?