Natasha Jaques - Social Reinforcement Learning @ UCL DARK

Поділитися
Вставка
  • Опубліковано 29 жов 2024

КОМЕНТАРІ • 5

  •  2 роки тому

    About the question whether PAIRED is doing more than Domain Randomization: If you get a policy that adapts to all suggested environments proposed by DR, it might still not be able to generalize to environments outside of the domain of what the DR is capable of right? Because it could have memorized all the proposed environments. But with PAIRED we constrain the situations the agent would encounter and in that sense force it to learn skills that (hopefully) do generalize better?

  • @albertpeng
    @albertpeng 4 місяці тому

    Good

  • @tufailahmad5528
    @tufailahmad5528 3 роки тому

    Speaking nicely

  • @SFSylvester
    @SFSylvester 3 роки тому

    Where's the code? #papersandtalkswithcode

  • @tufailahmad5528
    @tufailahmad5528 3 роки тому

    Speaking nicely