RLHF: How to Learn from Human Feedback with Reinforcement Learning

  • Published 31 Oct 2024

COMMENTS • 7

  • @hayatisschon
    @hayatisschon 1 month ago

    Great talk!

  • @jteichma
    @jteichma 1 month ago

    Wonderful talk! Thanks Natasha!❤

  • @KshitizVermaDL
    @KshitizVermaDL 6 months ago

    Thank you, Natasha, for this awesome talk! I was looking for an explanation of exactly the set of papers you chose, and I ended up watching your talk, where you compare them holistically. Thanks a lot! This was super useful!

  • @sudaravanm379
    @sudaravanm379 8 months ago

    Where can I get the slides for the above presentation?

    • @CooperativeAIFoundation
      @CooperativeAIFoundation 8 months ago +6

      Here you go (with thanks to Natasha for sharing these): docs.google.com/presentation/d/1QyxGW2xCJNzzGtMqSWtLeT3u0PX_8d9p_e6oXdVcMgs/edit?usp=sharing

    • @sudaravanm379
      @sudaravanm379 8 months ago

      @CooperativeAIFoundation Thanks a lot!!!!