CS 285: Lecture 15, Part 1: Offline Reinforcement Learning

  • Published Jan 17, 2025

COMMENTS • 6

  • @baselomari3657 1 year ago +1

    At 33:16, shouldn't it be "x*

    • @ResidualSkill 1 year ago

      yeah seems like a typo

    • @dwpark3761 1 year ago +1

      I don't think so. It is a kind of analogy. Just imagine that f(x) is Q(s,a) on the next page.

    • @binyuwang6563 3 months ago

      It's not a typo. Here x*

  • @SphereofTime 9 months ago

    1:00

  • @AmitSingh-jo8ob 2 years ago

    Is there a page from the lab where I can see all the references used in the slides in one place?
    Also, is there a survey paper that covers all of these topics?