CS885 Lecture 17c: Inverse Reinforcement Learning

  • Published 11 Dec 2024

COMMENTS • 8

  • @nikhilchalla6658
    @nikhilchalla6658 1 year ago +1

    Don't know how to thank you for the recordings! It is really helping me with my education on RL. Thank you very much for the effort and for making the amazing lectures available to the public.

  • @datascience_with_yetty
    @datascience_with_yetty 4 years ago +2

    This is the first lecture everyone new to IRL should watch before any other lecture on YouTube. It made me understand the other “very technical” lectures I’ve seen.

  • @tvsrr1990
    @tvsrr1990 3 years ago +1

    So clear, and a good starting point.

  • @nathan_ca
    @nathan_ca 5 years ago

    Thank you, professor! This has been a great starting point for IRL.

  • @youssefkilani9177
    @youssefkilani9177 3 years ago

    Why don't we want the optimized π to be better, i.e., to have a higher R value, than the expert's trajectory?

    • @vrangaswamy1
      @vrangaswamy1 3 years ago +2

      The first assumption in IRL is that the expert policy π* (the one you're imitating) is optimal with respect to some reward function R*. Your current estimate of that reward is R_i; if your policy π does better than π* at optimizing R_i, then R_i != R*. Why? Because the original assumption was that no policy is better than π* when it comes to optimizing R*. So your estimate of R must be wrong, and you need to update it to one under which the expert policy performs better than your current policy. This brings your estimate closer to R* (a rough sketch of this update loop follows below).
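
      A minimal sketch of that reward-update loop, in the spirit of feature-matching IRL (Abbeel & Ng, 2004), not the lecture's exact algorithm. The helpers compute_optimal_policy and feature_expectations are hypothetical placeholders for a planner and a policy-evaluation routine:

      import numpy as np

      def irl_update_loop(mu_expert, compute_optimal_policy, feature_expectations,
                          n_iters=50, tol=1e-4):
          # mu_expert: empirical feature expectations of the expert policy pi*.
          # compute_optimal_policy(w): returns a policy optimal for R(s) = w . phi(s)  (hypothetical).
          # feature_expectations(pi): returns feature expectations of policy pi        (hypothetical).
          w = np.zeros_like(mu_expert)              # current reward estimate R_i
          for _ in range(n_iters):
              pi = compute_optimal_policy(w)        # best response to R_i
              mu = feature_expectations(pi)
              gap = mu_expert - mu                  # expert's advantage under R_i
              if np.linalg.norm(gap) < tol:         # pi matches the expert, so R_i explains pi*
                  break
              # If pi scores at least as well as pi* under R_i, then R_i cannot be R*;
              # move w toward the expert's feature direction so pi* scores higher again.
              w = gap / np.linalg.norm(gap)
          return w

      Each iteration plays out the argument above: find the best policy under the current reward guess, and if it is not distinguishable from the expert, keep the guess; otherwise update the reward so the expert looks better than the learner.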

  • @GoKotlinJava
    @GoKotlinJava 5 years ago

    Brilliant Lecture. Thank you so much

  • @fairuzshadmanishishir8171
    @fairuzshadmanishishir8171 4 years ago

    Best lecture.
    Thanks, Professor!