Reinforcement Learning: AlphaGo

MIT 6.S191: Reinforcement Learning

The moment we stopped understanding AI [AlexNet]

Попри зливу у Полтаві відкрили дошку воїну

Внезапно! Что на самом деле подорвал «Орешник»

МАМАША, Когда обидели Ребёнка (смешное видео, юмор, приколы, поржать)

Reinforcement Learning from scratch

Graphics in 5 Minutes

Переглядів 74 692

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 29 лис 2024

КОМЕНТАРІ • 51

@darthvader4899 8 місяців тому ⁺⁴¹
this is video is super underrated. In fact the whole channel is underrated.
@william_8844 4 місяці тому
Maybe i should follow the channel then 😅.
This was my first vid, and the explanation was really well simplified
@themathguy3149 Рік тому ⁺⁹
Your Channel IS SO GREAT, I share with all my eng friends for you to get more visibility!
@tushargupta1999 8 місяців тому ⁺⁵
This video is amazing. You explained everything in such a simple manner. I am feeling really motivated to learn more about reinforcement learning and neural networks after watching this.
@ashketchum1244 Рік тому ⁺⁶
I don't know how I stumbled upon this video but that was very interesting and intuitive to understand. Thank you.
@jameslibby5215 Рік тому ⁺⁸
Very very underrated channel
@benc7910 10 місяців тому
Underrated, two Rs
@jameslibby5215 10 місяців тому
@@benc7910 thank ya sir
@Arivan_Abdulla 4 місяці тому ⁺³
Too beautiful you can watch this kind of videos all the day without get bored
@mind6861 5 місяців тому ⁺²⁸
Can we have the code for this
@poopcoder468 Місяць тому
Lol😅😅😅😅😅😅
@themax2go 8 місяців тому ⁺⁴
agi: 1. ai develops understanding of win-loss conditions and sets policy params (inputs & actions) accordingly. 2. ai creates (= designs & builds) training env(s). 3. ai iterates, avals & adjusts policy parameters accordingly 4. done (or validation run(s) w/ human(s))
@metaljacket8102 7 місяців тому ⁺²
This is really awsome! It's the best video that explains DRL in such an easy to understand way!
@Bet-s4g 2 місяці тому ⁺¹
This is super underrated video
@a.aspden Рік тому ⁺²
Your videos are great. Looking forward to more!
@cloudysh 7 місяців тому ⁺¹
This was so surprisingly great :3
@CptDoge-rn3ou Рік тому ⁺²
I really like the way you visualize what you are talking about. Thank you for putting in the effort!
@Sumpydumpert 5 місяців тому ⁺¹
I agree once you see how it all works it seems like 1s and zeros give me some feed back on r/grand unified theory or cosmo knowledge
@marcinstrzesak346 Рік тому ⁺¹
Great video, very helpful, easy to understand.
@moldo800 10 місяців тому ⁺¹
Excellent. Congratulations ❤
@swannschilling474 5 місяців тому ⁺¹
Thanks a lot for this one! 😊
@luiseduardocraizer7416 6 місяців тому ⁺¹
Excellent content!
@anthonyortiz7924 2 місяці тому
What a great series! I have a question for the experts... was it necessary to map velocity as an input? I'm guessing it's not absolutely necessary and was done to make the training faster? My guess is based on the assumption that the timing of the ball x/y changes to the inputs have an effect, but I may be wrong.
@gmjammin4367 Рік тому ⁺¹
Amazing video as always :)!
@BlueBirdgg Рік тому ⁺¹
Can you playlist each one of your topics plz?
I wanted to post on Twitter(X) your video topics but could only post a single video at a time.
Great content by the way. Ty very much.
Your perspective on some topics helped me a lot to get a more intuitive understanding.
@g5min Рік тому
Good idea! Here's one on generative AI:
ua-cam.com/play/PLWfDJ5nla8UoR8P7AGqVw7ZPjXajUFLMo.html
Here's one on reinforcement learning
ua-cam.com/play/PLWfDJ5nla8UoexEaLqVMw7q3Ft0vRYscL.html
Here's one on LLMs + text-to-image
ua-cam.com/play/PLWfDJ5nla8UoG2mvvHs_OS0asAKC5HJeu.html
@BlueBirdgg Рік тому
@@g5min Ty!
@jaideepraulji1395 3 місяці тому ⁺¹
Superb
@mohajeramir 7 місяців тому ⁺²
Excellent
@mado.madeleine Рік тому ⁺¹
Super helpful! Thank you 🙏🏽
@jdlopes06 5 місяців тому ⁺¹
Thank you!
@william_8844 4 місяці тому
I get how the model can see moves and output up or down action. But I don't get how model tracks the score for rewards etc
Can someone explain how the reward is fed into model
@edvinbeqari7551 10 місяців тому
What is your reward function for the pong game? I did a similar pong game and I couldn't get it to learn.
@nikbivation Рік тому ⁺¹
thank you for this!
@ireoluwaTH Рік тому ⁺¹
Thank you!!!
@bombur9007 7 місяців тому
how many layers should such network have
@n4mmenam Рік тому ⁺¹
Brilliant
@mineq4967 8 місяців тому
but by what number do you change the weights like you never told us
@NR_5tudio Місяць тому
i just have a quastion, what is that thing ? 6:20 its like a worm ?
like. i didnt take it in my math class.... im 16 years btw
i mean the one u added
@maxim_ml 6 місяців тому ⁺¹
that was good
@axe863 Рік тому ⁺²
Simple Reinforcement learning is extremely dangerous in certain nonstationary environments 😅
@nischalyou Рік тому
whats the name of this video game ?
@gaydemaupassant6263 5 місяців тому
Pls o want the code plsss
@FRANKONATOR123 Рік тому
Can you share the source code for this project
@g5min Рік тому
You can follow the link to the Karpathy site at the end of the video, repeated here:
karpathy.github.io/2016/05/31/rl/
@herikaniugu Рік тому
Imagine using reinforcement learning in quantitative finance 😊
@macratak Рік тому
ah yes, reinforcement learning. a fundamental computer graphics technology
@g5min Рік тому ⁺⁶
I think that character/game-AI is pretty central to graphics
@pw7225 Рік тому ⁺¹
Why so negative?
@revimfadli4666 Рік тому
@@g5minespecially AI image generation or processing nowadays

Наступне

Автоматичне відтворення

Reinforcement Learning: AlphaGo

Reinforcement Learning: AlphaGo

MIT 6.S191: Reinforcement Learning

MIT 6.S191: Reinforcement Learning

The moment we stopped understanding AI [AlexNet]

The moment we stopped understanding AI [AlexNet]

Попри зливу у Полтаві відкрили дошку воїну

Попри зливу у Полтаві відкрили дошку воїну

Внезапно! Что на самом деле подорвал «Орешник»

Внезапно! Что на самом деле подорвал «Орешник»

МАМАША, Когда обидели Ребёнка (смешное видео, юмор, приколы, поржать)

МАМАША, Когда обидели Ребёнка (смешное видео, юмор, приколы, поржать)

Час РАСПЛАТЫ от МАЙКА ТАЙСОНА

Час РАСПЛАТЫ от МАЙКА ТАЙСОНА

The Most Important Algorithm in Machine Learning

The Most Important Algorithm in Machine Learning

The Man Who Solved the $1 Million Math Problem...Then Disappeared

The Man Who Solved the $1 Million Math Problem...Then Disappeared

Transformers (how LLMs work) explained visually | DL5

Transformers (how LLMs work) explained visually | DL5

Reinforcement Learning, by the Book

Reinforcement Learning, by the Book

Harvard Professor Explains Algorithms in 5 Levels of Difficulty | WIRED

Harvard Professor Explains Algorithms in 5 Levels of Difficulty | WIRED

AI beats multiple World Records in Trackmania

AI beats multiple World Records in Trackmania

A friendly introduction to deep reinforcement learning, Q-networks and policy gradients

A friendly introduction to deep reinforcement learning, Q-networks and policy gradients

AI Learns Insane Monopoly Strategies

AI Learns Insane Monopoly Strategies

An introduction to Reinforcement Learning

An introduction to Reinforcement Learning

Президент відвідав українських воїнів, які проходять лікування в госпіталі

Президент відвідав українських воїнів, які проходять лікування в госпіталі

А я думаю что за звук такой знакомый? 😂😂😂

А я думаю что за звук такой знакомый? 😂😂😂

Як в Уторопах варять сіль із соровиці з місцевого джерела

Як в Уторопах варять сіль із соровиці з місцевого джерела

САМАЯ ТРАГИЧНАЯ ИСТОРИЯ ЛЮБВИ! БЫВШИЙ РАЗРУШИЛ ЕЁ ЖИЗНЬ, ЧТОБЫ ВЕРНУТЬ СЕБЕ? | Новинки мелодрам 2024

САМАЯ ТРАГИЧНАЯ ИСТОРИЯ ЛЮБВИ! БЫВШИЙ РАЗРУШИЛ ЕЁ ЖИЗНЬ, ЧТОБЫ ВЕРНУТЬ СЕБЕ? | Новинки мелодрам 2024

«Угадай кто?» В этой игре и карточки с Гарри Поттером есть 🪄 Артикул WВ: 138578734, Ozоn: 981564320

«Угадай кто?» В этой игре и карточки с Гарри Поттером есть 🪄 Артикул WВ: 138578734, Ozоn: 981564320

ШАМАНКА НЕ СТРИМАЛА ЕМОЦІЙ! “ЧОМУ ВИ НЕ ЗБЕРІГАЄТЕ ЖИТТЯ УКРАЇНСЬКИХ СОЛДАТ?!” - СЕЙРАШ

ШАМАНКА НЕ СТРИМАЛА ЕМОЦІЙ! “ЧОМУ ВИ НЕ ЗБЕРІГАЄТЕ ЖИТТЯ УКРАЇНСЬКИХ СОЛДАТ?!” - СЕЙРАШ

Что будет если съесть грибы в Майнкрафте #shorts #майнкрафт #minecraft

Что будет если съесть грибы в Майнкрафте #shorts #майнкрафт #minecraft

3 Дня как Бомж! Масленников, Сабина, Даник живут на помойке

3 Дня как Бомж! Масленников, Сабина, Даник живут на помойке