[Classic] Playing Atari with Deep Reinforcement Learning (Paper Explained)
- Published 13 Jun 2024
- #ai #dqn #deepmind
After the initial success of deep neural networks, especially convolutional neural networks on supervised image processing tasks, this paper was the first to demonstrate their applicability to reinforcement learning. Deep Q Networks learn from pixel input to play seven different Atari games and outperform baselines that require hand-crafted features. This paper kicked off the entire field of deep reinforcement learning and positioned DeepMind as one of the leading AI companies in the world.
OUTLINE:
0:00 - Intro & Overview
2:50 - Arcade Learning Environment
4:25 - Deep Reinforcement Learning
9:20 - Deep Q-Learning
26:30 - Experience Replay
32:25 - Network Architecture
33:50 - Experiments
37:45 - Conclusion
Paper: arxiv.org/abs/1312.5602
Abstract:
We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.
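The training loop the abstract describes (store transitions, sample minibatches, regress Q toward a bootstrapped one-step target) can be sketched in a few lines. This is an illustrative pure-Python skeleton, not the paper's CNN; the names, buffer size, and discount factor are assumptions.

```python
import random
from collections import deque

GAMMA = 0.99                          # discount factor (illustrative choice)
replay_buffer = deque(maxlen=10_000)  # experience replay memory

def q_target(reward, next_q_values, done):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a'),
    or just r if the episode terminated."""
    if done:
        return reward
    return reward + GAMMA * max(next_q_values)

# Store a transition (s, a, r, s', done), then sample a minibatch.
# In the paper, each s is a stack of preprocessed frames.
replay_buffer.append((0, 1, 1.0, 2, False))
batch = random.sample(replay_buffer, k=1)
```

In the full algorithm, the `q_target` value is regressed against the network's current prediction `Q(s, a)` with gradient descent on the squared error.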
Authors: Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller
Links:
UA-cam: / yannickilcher
Twitter: / ykilcher
Discord: / discord
BitChute: www.bitchute.com/channel/yann...
Minds: www.minds.com/ykilcher
Parler: parler.com/profile/YannicKilcher
LinkedIn: / yannic-kilcher-488534136
If you want to support me, the best thing to do is to share out the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar (preferred to Patreon): www.subscribestar.com/yannick...
Patreon: / yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n
Category: Science & Technology
Totally love your historical paper reviews
Thanks for the historical papers series, Yannic. Great explanation of the contents, with plenty of citations of related happenings. It helps me understand the evolution of DL. Hope to see more coming soon!
I have literally watched 1000s of videos and I couldn't fully understand DRL until I watched this one. Very impressive, detailed explanation. Thank you for it.
Same here!
Absolutely love your videos! Thank you for making these. I've learned a lot!
I am loving it. Thank you so much. YOU DESERVE A MILLION SUBSCRIBERS. HOPE YOU GET THERE SOON.
Thanks for the great explanation! Regarding sticky actions (29:05), I think those were proposed later, in the paper "Revisiting the Arcade Learning Environment..." by Machado et al. (2018), to add stochasticity to the Atari problem.
What a great video! Please keep doing this kind of content 😀
Thanks, very useful for those of us learning deep learning! I love the classic papers series.
Recently I have been learning RL painfully; I didn't understand what was happening in DQN until I watched your videos. Thanks a lot.
Damn! this was exactly what I wanted to learn!! Thank you so much...
This was really awesome! Thanks
I came to understand the paper, and I realised a lot of things in RL that I used to find very difficult. Awesome explanation, sir. Thank you.
Great video! I just coded a DQN-type neural net to play Othello. It has only fully connected layers, with a 64-dim input vector and a 64-dim output vector. I hope to do some experiments with it in the future.
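A setup like the one described in this comment (fully connected layers, 64 board cells in, 64 Q-values out) can be sketched in pure Python; the layer widths, weight initialization, and all names below are illustrative assumptions, not the commenter's actual code.

```python
import random

def linear(x, weights, bias):
    """One fully connected layer: y = Wx + b."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]

def relu(x):
    return [max(0.0, v) for v in x]

random.seed(0)
H = 128  # hidden width, an arbitrary choice
W1 = [[random.uniform(-0.1, 0.1) for _ in range(64)] for _ in range(H)]
b1 = [0.0] * H
W2 = [[random.uniform(-0.1, 0.1) for _ in range(H)] for _ in range(64)]
b2 = [0.0] * 64

board = [0.0] * 64  # one input per Othello square
q_values = linear(relu(linear(board, W1, b1)), W2, b2)
```

In practice a framework like PyTorch would replace the hand-rolled matmul, and illegal moves would be masked out before taking the argmax over `q_values`.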
Thanks for great explanation.
It's November 2023 and you hear the magic name everybody is talking about: 20:52
AlphaGo did for RL what AlexNet did for DL.
David Silver got me interested in this field. Though I am a beginner, I too want to contribute to this field.
Thanks for covering this.
I wouldn't entirely agree with this; in my opinion, AlphaGo presented very few novel ideas, but was able to package four clever networks together into something very practical, which reinforcement learning hadn't had before.
AlphaZero, on the other hand, did have a couple of major novel ideas, but even then, debatably, they were not the inventors of those ideas.
In my opinion, most of the Alpha projects, while more practically impressive than most research projects, did not invent the network architectures, but rather improved them and were able to unload a massive amount of compute onto them.
@@TheThirdLieberkind Having the AI play against itself and learn from that was pretty novel, and definitely at the core of AlphaGo's success.
@@Rhannmah Wasn't RL founded with self-play in checkers?
@@danielguffey Was it? I thought it was trained on human play.
@@Rhannmah "The Samuel Checkers-playing Program was among the world's first successful self-learning programs"
Thanks for the explanation. Can I expect a video on Rainbow DQN?
Yeah... nice review, thanks!
@Yannic - Great video as always and really helped me get a grip on the basics of RL.
Just wondering, though: did you mean to have adverts throughout the video? Up to now I have only seen them at the beginning, and maybe the end too, I can't remember. But this video had one at the start and then three during. I appreciate that you need to generate some income from these videos (and you deserve it), but having the adverts during the video is very off-putting. Would you consider having several at the start instead (if possible)?
Thanks for the feedback. I turned them on in the middle during this video just to see the effect, but I agree they're annoying.
Thanks!
Thanks, great video!
Nice joystick you’ve got there, Yannic 😂. But seriously, I enjoy your work - thank you for the contributions 😊
Hi Yannic! Love your videos so much! But there is one thing I am not clear about: is y_i equal to the Q function approximated at the (i-1)th iteration, i.e. using the neural network weights from the previous iteration? Best
It's the target value, so yes, the Q value to approximate
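For reference, in the paper's notation the target at iteration $i$ is indeed computed from the previous iteration's weights $\theta_{i-1}$:

$$y_i = \mathbb{E}_{s' \sim \mathcal{E}}\!\left[\, r + \gamma \max_{a'} Q(s', a'; \theta_{i-1}) \,\middle|\, s, a \right],$$

and the current weights $\theta_i$ are fit by minimizing the squared error

$$L_i(\theta_i) = \mathbb{E}_{s,a \sim \rho(\cdot)}\!\left[ \left( y_i - Q(s, a; \theta_i) \right)^2 \right],$$

treating $y_i$ as a fixed constant when differentiating.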
What does he mean by LaTeX savagery around 2:30?
Which program do you use on your iPad to make those annotations outside the margins of the papers?
niceeeee
what happened in Pong? C'mon, David!
Does anyone know what he is talking about at 2:10? LaTeX savagery???
Did you understand??
What would you replace LaTeX with? Surely not Word?😂
Markdown with MathJax. Or just use Jupyter Notebooks with inline code.
@@herp_derpingson Exactly. Papers with Code and distill.pub are already moving in this direction. There's no reason papers can't be interactive.
Surely there are alternatives, but the thing is that everyone knows LaTeX, so it is easy to collaborate and it is fast. Getting math formulas done quickly, and looking good, is easy. LaTeX has some quirks, but they are not hard to work around and fix. I would say there are alternatives, but nothing comes close.
I love you
Savagery is OK if it doesn't decrease the quality of the research; formatting is so boring...
2013, a really old paper
I can't share this gold-mine content with anyone. I don't know anybody who would be interested in all this.
But you can always find someone in this community later on; just stay interested :D