Multi Armed Bandits - Reinforcement Learning Explained!

Proximal Policy Optimization | ChatGPT uses this

Мой тг: Подвал Стинта #стинт #stint #stintik

Passat CC на 300 л.с. Начало проекта!

💔 Історія захисника Маріуполя, який втратив ногу, осліп на праве око і пройшов полон. #зсу #shorts

Elements of Reinforcement Learning

CodeEmporium

Переглядів 7 346

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 5 чер 2024
Elements of Reinforcement Learning
ABOUT ME
⭕ Subscribe: ua-cam.com/users/CodeEmporiu...
📚 Medium Blog: / dataemporium
💻 Github: github.com/ajhalthor
👔 LinkedIn: / ajay-halthor-477974bb
RESOURCES
[1] Reinforcement Learning book: incompleteideas.net/book/RLboo...
[2] Paradigms of ML: idapgroup.com/blog/types-of-m...
[3] Pong: • DQN Breakout
[4] Learning to walk: • Emergence of Locomotio...
[5] ChatGPT blog: openai.com/blog/chatgpt
[6] Chess: www.kaggle.com/code/arjanso/r...
[7] Model Free vs Model Based RL: spinningup.openai.com/en/late...
PLAYLISTS FROM MY CHANNEL
⭕ Reinforcement Learning: • Reinforcement Learning...
Natural Language Processing: • Natural Language Proce...
⭕ Transformers from Scratch: • Natural Language Proce...
⭕ ChatGPT Playlist: • ChatGPT
⭕ Convolutional Neural Networks: • Convolution Neural Net...
⭕ The Math You Should Know : • The Math You Should Know
⭕ Probability Theory for Machine Learning: • Probability Theory for...
⭕ Coding Machine Learning: • Code Machine Learning
MATH COURSES (7 day free trial)
📕 Mathematics for Machine Learning: imp.i384100.net/MathML
📕 Calculus: imp.i384100.net/Calculus
📕 Statistics for Data Science: imp.i384100.net/AdvancedStati...
📕 Bayesian Statistics: imp.i384100.net/BayesianStati...
📕 Linear Algebra: imp.i384100.net/LinearAlgebra
📕 Probability: imp.i384100.net/Probability
OTHER RELATED COURSES (7 day free trial)
📕 ⭐ Deep Learning Specialization: imp.i384100.net/Deep-Learning
📕 Python for Everybody: imp.i384100.net/python
📕 MLOps Course: imp.i384100.net/MLOps
📕 Natural Language Processing (NLP): imp.i384100.net/NLP
📕 Machine Learning in Production: imp.i384100.net/MLProduction
📕 Data Science Specialization: imp.i384100.net/DataScience
📕 Tensorflow: imp.i384100.net/Tensorflow

КОМЕНТАРІ • 14

@CodeEmporium 8 місяців тому ⁺⁵
If you think I deserve it, please give this video a like as it will help circulate the video immensely. Thank you so much for the support so far !
@pj-nz6nm 8 місяців тому ⁺⁹
Please make more videos on reinforcement learning,my knowledge about this field is very poor.
@CodeEmporium 8 місяців тому ⁺²
I definitely shall make more videos! Thanks for the comment !
@sloth_in_socks 26 днів тому
Great video! It's funny you mentioned unsupervised learning at the start but didn't mention LLMs
@minlingg91 8 місяців тому
keep up the good work! im currently doing a traineeship in AI and your videos have been immensely helpful.
@amiralioghli8622 8 місяців тому
Thank you, sir, for sharing valuable information through your UA-cam channel. Once again, I have a request: please create a series on how to apply Transformers to time series tasks such as anomaly detection, forecasting, or classification. Working on just one of these tasks would be sufficient for us. I have followed numerous articles, short notes, and videos regarding the application of Transformers to time series data, but it is still not clear to me. I am a beginner on this Transformer journey, and there are no useful videos available on UA-cam overall.
@casualpasser-by5954 8 місяців тому
Very nice, short and clear overview of reinforcement learning! However, in the end of the video, I think, the distinction between model-free and model-based algorithms wasn't explained well. It is not about does one train an algorithm on the simulatied or real-world data. Is real world the source of the information or it is a simulation - from the algorithmic point of view the information is in both cases just numbers, produced by some external environment. The real difference between model-free and model-based is that model-based algorithms have intrinsic model within them, which is adjusted during the training to better predict the behaviour of the environment. Of there is no such trainable model within an algorithm and we have only fixed external simulation - we still follow model-free approach.
Sorry for my English.
@CodeEmporium 7 місяців тому
Thanks for this! Honestly I think I am in agreement and this goes to show maybe the words I used in the end to describe this is confusing. Perhaps with this definition , a more concrete example would have been helpful like I had given the others. But I treated that last piece more like a footnote. I’ll probably dedicate more videos and time to this :)
@slitihela1860 3 місяці тому
can you prepare a video for Double Q-Learning Network
and Dueling Double Q-Learning Network
please
@mdbayazid6837 8 місяців тому ⁺¹
I would request for a book reading camp if possible
@CodeEmporium 8 місяців тому ⁺¹
Ooo this is a fun topic! I shall consider
@arunima29 8 місяців тому
Please make detailed videos on all the concepts of RL.
@CodeEmporium 8 місяців тому ⁺²
Roger! I shall!
@MilesBellas 8 місяців тому
1.7x speed = best

Наступне

Автоматичне відтворення

Multi Armed Bandits - Reinforcement Learning Explained!

Multi Armed Bandits - Reinforcement Learning Explained!

Proximal Policy Optimization | ChatGPT uses this

Proximal Policy Optimization | ChatGPT uses this

Мой тг: Подвал Стинта #стинт #stint #stintik

Мой тг: Подвал Стинта #стинт #stint #stintik

Passat CC на 300 л.с. Начало проекта!

Passat CC на 300 л.с. Начало проекта!

💔 Історія захисника Маріуполя, який втратив ногу, осліп на праве око і пройшов полон. #зсу #shorts

💔 Історія захисника Маріуполя, який втратив ногу, осліп на праве око і пройшов полон. #зсу #shorts

😈Парний прохід наших Мі-8 на наднизькій висоті! #shorts

😈Парний прохід наших Мі-8 на наднизькій висоті! #shorts

Reinforcement Learning with Human Feedback - How to train and fine-tune Transformer Models

Reinforcement Learning with Human Feedback - How to train and fine-tune Transformer Models

How Google Translate Works - The Machine Learning Algorithm Explained!

How Google Translate Works - The Machine Learning Algorithm Explained!

Q-learning - Explained!

Q-learning - Explained!

Embeddings - EXPLAINED!

Embeddings - EXPLAINED!

Reinforcement Learning, by the Book

Reinforcement Learning, by the Book

Foundation of Q-learning | Temporal Difference Learning explained!

Foundation of Q-learning | Temporal Difference Learning explained!

Llama - EXPLAINED!

Llama - EXPLAINED!

An introduction to Reinforcement Learning

An introduction to Reinforcement Learning

A Complete Overview of Word Embeddings

A Complete Overview of Word Embeddings

Зеленський і нові сигнали Путіну від США. Китай вступає у війну? | Діалоги з Портниковим

Зеленський і нові сигнали Путіну від США. Китай вступає у війну? | Діалоги з Портниковим

ДАЖЕ победителю СТАЛО СТРАШНО от того, ЧТО он СДЕЛАЛ с проигравшим #shorts

ДАЖЕ победителю СТАЛО СТРАШНО от того, ЧТО он СДЕЛАЛ с проигравшим #shorts

ЛЕБІГА, МАЙОРОВА, КУХАРЧУК, ТКАЧЕНКО. РОЗРЯД | ВИПУСК 13

ЛЕБІГА, МАЙОРОВА, КУХАРЧУК, ТКАЧЕНКО. РОЗРЯД | ВИПУСК 13

✈️ ЗСУ відтісняють авіацію РФ за полярне коло

✈️ ЗСУ відтісняють авіацію РФ за полярне коло

ВОЛКОВА: хочу поїхати в РЕХАБ. Мене ДОМАГАВСЯ викладач. Після СМЕРТІ чоловіка відчула ПОЛЕГШЕННЯ

ВОЛКОВА: хочу поїхати в РЕХАБ. Мене ДОМАГАВСЯ викладач. Після СМЕРТІ чоловіка відчула ПОЛЕГШЕННЯ

надувательство чистой воды

надувательство чистой воды

когда достали одноклассники!

когда достали одноклассники!

The Worlds Most Powerfull Batteries !

The Worlds Most Powerfull Batteries !