MIT 6.S191: Reinforcement Learning

CS 7646: QLearning and robot navigation

"10 Ways Backtests Lie" by Tucker Balch

How to Cut Glass Bottles: DIY Techniques for Creative Projects!

⚡️Орбан ЗУСТРІВСЯ із Зеленським в Брюсселі #shorts

Это было очень близко...

CS 7646: QLearning Trader Project Overview

Tucker Balch

Переглядів 42 253

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 30 жов 2024

КОМЕНТАРІ • 7

@makeshiftpenny 3 роки тому ⁺²⁷
2:15 - [START HERE] code structure and templates
4:45 - StrategyLearner API
6:15 - addEvidence() parameters and behavior
7:10 - testPolicy() parameters and behavior
9:30 - Evaluation rubric
12:25 - Implementation of StrategyLearner
17:00 - how to frame trading as a reinforcement learning (RL) problem
19:55 - What defines the State (in this problem)?
21:15 - What are the Actions?
27:35 - What is the Reward?
28:30 - Should the reward be delayed (long-term, i.e. cumulative return) or frequent (short-term, i.e. daily return)?
31:15 - Balch assumes we are not using the Transition Matrix (i.e. no Dyna-Q)
33:45 - How to represent the State?
38:15 - StrategyLearner addEvidence() pseudocode
44:20 - Q: How do we define convergence?
50:45 - testPolicy() pseudocode
52:15 - adding missing line in addEvidence() pseudocode
54:05 - discussion of short-term vs long-term rewards
58:00 - daily return rewards depend on state (long, short, none). If holdings are none, you should get no rewards
59:15 - should we use Dyna-Q? Dyna is not recommended, because we want to minimize runtime, but it should reduce the number of trades
1:00:30 - [END OF LECTURE]
@japanboy31415 11 місяців тому ⁺⁶
Ml4t gang wya ?
@bronsonschnitzel7493 7 місяців тому ⁺⁸
Another 7 year old video courtesy of OMSCS
@kuatroka 7 років тому ⁺¹
Hi professor Balch, thanks for the Udacity course ML for Trading!
I'd like to ask a question. In the section 03-06 - Q-Learning - Quiz: The Trading Problem: State (min 0:43) you explain that Adjusted Close and SMA are not good to be chosen as factors for our State because the values are meaningless outside of the context of comparison. You say that the Price/SMA ratio, on the other hand is a better fit. Later you say that BB values are good and could be used. My question is in this context (Q-Learning for Trading) what is the difference here between BB value and SMA for example. The BB values will also be different for different stocks and are also of the same nature as the Price or SMA would be since BB value is not a ratio. Maybe I'm missing something and somehow you meant a normalised sort of BB value, for example in percentage points? I'm just trying to understand what make sense to use as features for Trading. what makes sense to use and what not, but I want to understand the general idea behind it. Thanks
@viniciusepheta 7 років тому ⁺²
Yes, I think he meant the normalized BB, i.e. a ratio. In fact, the most usual thing to do is to standardize the BB value, also called as z-score, this is a comparable value among different stocks.
@VR-fh4im 3 роки тому ⁺¹
@@viniciusepheta He does means to say standardize. When we standardize the training feature, we will use mean and standard deviation values of training factors later with test data, when we use the Q-Learner.
@japanboy31415 11 місяців тому ⁺²
money man

Наступне

Автоматичне відтворення

MIT 6.S191: Reinforcement Learning

MIT 6.S191: Reinforcement Learning

CS 7646: QLearning and robot navigation

CS 7646: QLearning and robot navigation

"10 Ways Backtests Lie" by Tucker Balch

"10 Ways Backtests Lie" by Tucker Balch

How to Cut Glass Bottles: DIY Techniques for Creative Projects!

How to Cut Glass Bottles: DIY Techniques for Creative Projects!

⚡️Орбан ЗУСТРІВСЯ із Зеленським в Брюсселі #shorts

⚡️Орбан ЗУСТРІВСЯ із Зеленським в Брюсселі #shorts

Это было очень близко...

Это было очень близко...

DOMIY & SHUMEI - Не пройде

DOMIY & SHUMEI - Не пройде

ML Was Hard Until I Learned These 5 Secrets!

ML Was Hard Until I Learned These 5 Secrets!

Q-Learning: Model Free Reinforcement Learning and Temporal Difference Learning

Q-Learning: Model Free Reinforcement Learning and Temporal Difference Learning

Think Fast, Talk Smart: Communication Techniques

Think Fast, Talk Smart: Communication Techniques

One of the Greatest Speeches Ever | Steve Jobs

One of the Greatest Speeches Ever | Steve Jobs

Rory Sutherland - Are We Now Too Impatient to Be Intelligent? | Nudgestock 2024

Rory Sutherland – Are We Now Too Impatient to Be Intelligent? | Nudgestock 2024

What is generative AI and how does it work? - The Turing Lectures with Mirella Lapata

What is generative AI and how does it work? – The Turing Lectures with Mirella Lapata

CS 7646: Guest Speaker Todd Simkin of Susquehanna

CS 7646: Guest Speaker Todd Simkin of Susquehanna

What Should Leaders Learn from History?

What Should Leaders Learn from History?

Q-learning - Explained!

Q-learning - Explained!

CAN YOU DO THIS ?

CAN YOU DO THIS ?

DEMONS ARE ATTACKING BRAWL STARS!!!

DEMONS ARE ATTACKING BRAWL STARS!!!

Не так важно как ТЫ БЬЁШЬ, а важно какой ДЕРЖИШЬ УДАР😎 #shorts

Не так важно как ТЫ БЬЁШЬ, а важно какой ДЕРЖИШЬ УДАР😎 #shorts

"Ми дуже дякуємо цим хлопцям". Українські військові врятували двох жінок з лівого берега Дніпра

"Ми дуже дякуємо цим хлопцям". Українські військові врятували двох жінок з лівого берега Дніпра

Купил КЛОУНА на DEEP WEB !

Купил КЛОУНА на DEEP WEB !

ДИЗЕЛЬ ШОУ 2024 💙 150 ВИПУСК 💛💐 ВЕЛИКА ПРЕМ'ЄРА 🌷 від 18.10.2024

ДИЗЕЛЬ ШОУ 2024 💙 150 ВИПУСК 💛💐 ВЕЛИКА ПРЕМ'ЄРА 🌷 від 18.10.2024

Cool Items!🥰 New Gadgets, Smart Appliances, Kitchen Tools Utensils, Home Cleaning, Beauty #shorts

Cool Items!🥰 New Gadgets, Smart Appliances, Kitchen Tools Utensils, Home Cleaning, Beauty #shorts

СОБАКА И ТРИ ТАБАЛАПКИ😱#shorts

СОБАКА И ТРИ ТАБАЛАПКИ😱#shorts