Your 12-minute video is worth more than all the playlists about Q-learning on YouTube 👏
I've watched so many videos on RL, but this one's the best when it comes to explaining and breaking down the formulas 😭❤ Thank you!
Really enjoying the series. Keep it up
Thanks so much! Super glad you are enjoying this
This was brilliantly explained. Thank you!
Which classical tasks are solved by off-policy algorithms? Do we use them to write bots that solve simple computer games?
Thank you from the bottom of my heart!
You deserve tons of likes!!!
Wow, you are really good at explaining things. Thank you!
Thanks for your efficient, good-quality videos! They not only save time but also give a complete understanding of the topic 😍
Explained well sir!!
great explanation
This is so underrated
Excellent Explanation, hats off.
amazing.
your video is really useful!!! thanks a lot
Wonderful video! Thank you!
Very well explained, sir. It helped a lot.
I may be wrong, I'm not an expert, but isn't the Bellman equation supposed to add the reward of S1, not S2?
A question about the last point you mention: we repeat the procedure many times until the values in the Q-table don't change much anymore. Is that considered to be some form of Monte Carlo (within Q-learning)? Enjoying your videos, by the way. Great work!
Very well explained, thanks a lot!
thank you so much that was so helpful
Thank you so much!!!!!!!!!!!!
thank u so much
This is epic
May God be pleased with you
thanks man
You just saved me from certain defeat =))) I thought I was going to fail the course, lucky I found you 😀😀😀
Instead of saying grid you could almost say DFA
Q*
Bro, how are you speaking like an American? Give me some tips as well.