#1. Q Learning Algorithm Solved Example | Reinforcement Learning | Machine Learning by Mahesh Huddar

  • Published 21 Nov 2022
    Introduction to Reinforcement Learning: • Introduction to Reinfo...
    Q Learning Algorithm Explained: • Q Learning Algorithm |...
    #1. Q Learning Algorithm Solved Example: • #1. Q Learning Algorit...
    The following concepts are discussed:
    ______________________________
    Q learning algorithm,
    q learning algorithm in machine learning,
    q learning in reinforcement learning,
    reinforcement learning,
    reinforcement learning solved example,
    solved example q learning,
    q learning numerical example
    ********************************
    1. Blog / Website: www.vtupulse.com/
    2. Like Facebook Page: / vtupulse
    3. Follow us on Instagram: / vtupulse
    4. Like, Share, Subscribe, and Don't forget to press the bell ICON for regular updates

COMMENTS • 73

  • @thelazy.7845
    @thelazy.7845 4 months ago +101

    Attendance one night before exam..😂😂

  • @angelgarcialopezdeharo6829
    @angelgarcialopezdeharo6829 1 year ago +9

    Mahesh, you're just too good!! Super well explained. Thanks for the help!

  • @VedantPratik
    @VedantPratik 1 month ago

    This is one of the best YouTube channels for machine learning. I watched a lot of foreign videos but found no good explanation; I watched this video for just 5 minutes and understood everything. Thanks for your help.

    • @MaheshHuddar
      @MaheshHuddar  1 month ago

      Welcome
      Do like share and subscribe

  • @marcods6546
    @marcods6546 1 year ago +6

    Very nice explanation, because, unlike the explanations I have seen so far, you run through the algorithm, showing the Q learning process. Thanks a lot!

  • @hardikrajeswaran4797
    @hardikrajeswaran4797 1 year ago

    Nice explanation... Is it possible for you to explain inverse reinforcement learning with a similar example, where we have to compute the reward function from an expert?

  • @poojithsrisai2576
    @poojithsrisai2576 10 months ago +1

    Thank you sir, it's the best explanation for q-learning.

    • @MaheshHuddar
      @MaheshHuddar  10 months ago

      Welcome
      Do like share and subscribe

  • @manelgomes9086
    @manelgomes9086 10 months ago

    Amazing video! Very clear explanation

    • @MaheshHuddar
      @MaheshHuddar  10 months ago

      Thank You
      Do like share and subscribe

  • @junaid8573
    @junaid8573 1 year ago

    Thanks a lot! Learned a lot in just 11 minutes!!!

  • @ganeshsubramanian6217
    @ganeshsubramanian6217 4 months ago

    Sir, this is helpful.
    Any reason you took -1 and 0, instead of 0 and 1, for the values in the matrix?

  • @abhishekarora4007
    @abhishekarora4007 1 year ago

    Amazing explanation Mahesh !

  • @kenzakhelkhal7931
    @kenzakhelkhal7931 1 year ago

    Very good job, thanks a lot for your efforts.

  • @kiranmaik7128
    @kiranmaik7128 8 months ago

    Thank you very much, it helped me a lot. Very nice explanation...

    • @MaheshHuddar
      @MaheshHuddar  8 months ago

      You are welcome
      Do like share and subscribe

  • @farzadveysi755
    @farzadveysi755 7 months ago

    Very clear example, good job!

    • @MaheshHuddar
      @MaheshHuddar  7 months ago

      Glad it was helpful!
      Please do like share and subscribe

  • @Macooasme
    @Macooasme 1 month ago

    Great video! All others ignore the technical aspect but you went ahead and

    • @MaheshHuddar
      @MaheshHuddar  1 month ago +1

      Thank You
      Do like share and subscribe

  • @aliawad2244
    @aliawad2244 1 year ago +3

    You are a true LEGEND ^_^

  • @nimrafaryad4103
    @nimrafaryad4103 1 year ago +1

    Thanks sir Jazak Allah 👍🏼

  • @tominfotech
    @tominfotech 1 year ago

    Great job 👍

  • @datastako156
    @datastako156 1 year ago

    very good explanation, thank you sir

  • @kshirasagarsahoo4254
    @kshirasagarsahoo4254 1 year ago +1

    nice explanation with the example.

  • @codeXtree
    @codeXtree 8 months ago

    Very clear explanation.

    • @MaheshHuddar
      @MaheshHuddar  8 months ago

      Welcome
      Do like share and subscribe

  • @user-ug1pj6kv8h
    @user-ug1pj6kv8h 6 months ago

    Thank you so much sir...

    • @MaheshHuddar
      @MaheshHuddar  6 months ago

      Welcome
      Do like share and subscribe

  • @user-qd7ko1nb2r
    @user-qd7ko1nb2r 10 months ago

    Excellent explanation

    • @MaheshHuddar
      @MaheshHuddar  10 months ago

      Thank You
      Do like share and subscribe

  • @bhuvanaaaaa
    @bhuvanaaaaa 6 months ago

    Thank you very much sir😊

    • @MaheshHuddar
      @MaheshHuddar  6 months ago

      Welcome
      Do like share and subscribe

  • @vishalgaursuniverse
    @vishalgaursuniverse 3 months ago +1

    Amazing Content :->

    • @MaheshHuddar
      @MaheshHuddar  3 months ago +1

      Thank You
      Do like share and subscribe

  • @buh357
    @buh357 1 year ago +2

    you are awesome

    • @MaheshHuddar
      @MaheshHuddar  1 year ago +1

      Thank You
      Do like share and subscribe

  • @idk-uj2lo
    @idk-uj2lo 2 months ago +3

    IIIT Kottayam students welcome

  • @thenkanishankar7538
    @thenkanishankar7538 5 months ago

    thank you sir😇

    • @MaheshHuddar
      @MaheshHuddar  5 months ago

      Welcome
      Do like share and subscribe

  • @topinfo5188
    @topinfo5188 1 year ago +1

    Will everyone get the same final values as yours irrespective of the random selections they make, or will they change?

    • @saurabhdhasmana2331
      @saurabhdhasmana2331 1 year ago

      He doesn't know, bro.

    • @user-bn3zw9sd1p
      @user-bn3zw9sd1p 1 year ago +1

      No, your values can differ from the results in the article. What you need to do with your results is: find the max value in the Q matrix (perhaps 500), then divide every value in the Q matrix by this max and multiply by the reward for the terminal state (100). You should obtain the same values.
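
The rescaling described in this reply can be sketched in a few lines of Python. The function name `normalize_q` and the small 3x3 matrix are hypothetical, chosen only so that the maximum is 500 as the reply suggests; they are not the matrix from the video.

```python
def normalize_q(q, terminal_reward=100.0):
    """Scale a Q matrix so that its largest entry equals terminal_reward."""
    q_max = max(max(row) for row in q)
    if q_max <= 0:
        # Nothing has been learned yet; return a copy and avoid dividing by zero.
        return [row[:] for row in q]
    return [[value / q_max * terminal_reward for value in row] for row in q]


# Hypothetical raw Q values (illustration only, not taken from the video).
q_raw = [
    [0.0, 400.0, 0.0],
    [320.0, 0.0, 500.0],
    [0.0, 256.0, 0.0],
]

q_scaled = normalize_q(q_raw)
for row in q_scaled:
    print([round(value) for value in row])
```

Two runs that ended with different raw magnitudes can be compared this way, since only the ratio of each entry to the largest one survives the rescaling.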

  • @robert-dr8569
    @robert-dr8569 1 year ago

    Thank you! Excellent explanation and very helpful..

  • @jaligamaabhiram9594
    @jaligamaabhiram9594 7 months ago

    Nice explanation, sir😇

    • @MaheshHuddar
      @MaheshHuddar  7 months ago

      Thank You
      Do like share and subscribe

  • @shabbirbohra
    @shabbirbohra 2 months ago

    Very good explanation of the example. How can this be implemented in MATLAB or Python? Is ready-made code available?

    • @MaheshHuddar
      @MaheshHuddar  2 months ago

      Thank You
      I don't have ready-made code
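
For readers asking the same question, here is a minimal Python sketch of tabular Q-learning with random action selection. It is not the author's code. The six-state reward matrix `R`, the discount factor `GAMMA = 0.8`, and the goal state `5` are assumptions based on the values mentioned elsewhere in this thread (-1, 0, and a terminal reward of 100); the helper names are hypothetical.

```python
import random

# ASSUMPTION: reward matrix for a six-state example.
# -1 = move not allowed, 0 = allowed move, 100 = move into the goal state.
R = [
    [-1, -1, -1, -1, 0, -1],
    [-1, -1, -1, 0, -1, 100],
    [-1, -1, -1, 0, -1, -1],
    [-1, 0, 0, -1, 0, -1],
    [0, -1, -1, 0, -1, 100],
    [-1, 0, -1, -1, 0, 100],
]
GAMMA = 0.8  # ASSUMPTION: discount factor
GOAL = 5     # ASSUMPTION: goal (terminal) state


def valid_actions(state):
    """Actions (next states) reachable from `state`."""
    return [a for a, reward in enumerate(R[state]) if reward >= 0]


def train(steps=5000, seed=0):
    """Run random-exploration Q-learning for a fixed number of updates."""
    rng = random.Random(seed)
    q = [[0.0] * len(R) for _ in R]
    state = rng.randrange(len(R))
    for _ in range(steps):
        action = rng.choice(valid_actions(state))
        next_state = action  # taking action a means moving to state a
        best_next = max(q[next_state][a] for a in valid_actions(next_state))
        q[state][action] = R[state][action] + GAMMA * best_next
        # Start a new episode from a random state once the goal is reached.
        state = rng.randrange(len(R)) if next_state == GOAL else next_state
    return q


def greedy_path(q, start, max_len=10):
    """Follow the highest-valued allowed action until the goal is reached."""
    path = [start]
    while path[-1] != GOAL and len(path) < max_len:
        s = path[-1]
        path.append(max(valid_actions(s), key=lambda a: q[s][a]))
    return path


q = train()
path = greedy_path(q, start=2)
print(path)
```

The exact Q values depend on the seed and the number of steps, but under these assumptions the greedy path from state 2 should end at the goal state.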

  • @raviraj1462
    @raviraj1462 1 year ago

    Thanks sir, the exam is tomorrow

  • @andrewhyc
    @andrewhyc 8 months ago +3

    Q(3,2) = R(3,2) + MAX(Q(2,5)) = 0. Why do I get Q(3,2) equal to 0, and how do I calculate 51?

    • @nikitarandive7283
      @nikitarandive7283 8 months ago

      Use the updated Q table, not the previous all-zero one
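
The difference between the two tables can be checked with a short sketch. Everything numeric here is an assumption not stated in this thread: the six-state reward matrix `R`, the discount factor `GAMMA = 0.8`, and the update rule Q(s,a) = R(s,a) + GAMMA * max Q(a, ·). Instead of random episodes, the sketch simply sweeps over all allowed state-action pairs repeatedly, which gives a deterministic result for this kind of problem; the final values are then scaled so the maximum becomes 100.

```python
# ASSUMPTION: reward matrix; -1 = not allowed, 0 = allowed, 100 = into the goal.
R = [
    [-1, -1, -1, -1, 0, -1],
    [-1, -1, -1, 0, -1, 100],
    [-1, -1, -1, 0, -1, -1],
    [-1, 0, 0, -1, 0, -1],
    [0, -1, -1, 0, -1, 100],
    [-1, 0, -1, -1, 0, 100],
]
GAMMA = 0.8  # ASSUMPTION: discount factor


def q_value(q, s, a):
    """One update: immediate reward plus discounted best value of the next state."""
    best_next = max(q[a][b] for b in range(len(R)) if R[a][b] >= 0)
    return R[s][a] + GAMMA * best_next


def solve(sweeps=200):
    """Repeatedly update every allowed (state, action) pair in place."""
    n = len(R)
    q = [[0.0] * n for _ in range(n)]
    for _ in range(sweeps):
        for s in range(n):
            for a in range(n):
                if R[s][a] >= 0:
                    q[s][a] = q_value(q, s, a)
    return q


# With an all-zero table the update gives 0, as in the question.
zero_q = [[0.0] * 6 for _ in range(6)]
first_try = q_value(zero_q, 3, 2)

# With the updated table the same entry is non-zero.
q = solve()
q_max = max(max(row) for row in q)
scaled = [[round(value / q_max * 100) for value in row] for row in q]
print(first_try, scaled[3][2], scaled[1][3])
```

Under these assumptions the raw value of Q(3,2) settles near 256 and the largest entry near 500, so the scaled entry rounds to 51, and Q(1,3) rounds to 64 in the same way.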

  • @NHAIushaprakki
    @NHAIushaprakki 1 year ago +2

    How did you get 51 and 64 as values? You should have explained those as well.

  • @bblindia6002
    @bblindia6002 29 days ago

    RTU folks