Q-Learning: A Complete Example in Python

Поділитися
Вставка
  • Опубліковано 23 січ 2025

КОМЕНТАРІ • 74

  • @meraldlag4336
    @meraldlag4336 3 роки тому +3

    This is easily the best resource on q-learning I’ve ever seen. It’s one simple library available to everyone on any OS (looking at you, *gym* ) and very well explained

  • @2NormalHuman
    @2NormalHuman 2 роки тому +1

    Wow this is the best qlearning video with code example that i found on the internet. Thanks for the detailed explanation of every step in the code

  • @gamuchiraindawana2827
    @gamuchiraindawana2827 8 місяців тому +1

    I don't believe anyone teaches it better than you. Amazing.

  • @kinez_vilicar8
    @kinez_vilicar8 4 роки тому +14

    Best Q-Learning lesson ever, better than my uni class. Thank you very much, please keep it up.

  • @shreyashgupta13
    @shreyashgupta13 Рік тому +1

    I am learning RL for last 1 year and this is one of the best video. You have taught everything from start to end. Thank you!

  • @takeiteasydragon
    @takeiteasydragon 8 місяців тому

    Extremely clear explanation for this topic. You are my life saver when I am preparing my finals. Thanks a lot.

  • @patrickng8805
    @patrickng8805 Рік тому +1

    The code is much cleaner and easier to understand than the example codes I found in many other Q-Learning books.

  • @kachappillyjean
    @kachappillyjean 2 роки тому +4

    That is some incredible teaching skill! Broke down a complex concept just to the right amount of detail . Really appreciate this lesson.

  • @Виктор-н8р7д
    @Виктор-н8р7д Рік тому

    Thank you so much Dr.Super! You're a very gifted teacher. Please don't stop.

  • @wennie2939
    @wennie2939 3 роки тому +1

    Best RL video ever in UA-cam! Thank you so much, Dr. Soper!

  • @Sunny-qe5el
    @Sunny-qe5el 2 роки тому

    WOW!!!
    Thanks a ton for this video mate.
    I have taken the course in reinforcement learning at university and this is by far the great way to make acquainted with Q-learning algorithm in reinforcement learning.

  • @darker2.0
    @darker2.0 Рік тому

    The best teacher i never seen. Respect.

  • @srinivasanbalan2469
    @srinivasanbalan2469 4 роки тому +2

    You are an excellent teacher, Dr. Soper. Appreciate your support and presentation.

  • @IsaacAsante17
    @IsaacAsante17 3 роки тому +9

    I'm really impressed by how easily you explained this. I'm going to watch the other videos in your Q-learning series, and subscribe as well. Thanks for this amazing RL tutorial, man.

    • @giancarlo1108
      @giancarlo1108 2 роки тому +1

      i never expected to see fl studio tutorial channel here lol

  • @cK1ab
    @cK1ab 2 роки тому +1

    Thankyou so much for this clear and practical example. Most videos I've seen just kind of breeze over the environment and states. Keep up the good work!

  • @farzero
    @farzero 2 роки тому

    Dude thanks a bunch. You gave just the right amount of detail and broke it down simply. Thanks for not bogging us down with a ton of details.

  • @JustSayin24
    @JustSayin24 4 роки тому +2

    Writing my dissertation and this is a God-send. Thank you :)

  • @tjcarter9286
    @tjcarter9286 5 місяців тому

    Thank you for this video, you've explained this excellently and I actually understand the concept now!

  • @wherearewegoing400
    @wherearewegoing400 4 роки тому +3

    Thank you very much Dr. Soper! I took great value from this video, as I am trying to implement some Q-Learning techniques into a project of my own. Looking forward to your future videos!

  • @deepaktripathi4094
    @deepaktripathi4094 Рік тому

    i am really enjoying going through your videos.

  • @christopherminhpham9603
    @christopherminhpham9603 2 роки тому

    Very easy to follow and understand Q-learning and see amazing computer 💻 working so well. Encourage myself to learning more about AI. I am not good at maths, and logical problems but I understand easily by your videos. Thank you so much 😊

  • @saurabhjain9542
    @saurabhjain9542 2 роки тому

    Very helpful and engaging lesson. Was having hard time understanding implementation but this visual explanation with code implementation made it very clear. Great job!!

  • @alinelermer6664
    @alinelermer6664 3 роки тому

    Excellent explanation with such a pleasant voice! Thank you so much.

  • @samarpratapsingh9788
    @samarpratapsingh9788 4 роки тому

    Such a complex topic is explained without any hassle!

  • @alisherabdulkhaev1881
    @alisherabdulkhaev1881 11 місяців тому

    Awesome! Thank you very much, so intuitive and easy to understand video!!

  • @KSATica
    @KSATica Рік тому

    You sir THANK YOU.. you broke it down very easy to comprehend and learn. Thank you and thank you

  • @EttoreMastrogiacomo
    @EttoreMastrogiacomo 3 роки тому

    Best Q - Learning lesson, congratulation and thank u

  • @johnmathew3580
    @johnmathew3580 3 роки тому

    So peaceful yet very informative. Love this style.

  • @danalex2991
    @danalex2991 2 роки тому

    Amazing video. Better than my uni explanation!

  • @ronaldleung9975
    @ronaldleung9975 4 роки тому +2

    Perfect tutorial with clear sample code, but only 646 views? This video deserves better!

    • @ronaldleung9975
      @ronaldleung9975 4 роки тому +2

      Most other videos are just using GYM or Unity library for their video, which are just show-offs, instead of really teaching something.

  • @alialhilo5696
    @alialhilo5696 4 роки тому

    very beneficial for my master's thesis. THANKS!!

  • @j_owatson
    @j_owatson 4 роки тому

    Thanks man this video helped out by quite a lot. keep up the good work

  • @bonadio60
    @bonadio60 2 роки тому

    Fantastic lesson!! Thanks so much

  • @xlz9235
    @xlz9235 2 роки тому

    Excellent lesson!Thank you so much

  • @johnneilcotacte2660
    @johnneilcotacte2660 4 роки тому

    error in the get_next_location function. actions not define

  • @balalakshmiar9456
    @balalakshmiar9456 3 роки тому

    Amazing explanation. Thanks a lot.

  • @vegeta3993
    @vegeta3993 4 роки тому

    Great video, very well made.

  • @ImtithalSaeed
    @ImtithalSaeed 2 роки тому

    why the agent gets -1 in its walking area.

  • @premithmendis8004
    @premithmendis8004 3 роки тому

    it is very good that this can automatically show us the shortest path but what if we need to know the q table or the updated table which program uses to find the best action from up down right or left. i meant any action is taken from the updated table. if we can obtain that it is a huge success as well.

    • @premithmendis8004
      @premithmendis8004 3 роки тому

      when it comes to a certain point it has to decide where to go so updated table shows the values of each direction, highest value will be selected

  • @abhaychandra2624
    @abhaychandra2624 7 місяців тому

    WHAT AN AWESOME VIDEO

  • @moienr4104
    @moienr4104 4 роки тому

    Your are amazing sir, keep up the good work, thank you

  • @asifferdous2418
    @asifferdous2418 4 роки тому

    what type of q learning is used in this?

  • @kushankbansal
    @kushankbansal 2 роки тому

    Excellent video sir

  • @gemini_537
    @gemini_537 8 місяців тому

    Gemini: This video is about a complete walkthrough of a Q-learning based AI system in Python.
    The video starts with an introduction to the business problem. The problem is about designing a warehouse robot that can travel around the warehouse to pick up items and bring them to a packaging area. The robot needs to learn the shortest path between all the locations in the warehouse.
    Then the video explains the concept of Q-learning, which is a reinforcement learning technique. Q-learning works by letting an agent learn from trial and error. The agent receives rewards for taking good actions and penalties for taking bad actions. Over time, the agent learns to take the actions that will lead to the greatest reward.
    Next, the video dives into the code. The code defines the environment, which includes the states, actions, and rewards. The states are all the possible locations of the robot in the warehouse. The actions are the four directions that the robot can move (up, down, left, and right). The rewards are positive for reaching the packaging area and negative for all other locations.
    The code also defines a Q-learning agent. The agent starts at a random location in the warehouse and then takes a series of actions. The agent learns from the rewards that it receives for its actions. Over time, the agent learns to take the shortest path to the packaging area.
    Once the agent is trained, the video shows how to use the agent to find the shortest path between any two locations in the warehouse. The video also shows how to reverse the path so that the robot can travel from the packaging area to any other location in the warehouse.
    Overall, this video is a great introduction to Q-learning and how it can be used to solve real-world problems.

  • @alexjoseph8433
    @alexjoseph8433 3 роки тому

    very well explained.. Thankyou

  • @ramakantpathak7148
    @ramakantpathak7148 3 роки тому

    Thanks for easy presentation

  • @sezaiunver1454
    @sezaiunver1454 2 роки тому

    can u do deep q learning complete example please? i really need that one :)

  • @brookealviar6415
    @brookealviar6415 4 роки тому

    So helpful! Thank you!

  • @paperstars9078
    @paperstars9078 4 роки тому +1

    this is awesome. I wonder how long it to took to do all this.

  • @maltrho
    @maltrho Рік тому

    the background music for this is just perfect! (Anybody knows if it is something easily accesable?)

  • @sivakumar-uj4fu
    @sivakumar-uj4fu 4 роки тому

    My humble thanks Dr.Daniel for such a clear description of Q-learning in python.
    I am not able to access the notebook for the code.
    Kindly could you help me regarding this. I want to practice programming for creating an environment and execute q-leaning on it.

  • @reihanehmirjalili7467
    @reihanehmirjalili7467 3 роки тому

    AMAZING!!!! Thanks a lot

  • @richardngweisiong2895
    @richardngweisiong2895 4 роки тому

    Appreciate your knowledge sharing.

  • @namegoeshere3398
    @namegoeshere3398 4 роки тому

    It is not working for me. (code on the website has errers)

    • @j_owatson
      @j_owatson 4 роки тому

      Have you installed all the required libraries and are running the correct version of python? And if your running the code on the website run the code in order.
      Edit: try running it natively on your computer

  • @tuongnguyen9391
    @tuongnguyen9391 Рік тому

    This is quite strange to me, it seems that the agent does not use any of the q-value to create the shortest path

  • @znypr
    @znypr 3 роки тому +1

    Really good video and helped a lot!
    However the piano is a bit too loud :D maybe -20%? :D

  • @aminabensalem35
    @aminabensalem35 3 роки тому

    Great content

  • @JanMan37
    @JanMan37 3 роки тому

    Thank you for the video. It was really clear and helpful. I have one question.
    In obtaining the shortest path after training, could you explain why epsilon is 0.9? Shouldn't the epsilon = 1.0 to maximize the rewards?

    • @GeromesCat
      @GeromesCat 3 роки тому

      He explains it briefly in the video at 13:00. The way I understand it is that you do sometimes want the AI agent to take random actions, instead of always taking the best action, so that it has a chance to explore the environment. Otherwise the AI, as soon as it finds a route to the 'item packaging area', might think it has already found the best route, and stop looking for a better one.

  • @yanfeixiang2548
    @yanfeixiang2548 4 роки тому

    Thank you Dr. Daniel ! This is a excellent Q-leaning instructional video includ comprehensive theory part and practical implementation. I want to inquiry is it still possible to find the link to this notebook now ? I don't find where is the video descroption part.

  • @maniys
    @maniys 3 роки тому

    this is great!!!!!!!!

  • @THE_ONLY_GOD
    @THE_ONLY_GOD 2 роки тому

    Thanks for video! However, I don't think that making every navigable tile have a -1 is optimal. If the robot has a bitmap of the environment, and can reference the location it is at, then can base a navigable tiles reward system that increments up to MUCH better GUIDE the robot toward the goal. Rather than a binary goal.

  • @escapefelicity2913
    @escapefelicity2913 3 роки тому +2

    Get rid of the background noise

  • @kimjong-un4521
    @kimjong-un4521 Рік тому

    What a legend

  • @Anonim70498
    @Anonim70498 4 роки тому

    Thank you

  • @nikolakrivokapic4962
    @nikolakrivokapic4962 3 роки тому +1

    4:50 poor robots, they are never going to be good enough :(

  • @pammasinghkainth
    @pammasinghkainth 9 місяців тому

    background music in very annoying! But lesson was good

  • @MegaDixen
    @MegaDixen 2 роки тому

    Really nice lesson , my unversity prof is so bad compared to this.