Reinforcement Learning from scratch

  • Published 29 Nov 2024

COMMENTS • 51

  • @darthvader4899
    @darthvader4899 8 months ago +41

    This video is super underrated. In fact, the whole channel is underrated.

    • @william_8844
      @william_8844 4 months ago

      Maybe I should follow the channel then 😅.
      This was my first vid, and the explanation was really well simplified.

  • @themathguy3149
    @themathguy3149 1 year ago +9

    Your Channel IS SO GREAT, I share with all my eng friends for you to get more visibility!

  • @tushargupta1999
    @tushargupta1999 8 months ago +5

    This video is amazing. You explained everything in such a simple manner. I am feeling really motivated to learn more about reinforcement learning and neural networks after watching this.

  • @ashketchum1244
    @ashketchum1244 1 year ago +6

    I don't know how I stumbled upon this video but that was very interesting and intuitive to understand. Thank you.

  • @jameslibby5215
    @jameslibby5215 1 year ago +8

    Very very underrated channel

    • @benc7910
      @benc7910 10 months ago

      Underrated, two Rs

    • @jameslibby5215
      @jameslibby5215 10 months ago

      @@benc7910 thank ya sir

  • @Arivan_Abdulla
    @Arivan_Abdulla 4 months ago +3

    Too beautiful. You can watch this kind of video all day without getting bored.

  • @mind6861
    @mind6861 5 months ago +28

    Can we have the code for this?

  • @themax2go
    @themax2go 8 months ago +4

    agi: 1. ai develops understanding of win-loss conditions and sets policy params (inputs & actions) accordingly. 2. ai creates (= designs & builds) training env(s). 3. ai iterates, avals & adjusts policy parameters accordingly 4. done (or validation run(s) w/ human(s))

  • @metaljacket8102
    @metaljacket8102 7 months ago +2

    This is really awesome! It's the best video that explains DRL in such an easy-to-understand way!

  • @Bet-s4g
    @Bet-s4g 2 months ago +1

    This is a super underrated video

  • @a.aspden
    @a.aspden 1 year ago +2

    Your videos are great. Looking forward to more!

  • @cloudysh
    @cloudysh 7 months ago +1

    This was so surprisingly great :3

  • @CptDoge-rn3ou
    @CptDoge-rn3ou 1 year ago +2

    I really like the way you visualize what you are talking about. Thank you for putting in the effort!

  • @Sumpydumpert
    @Sumpydumpert 5 months ago +1

    I agree once you see how it all works it seems like 1s and zeros give me some feed back on r/grand unified theory or cosmo knowledge

  • @marcinstrzesak346
    @marcinstrzesak346 1 year ago +1

    Great video, very helpful, easy to understand.

  • @moldo800
    @moldo800 10 months ago +1

    Excellent. Congratulations ❤

  • @swannschilling474
    @swannschilling474 5 months ago +1

    Thanks a lot for this one! 😊

  • @luiseduardocraizer7416
    @luiseduardocraizer7416 6 months ago +1

    Excellent content!

  • @anthonyortiz7924
    @anthonyortiz7924 2 months ago

    What a great series! I have a question for the experts... was it necessary to map velocity as an input? I'm guessing it's not absolutely necessary and was done to make the training faster? My guess is based on the assumption that the timing of the ball x/y changes to the inputs have an effect, but I may be wrong.

  • @gmjammin4367
    @gmjammin4367 1 year ago +1

    Amazing video as always :)!

  • @BlueBirdgg
    @BlueBirdgg 1 year ago +1

    Can you playlist each one of your topics plz?
    I wanted to post on Twitter(X) your video topics but could only post a single video at a time.
    Great content by the way. Ty very much.
    Your perspective on some topics helped me a lot to get a more intuitive understanding.

    • @g5min
      @g5min 1 year ago

      Good idea! Here's one on generative AI:
      ua-cam.com/play/PLWfDJ5nla8UoR8P7AGqVw7ZPjXajUFLMo.html
      Here's one on reinforcement learning
      ua-cam.com/play/PLWfDJ5nla8UoexEaLqVMw7q3Ft0vRYscL.html
      Here's one on LLMs + text-to-image
      ua-cam.com/play/PLWfDJ5nla8UoG2mvvHs_OS0asAKC5HJeu.html

    • @BlueBirdgg
      @BlueBirdgg 1 year ago

      @@g5min Ty!

  • @jaideepraulji1395
    @jaideepraulji1395 3 months ago +1

    Superb

  • @mohajeramir
    @mohajeramir 7 months ago +2

    Excellent

  • @mado.madeleine
    @mado.madeleine 1 year ago +1

    Super helpful! Thank you 🙏🏽

  • @jdlopes06
    @jdlopes06 5 months ago +1

    Thank you!

  • @william_8844
    @william_8844 4 months ago

    I get how the model can see moves and output an up or down action. But I don't get how the model tracks the score for rewards, etc.
    Can someone explain how the reward is fed into the model?

  • @edvinbeqari7551
    @edvinbeqari7551 10 months ago

    What is your reward function for the pong game? I did a similar pong game and I couldn't get it to learn.

  • @nikbivation
    @nikbivation 1 year ago +1

    thank you for this!

  • @ireoluwaTH
    @ireoluwaTH 1 year ago +1

    Thank you!!!

  • @bombur9007
    @bombur9007 7 months ago

    How many layers should such a network have?

  • @n4mmenam
    @n4mmenam 1 year ago +1

    Brilliant

  • @mineq4967
    @mineq4967 8 months ago

    But by what number do you change the weights? You never told us.

  • @NR_5tudio
    @NR_5tudio 1 month ago

    I just have a question: what is that thing at 6:20? It's like a worm?
    Like, I didn't take it in my math class... I'm 16 years old, btw.
    I mean the one you added.

  • @maxim_ml
    @maxim_ml 6 months ago +1

    that was good

  • @axe863
    @axe863 1 year ago +2

    Simple Reinforcement learning is extremely dangerous in certain nonstationary environments 😅

  • @nischalyou
    @nischalyou 1 year ago

    What's the name of this video game?

  • @gaydemaupassant6263
    @gaydemaupassant6263 5 months ago

    Pls I want the code plsss

  • @FRANKONATOR123
    @FRANKONATOR123 1 year ago

    Can you share the source code for this project?

    • @g5min
      @g5min 1 year ago

      You can follow the link to the Karpathy site at the end of the video, repeated here:
      karpathy.github.io/2016/05/31/rl/

  • @herikaniugu
    @herikaniugu 1 year ago

    Imagine using reinforcement learning in quantitative finance 😊

  • @macratak
    @macratak 1 year ago

    ah yes, reinforcement learning. a fundamental computer graphics technology

    • @g5min
      @g5min 1 year ago +6

      I think that character/game-AI is pretty central to graphics

    • @pw7225
      @pw7225 1 year ago +1

      Why so negative?

    • @revimfadli4666
      @revimfadli4666 1 year ago

      @@g5min especially AI image generation or processing nowadays