Proximal Policy Optimization Explained

  • Published 4 Nov 2024

COMMENTS • 23

  • @sordesderisor
    @sordesderisor 2 years ago +9

    If you also read the TRPO and PPO papers, this video provides the perfect concise summary of PPO!

  • @aramvanbergen4489
    @aramvanbergen4489 3 years ago +33

    Thank you for the clear explanation! But next time, please use screenshots of the actual formulas; that way it is much more readable.

  • @alph4b3th
    @alph4b3th 1 year ago +2

    Sensational! Dude, you explain in such a simple way! I was wondering what the difference was between deep Q-Learning and PPO, and I was looking for exactly a video like this. Congratulations on your great didactic way of explaining the basic mathematical concepts and abstracting them to a more intuitive approach; you are really very good at this! Excellent video!

  • @sayyidj6406
    @sayyidj6406 8 months ago

    I wish I had known about this channel sooner. Thanks for the video.

  • @GnuSnu
    @GnuSnu 1 year ago +12

    4:25 "let me write it real quick" 💀💀

  • @James-qv1lh
    @James-qv1lh 1 year ago +2

    Insanely good video! Simple and straight to the point - thanks so much! :)

  • @canoksuzoglu6540
    @canoksuzoglu6540 1 month ago

    Thanks, dude. That was a perfect explanation.

  • @carloscampo9119
    @carloscampo9119 1 year ago

    That was very, very well done. Thank you for the clear explanation.

  • @alexkonopatski429
    @alexkonopatski429 2 years ago +5

    I really love your vids and how you explain things! Could you please make a video about TRPO? In my opinion it is really complex to understand, and the lack of available resources doesn't help. I, and I think a lot of others, would be really glad to have a good explanation!
    Thanks in advance

  • @ivanwong863
    @ivanwong863 3 years ago +5

    DQN is not an offline method, is it?

    • @EdanMeyer
      @EdanMeyer 3 years ago +8

      My bad, I meant to say it’s an off-policy method. Q-learning performs very poorly in an offline setting.

  • @datonefaridze1503
    @datonefaridze1503 2 years ago +1

    Thank you for your effort, I really appreciate it. You are working so that we can learn, thanks.

  • @boldizsarszabo883
    @boldizsarszabo883 1 year ago

    This video was super helpful and informative! Thank you so much for your effort!

  • @anibus1106
    @anibus1106 7 months ago

    Thank you so much, you saved my day.

  • @hemanthvemuluri9997
    @hemanthvemuluri9997 10 months ago

    For DQN, you mean an off-policy method, right? DQN is not an offline method.

  • @FlapcakeFortress
    @FlapcakeFortress 2 years ago

    Much appreciated. Cheers!

  • @vadimavkhimenia5806
    @vadimavkhimenia5806 3 years ago

    Can you make a video on MADDPG with code?

  • @LatpateShubhamManikrao
    @LatpateShubhamManikrao 2 years ago

    Nicely explained, man.

  • @awaisahmad5908
    @awaisahmad5908 7 months ago

    Thanks

  • @labreynth
    @labreynth 2 months ago

    Damn. I learned nothing.