Foundation of Q-learning | Temporal Difference Learning explained!

  • Published 29 Oct 2023
  • Let's talk about Temporal Difference Learning, the foundational concept behind Q-learning and SARSA.
    ABOUT ME
    ⭕ Subscribe: ua-cam.com/users/CodeEmporiu...
    📚 Medium Blog: / dataemporium
    💻 Github: github.com/ajhalthor
    👔 LinkedIn: / ajay-halthor-477974bb
    RESOURCES
    [1] Reinforcement Learning book: incompleteideas.net/book/RLboo...
    [2] Paradigms of ML: idapgroup.com/blog/types-of-m...
    [3] Model Free vs Model Based RL: spinningup.openai.com/en/late...
    [4] Bellman Equation video: • Bellman Equation - Ex...

COMMENTS • 18

  • @noahgsolomon 1 month ago +2

    The breakdown of the one-sentence explanation is so useful

  • @PrymeOrigin 6 months ago +10

    You have a gift for teaching, and I'm very thankful to have found someone who breaks concepts down so simply and makes them easy to digest

  • @LuthandoMaqondo 7 months ago +6

    Nice, quick and straight to the point.

  • @al_parlam 4 months ago +1

    Man, your explanation is gorgeous! You are remarkable at explaining complex things. Keep doing what you are doing :) I wish you much luck with your channel

  • @LaveshNK 3 months ago

    Fantastic video... I have an RL assignment due and I had no idea what TD error even meant. You are great at explaining

  • @akshaypansari111111 7 months ago

    Thanks a lot. This is really helpful. I will check out the Bellman equation video as well

  • @li-pingho1441 7 months ago

    awesome explanation!

  • @krishnavinukonda1882 2 months ago

    This is the best. Thanks!

  • @krzysztofjarek6476 7 months ago

    Great lecture 😉

  • @minapagliaro7607 2 months ago

    Great video!!!!

  • @slitihela1860 3 months ago +1

    Can you please prepare a video on Double Q-Learning Networks and Dueling Double Q-Learning Networks?

  • @davidlieber3494 6 months ago

    great video, thanks!

    • @CodeEmporium 6 months ago

      You are very welcome. Thanks for commenting

  • @yep3659 3 months ago

    I'm craving some tempura now

  • @redrose5406 7 months ago

    Post more about GANs

  • @satyamdubey4110 3 months ago

    💖💖