Elements of Reinforcement Learning

Поділитися
Вставка
  • Опубліковано 5 чер 2024
  • Elements of Reinforcement Learning
    ABOUT ME
    ⭕ Subscribe: ua-cam.com/users/CodeEmporiu...
    📚 Medium Blog: / dataemporium
    💻 Github: github.com/ajhalthor
    👔 LinkedIn: / ajay-halthor-477974bb
    RESOURCES
    [1] Reinforcement Learning book: incompleteideas.net/book/RLboo...
    [2] Paradigms of ML: idapgroup.com/blog/types-of-m...
    [3] Pong: • DQN Breakout
    [4] Learning to walk: • Emergence of Locomotio...
    [5] ChatGPT blog: openai.com/blog/chatgpt
    [6] Chess: www.kaggle.com/code/arjanso/r...
    [7] Model Free vs Model Based RL: spinningup.openai.com/en/late...
    PLAYLISTS FROM MY CHANNEL
    ⭕ Reinforcement Learning: • Reinforcement Learning...
    Natural Language Processing: • Natural Language Proce...
    ⭕ Transformers from Scratch: • Natural Language Proce...
    ⭕ ChatGPT Playlist: • ChatGPT
    ⭕ Convolutional Neural Networks: • Convolution Neural Net...
    ⭕ The Math You Should Know : • The Math You Should Know
    ⭕ Probability Theory for Machine Learning: • Probability Theory for...
    ⭕ Coding Machine Learning: • Code Machine Learning
    MATH COURSES (7 day free trial)
    📕 Mathematics for Machine Learning: imp.i384100.net/MathML
    📕 Calculus: imp.i384100.net/Calculus
    📕 Statistics for Data Science: imp.i384100.net/AdvancedStati...
    📕 Bayesian Statistics: imp.i384100.net/BayesianStati...
    📕 Linear Algebra: imp.i384100.net/LinearAlgebra
    📕 Probability: imp.i384100.net/Probability
    OTHER RELATED COURSES (7 day free trial)
    📕 ⭐ Deep Learning Specialization: imp.i384100.net/Deep-Learning
    📕 Python for Everybody: imp.i384100.net/python
    📕 MLOps Course: imp.i384100.net/MLOps
    📕 Natural Language Processing (NLP): imp.i384100.net/NLP
    📕 Machine Learning in Production: imp.i384100.net/MLProduction
    📕 Data Science Specialization: imp.i384100.net/DataScience
    📕 Tensorflow: imp.i384100.net/Tensorflow

КОМЕНТАРІ • 14

  • @CodeEmporium
    @CodeEmporium  8 місяців тому +5

    If you think I deserve it, please give this video a like as it will help circulate the video immensely. Thank you so much for the support so far !

  • @pj-nz6nm
    @pj-nz6nm 8 місяців тому +9

    Please make more videos on reinforcement learning,my knowledge about this field is very poor.

    • @CodeEmporium
      @CodeEmporium  8 місяців тому +2

      I definitely shall make more videos! Thanks for the comment !

  • @sloth_in_socks
    @sloth_in_socks 26 днів тому

    Great video! It's funny you mentioned unsupervised learning at the start but didn't mention LLMs

  • @minlingg91
    @minlingg91 8 місяців тому

    keep up the good work! im currently doing a traineeship in AI and your videos have been immensely helpful.

  • @amiralioghli8622
    @amiralioghli8622 8 місяців тому

    Thank you, sir, for sharing valuable information through your UA-cam channel. Once again, I have a request: please create a series on how to apply Transformers to time series tasks such as anomaly detection, forecasting, or classification. Working on just one of these tasks would be sufficient for us. I have followed numerous articles, short notes, and videos regarding the application of Transformers to time series data, but it is still not clear to me. I am a beginner on this Transformer journey, and there are no useful videos available on UA-cam overall.

  • @casualpasser-by5954
    @casualpasser-by5954 8 місяців тому

    Very nice, short and clear overview of reinforcement learning! However, in the end of the video, I think, the distinction between model-free and model-based algorithms wasn't explained well. It is not about does one train an algorithm on the simulatied or real-world data. Is real world the source of the information or it is a simulation - from the algorithmic point of view the information is in both cases just numbers, produced by some external environment. The real difference between model-free and model-based is that model-based algorithms have intrinsic model within them, which is adjusted during the training to better predict the behaviour of the environment. Of there is no such trainable model within an algorithm and we have only fixed external simulation - we still follow model-free approach.
    Sorry for my English.

    • @CodeEmporium
      @CodeEmporium  7 місяців тому

      Thanks for this! Honestly I think I am in agreement and this goes to show maybe the words I used in the end to describe this is confusing. Perhaps with this definition , a more concrete example would have been helpful like I had given the others. But I treated that last piece more like a footnote. I’ll probably dedicate more videos and time to this :)

  • @slitihela1860
    @slitihela1860 3 місяці тому

    can you prepare a video for Double Q-Learning Network
    and Dueling Double Q-Learning Network
    please

  • @mdbayazid6837
    @mdbayazid6837 8 місяців тому +1

    I would request for a book reading camp if possible

    • @CodeEmporium
      @CodeEmporium  8 місяців тому +1

      Ooo this is a fun topic! I shall consider

  • @arunima29
    @arunima29 8 місяців тому

    Please make detailed videos on all the concepts of RL.

  • @MilesBellas
    @MilesBellas 8 місяців тому

    1.7x speed = best