Introduction to Multi-Agent Reinforcement Learning

Поділитися
Вставка
  • Опубліковано 2 сер 2022
  • Learn what multi-agent reinforcement learning is and some of the challenges it faces and overcomes.
    You will also learn what an agent is and how multi-agent systems can be both cooperative and adversarial. Be walked through a grid world example to highlight some of the benefits of both decentralized and centralized reinforcement learning architectures.
    Watch our full video series about Reinforcement Learning: • Reinforcement Learning
    By the end of this series, you’ll be better prepared to answer questions like:
    - What is reinforcement learning and why should I consider it when solving my control problem?
    - How do I set up and solve the reinforcement learning problem?
    - What are some of the benefits and drawbacks of reinforcement learning compared to a traditional controls approach?
    Artificial intelligence, machine learning, deep neural networks. These are terms that can spark your imagination of a future where robots are thinking and evolving creatures.
    Check out these other resources:
    - Try MATLAB Example: Train Multiple Agents for Area Coverage: bit.ly/3Ix1Kf7
    - Read paper: Multi-agent reinforcement learning: An overview: bit.ly/3nVrNmN
    - Read paper: Multi-Agent Reinforcement Learning: Independent vs. Cooperative Agents: bit.ly/3nVK7My
    - Read paper: Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning: bit.ly/3v7LxaT
    Check out the individual videos in the series:
    • What Is Reinforcement Learning?: • What Is Reinforcement ...
    • Understanding the Environment and Rewards: • Understanding Reinforc...
    • Policies and Learning Algorithms: • Reinforcement Learning...
    • The Walking Robot Problem: • Solving the Walking Ro...
    • Overcoming the Practical Challenges: • Overcoming the Practic...
    • An Introduction to Multi-Agent Reinforcement Learning: • Introduction to Multi-...
    • Why Choose Model-Based Reinforcement Learning?: • Why Choose Model-Based...
    --------------------------------------------------------------------------------------------------------
    Get a free product trial: goo.gl/ZHFb5u
    Learn more about MATLAB: goo.gl/8QV7ZZ
    Learn more about Simulink: goo.gl/nqnbLe
    See what's new in MATLAB and Simulink: goo.gl/pgGtod
    © 2022 The MathWorks, Inc. MATLAB and Simulink are registered trademarks of The MathWorks, Inc.
    See www.mathworks.com/trademarks for a list of additional trademarks. Other product or brand names may be trademarks or registered trademarks of their respective holders.
  • Наука та технологія

КОМЕНТАРІ • 16

  • @ShahFahad-hj1ps
    @ShahFahad-hj1ps Рік тому +6

    Hello Brian, You are doing an amazing job. I hope to see a series prepared by you for using graph learning applications using deep reinforcement learning. In addition to your existing accomplishments, this will truly be something amazing.

  • @oldcowbb
    @oldcowbb Рік тому +8

    holy shit brian and multiagent and reinforcement learning? count me in

  • @musa_b
    @musa_b Рік тому +6

    Never, loose motivation to post something like this.

  • @sapienspace8814
    @sapienspace8814 9 місяців тому

    Great video, thank you for sharing!

  • @pianodavid9676
    @pianodavid9676 4 місяці тому

    Excellent explanation, thank you :)

  • @arandomfrenchdev
    @arandomfrenchdev Рік тому

    Great video, thanks !

  • @Pedritox0953
    @Pedritox0953 Рік тому

    Great video!

  • @EngineeringSimplified
    @EngineeringSimplified Рік тому +1

    That was great, Brian. One question- when going from one iteration of reinforcement learning to the next, how does the robot know what change to make? Is the change random and then the robot calculates the total reward points at the end and if this is better than last one, selects this as a better path? Or is there a certain methodology it follows on how to select the change to make?
    Hope the question makes sense.

  • @proninety7587
    @proninety7587 10 місяців тому

    12:10 The problem is that, agents may not be optimised if another agent covers a new block. It’s should use the entire coverage map AND cross reference that agents mapping to make sure that even though the whole improved, the parts still need to learn to optimise their improvement.

  • @kundankumar-dt5uu
    @kundankumar-dt5uu Рік тому

    @MATLAB, sir I want an Agent should work like variable switching frequency controller, please guide sir

  • @cuongnguyenuc1776
    @cuongnguyenuc1776 3 місяці тому

    nice video!! but what is the drawbacks of the centralized RL by the way? Its seem to be better and more suitable for multi-agent problem than decentralized.

  • @artem_isakow
    @artem_isakow 8 місяців тому

  • @Alpacabowl98
    @Alpacabowl98 19 днів тому

    you dont respond so why bother