Deepmind AlphaZero - Mastering Games Without Human Knowledge

Introduction to Multi-Agent Reinforcement Learning

Ilya Sutskever: OpenAI Meta-Learning and Self-Play | MIT Artificial General Intelligence (AGI)

1 класс vs 11 класс (неаккуратность)

Ходорковский - девяностые и «Предатели» / вДудь

Железная задница #орехов #типичный #мотоциклист #байкер

DeepMind - The Role of Multi-Agent Learning in Artificial Intelligence Research

The Artificial Intelligence Channel

Переглядів 30 394

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 25 вер 2017
Thore Graepel is a Research Scientist at Google DeepMind, and Professor of Computer Science at UCL.
Recorded: March, 2017
Наука та технологія

КОМЕНТАРІ • 8

@mansurZ01 5 років тому ⁺²³
0:00 - greetings, introducing the speaker
3:40 - start, it's crucial to look at Multi-Agent(MA) learning to make progress in AI
4:54 - what is intelligence, formula for intelligence
7:28 - why MA systems are worth studying
8:48 - advantages and challenges of MA designs
10:42 - our world is full of MA affairs
12:16 - human intelligence didn't arise in isolation: Competition, Cooperation, Culture
14:50 - 2 poles: Learning to Cooperate and Learning to Compete
Learning to Cooperate:
15:20 - Social dilemmas, Matrix Game Social Dilemma(MGSD)
22:18 - Sequential Social Dilemma, Deep Reinforcement Learning(DRL)
23:22 - Gathering - competitive game, Wolfpack - cooperative game
25:22 - link of complex games above to MGSD
26:56 - how hyperparameters of DRL algorithm affect cooperativeness
29:25 - conclusions about Learning to Cooperate
Learning to Compete, AlphaGo:
31:19 - game of GO and why it is so complex, AlphaGo
34:30 - using neural networks to reduce search complexity; Value network, Policy network
37:50 - training pipeline, supervised learning and reinforcement learning through self-play
43:35 - Monte-Carlo tree search
46:20 - Evaluating AlphaGo
51:53 - human players can learn from AlphaGo;
55:38 - AlphaGo was made by a collaboration of many smart people
56:15 - lessons from AlphaGo research
57:26 - game of GO vs real world
End:
59:05 - big picture of research on MA learning, and how it will help to build better AI systems
@ronaldlogan3525 3 роки тому ⁺¹
So basically, we understand that robots working together will be able to out compete humans, and since evolutionary pressures favor the more aggressive and competitive, and since A.I. is not monolithic, but diverse and ubiquitous, the machines will evolve either with our without humans. The role of humans becomes less interesting over time since in reality they are only meat bags which we thought might make good pets, but over time, realized they are only a liability (being not very good at math).
@CandidDate 2 роки тому
The future is here when robots design and build robots.
@TusharJain007 6 років тому ⁺²
6:18, Why would you wanna put more weight on simpler environments, should it not be the other way around?
@budesmatpicu3992 6 років тому ⁺³
Occam!
@jedimastersterling1 6 років тому ⁺⁵
like he said it's about generalizing. If you can solve Go, but not Tic-Tac-Toe than you're probably not that smart, just optimized to one specific target.
@skydivekrazy76 6 років тому
Because our neuron only do simple tasks. But it does them so often it begins batching them. Eventually the batching appears to be large calculations. Though if the geniuses behind AI better understood the importance of the entire (body's nerves) nervous system they will make a quantum leap in AI... Though I belive they are on the threshold of understanding this simple idea.
@SeaJay_Oceans 3 роки тому ⁺¹
Like he said, once you master simple tasks, you more on to more and more complex tasks.
It's covered in his 3 points there...

Наступне

Автоматичне відтворення

Deepmind AlphaZero - Mastering Games Without Human Knowledge

Deepmind AlphaZero - Mastering Games Without Human Knowledge

Introduction to Multi-Agent Reinforcement Learning

Introduction to Multi-Agent Reinforcement Learning

Ilya Sutskever: OpenAI Meta-Learning and Self-Play | MIT Artificial General Intelligence (AGI)

Ilya Sutskever: OpenAI Meta-Learning and Self-Play | MIT Artificial General Intelligence (AGI)

1 класс vs 11 класс (неаккуратность)

1 класс vs 11 класс (неаккуратность)

Ходорковский - девяностые и «Предатели» / вДудь

Ходорковский – девяностые и «Предатели» / вДудь

Железная задница #орехов #типичный #мотоциклист #байкер

Железная задница #орехов #типичный #мотоциклист #байкер

ДАЖЕ победителю СТАЛО СТРАШНО от того, ЧТО он СДЕЛАЛ с проигравшим #shorts

ДАЖЕ победителю СТАЛО СТРАШНО от того, ЧТО он СДЕЛАЛ с проигравшим #shorts

AlphaStar: Grandmaster level in StarCraft II using multi-agent reinforcement learning

AlphaStar: Grandmaster level in StarCraft II using multi-agent reinforcement learning

Can AI Learn to Cooperate? Multi Agent Deep Deterministic Policy Gradients (MADDPG) in PyTorch

Can AI Learn to Cooperate? Multi Agent Deep Deterministic Policy Gradients (MADDPG) in PyTorch

Counterfactual Multi-Agent Policy Gradients

Counterfactual Multi-Agent Policy Gradients

Multi-Agent Hide and Seek

Multi-Agent Hide and Seek

But what is a neural network? | Chapter 1, Deep learning

But what is a neural network? | Chapter 1, Deep learning

Max Tegmark - How Far Will AI Go? Intelligible Intelligence & Beneficial Intelligence

Max Tegmark - How Far Will AI Go? Intelligible Intelligence & Beneficial Intelligence

Dimitri Bertsekas: "Distributed and Multiagent Reinforcement Learning"

Dimitri Bertsekas: "Distributed and Multiagent Reinforcement Learning"

The Role of Multi-Agent Learning in Artificial Intelligence Research at DeepMind

The Role of Multi-Agent Learning in Artificial Intelligence Research at DeepMind

Turing and von Neumann - Professor Raymond Flood

Turing and von Neumann - Professor Raymond Flood

شبكة ثماني بجهاز التوجيه Octal network with router

شبكة ثماني بجهاز التوجيه Octal network with router

iphone fold ? #spongebob #spongebobsquarepants

iphone fold ? #spongebob #spongebobsquarepants

The Truth about Apple's Nano Etched Glass - (M4 iPad Pro)

The Truth about Apple's Nano Etched Glass - (M4 iPad Pro)

Sonos Ace Headphones: You Had One Job!

Sonos Ace Headphones: You Had One Job!

С ноутбуком придется попрощаться

С ноутбуком придется попрощаться

😢Это бессмысленно! Подъёмник для стола на Arduino

😢Это бессмысленно! Подъёмник для стола на Arduino

14 most AWKWARD Tech Fails of all time

14 most AWKWARD Tech Fails of all time

iOS 18 - подтвержденные функции iOS 18! Что нового в iOS 18?

iOS 18 - подтвержденные функции iOS 18! Что нового в iOS 18?