Geometric Intuition for Training Neural Networks
- Published 24 Nov 2019
- Leo Dirac (@leopd) gives a geometric intuition for what happens when you train a deep learning neural network. He starts with a physics analogy for how SGD works, then describes the shape of neural network loss surfaces.
This talk was recorded live on 12 Nov 2019 as part of the Seattle Applied Deep Learning (sea-adl.org) series.
References from the talk:
Loss Surfaces of Multilayer networks arxiv.org/pdf/1412.0233.pdf
Sharp minima papers:
- Modern take: arxiv.org/abs/1609.04836
- Hochreiter & Schmidhuber 1997: www.bioinf.jku.at/publications...
SGD converges to limit cycles: arxiv.org/pdf/1710.11029.pdf
Entropy-SGD: arxiv.org/abs/1611.01838
Parle: arxiv.org/abs/1707.00424
FGE: arxiv.org/abs/1802.10026
SWA: arxiv.org/pdf/1803.05407.pdf
SWA implementation in PyTorch: pytorch.org/blog/stochastic-w...
- Science & Technology
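A minimal plain-Python sketch of the idea behind Stochastic Weight Averaging (SWA) referenced above: instead of keeping only the final SGD iterate, average the weights visited late in training, which tends to land in a flatter part of the basin. This is a toy illustration, not the actual PyTorch API (which lives in `torch.optim.swa_utils`).

```python
# Sketch of the weight-averaging idea behind SWA. Toy code: weights
# are plain lists of floats rather than real network parameters.

def swa_average(weight_snapshots):
    """Element-wise mean of a list of weight vectors."""
    n = len(weight_snapshots)
    dim = len(weight_snapshots[0])
    return [sum(w[i] for w in weight_snapshots) / n for i in range(dim)]

# Hypothetical SGD iterates bouncing around a wide minimum near (1, 3).
snapshots = [[0.0, 2.0], [2.0, 4.0]]
print(swa_average(snapshots))  # → [1.0, 3.0], the center of the basin
```

In the real PyTorch implementation, `AveragedModel` maintains this running average over snapshots of an actual model, and batch-norm statistics are recomputed afterward.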
For those who are wondering, yes, he's the grandson of the late great Paul Dirac.
Such insights so easily explained denote a deep understanding of the topic and great teaching skills. I am eager to see more lectures or talks by this author.
Thanks.
Really useful resource with intuitively understandable explanations!
Thanks a lot!
Thank you so much for this excellent talk.
He’s a great speaker. Really well explained. Thanks for sharing.
Hi Leo,
Thanks for this very impressive way of making somewhat complicated concepts so easy to understand with simple but well-structured visualisations.
This is such a great talk! Keep it up my dude!!
This is so coooooollll!!!!!!!
Great video. Why no more? These are very insightful.
Thanks for the talk, Leo! I'm now a couple of months into ML and this level of articulation really helped a lot. I know this is probably a rookie mistake in this context, but often when it's hard for my model to converge, I assumed it had reached a local minimum. My usual fix is to bump up the learning rate significantly, hoping the model can leap over it and re-converge. According to what you said, there is evidence conclusively showing that these loss functions have no bad local minima. I'm wondering which specific papers you were talking about.
regards,
Matt
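A toy sketch of the physics analogy from the talk that's relevant to this question: in SGD with momentum, the parameter behaves like a ball with velocity, and the gradient acts as a force, so accumulated momentum (not just a large learning rate) is what carries the iterate through small bumps. The loss function below is hypothetical, chosen only because it has two basins.

```python
# Physics analogy for SGD with momentum: velocity accumulates past
# gradients, and a friction-like factor beta damps it over time.
# Toy 1-D loss f(x) = x**4 - 2*x**2 + 0.3*x (two basins, one lower).

def grad(x):
    # derivative of the toy loss above
    return 4 * x**3 - 4 * x + 0.3

def sgd_momentum(x, lr=0.01, beta=0.9, steps=500):
    v = 0.0
    for _ in range(steps):
        v = beta * v - lr * grad(x)  # force updates velocity
        x = x + v                    # velocity updates position
    return x

x_final = sgd_momentum(x=1.2)
print(x_final)  # settles at a stationary point of the toy loss
```

With `beta=0` this reduces to plain gradient descent, which simply rolls to the bottom of whichever basin it starts in.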
This is a great lecture that ends at Wolfram's argument for quantum physics and relativity, and what I think manifests as Orch-OR-type consciousness through Penrose twistor collapse.
Dude, awesome!
This video led me to my "aha" moment, thanks.
Was it the pink shirt, or the green belt?
nice one
Very nice (and certainly mind-blowing) video, but according to ua-cam.com/video/78vq6kgsTa8/v-deo.html, that complicated loss landscape at 13:51 is not actually a ResNet but a VGG. The ResNet one looks a lot smoother due to the residual skip connections.
Thanks for the kind words. The creators of that diagram called it a "ResNet" - see the first page of the referenced paper arxiv.org/pdf/1712.09913.pdf . Skip connections make the loss surface smoothER, but remember that these surfaces have millions of dimensions. There are zillions of ways to visualize them in 2 or 3 dimensions, and every view discards tons of information. It's totally reasonable to expect that one view would look smooth and another very lumpy, for the same surface.
TBH I don't know exactly what the authors of this paper did - they refer to "skip connections" a lot, and talk about resnets with and without them. I'm not sure if they mean "residuals" when they say "skip connections" but I'm not sure I'd call a resnet without RESiduals a RESnet myself. If you remove the residuals it's architecturally a lot closer to a traditional CNN like VGG / AlexNet / LeNet and not what I would call a ResNet at all.
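A sketch of how 2-D loss-surface pictures like the one being discussed are made (following the referenced paper arxiv.org/pdf/1712.09913.pdf): pick a point theta in weight space, pick two random directions, and evaluate the loss over the plane they span. The toy quadratic "loss" below is a stand-in for a real network; the point of the exercise is that different random slices of the same million-dimensional surface can look very different, which is why one view can look smooth and another lumpy.

```python
# Evaluate a high-dimensional loss on a random 2-D slice:
# surface[i][j] = loss(theta + grid[i] * d1 + grid[j] * d2)
import random

def slice_loss(loss, theta, d1, d2, grid):
    """Sample loss values on the plane theta + a*d1 + b*d2."""
    surface = []
    for a in grid:
        row = []
        for b in grid:
            point = [t + a * u + b * v for t, u, v in zip(theta, d1, d2)]
            row.append(loss(point))
        surface.append(row)
    return surface

# Toy stand-in for a network's loss over its weight vector.
def toy_loss(w):
    return sum(x * x for x in w)

dim = 100
theta = [0.0] * dim                             # "trained" weights
d1 = [random.gauss(0, 1) for _ in range(dim)]   # random direction 1
d2 = [random.gauss(0, 1) for _ in range(dim)]   # random direction 2
grid = [-1.0, -0.5, 0.0, 0.5, 1.0]
surface = slice_loss(toy_loss, theta, d1, d2, grid)
print(surface[2][2])  # → 0.0, the loss at theta itself (a = b = 0)
```

The paper's refinement, filter-wise normalization of `d1` and `d2`, is what makes slices comparable across architectures; without it the picture depends heavily on the scale of the random directions.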
That "circle" idea is somewhat puzzling. I think it depends on the implementation of SGD: if there is no slope in that direction, how does the trajectory keep moving along the edge of that circle? Do you really use randomized batches? Many questions.
17:00 is all you need.
please the slides sir