RL Foundation Models Are Coming!

  • Published Oct 5, 2024

COMMENTS • 53

  • @theodoreshachtman7360
    @theodoreshachtman7360 1 year ago +27

    This is a really high quality video, on par with 2 minute papers but with a more detail oriented approach. Also you have a lovable vibe king, keep it up

    • @THarshavardhanReddy
      @THarshavardhanReddy 1 year ago +4

      I used to love 2 Minute Papers. But it's become very repetitive now, and just too fluffy. Probably I'm not in the target audience anymore.

    • @herpderp728
      @herpderp728 1 year ago +5

      I absolutely hate 2 minute papers. It's all hype and no substance. I physically cringe every time I hear the guy say "now hold onto your papers everybody! this is gonna be crazy!" and then he tells you the most boring anti-climactic shit possible.

    • @theodoreshachtman9990
      @theodoreshachtman9990 1 year ago

      Yeah, but how come your stinky doo doo though…

    • @theodoreshachtman7360
      @theodoreshachtman7360 1 year ago

      @@herpderp728 Yeah, but how come your stinky doo doo though…

  • @MickGardner-vc4us
    @MickGardner-vc4us 1 year ago +2

    edan bro makes my dopamine policy gradients high every time. fingers crossed we get open RL foundation models.

  • @CristianGarcia
    @CristianGarcia 1 year ago +4

    Just give this environment to speed runners, watch the true potential of what humans can do with games.
    Thanks for the video!

  • @tchlux
    @tchlux 1 year ago +7

    Another way to frame the problem of neural network representations becoming “too specific” to learn new tasks (at 25:59) is to consider exactly how the gradient of the weights is computed.
    It’s the matrix multiplication between the directional error after a layer and the directional values before the layer. When the values become totally orthogonal to the error (they contain no information relative to the error), it’s impossible to reduce the error by changing the weights in that layer.
    The reason weight randomization helps with this problem is that it introduces new values after the randomized layer. However, a much more efficient approach is to first compress the existing weights in a layer with linear regression over a representative sample of data, to “pack” the good information into fewer neurons. Then you’re free to randomly initialize the remaining neurons, or, even better, to initialize weights that produce values already aligned with the directional error! I’ve got some ongoing research in this area if anyone is interested in collaborating. 🤓

    • @MickGardner-vc4us
      @MickGardner-vc4us 1 year ago +1

      sounds pretty badass. might be easier to do a backward pass through lin-reg as well

    • @jadenlorenc2577
      @jadenlorenc2577 1 year ago

      I'd be interested! How do I get in contact?

    • @tchlux
      @tchlux 1 year ago

      @@jadenlorenc2577 my YouTube profile has links to different places, whatever is easiest for you!
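
    The zero-gradient point in the thread above can be sketched in a few lines of NumPy (my own toy illustration, not code from the video or the commenter's research): for a linear layer y = X @ W, backprop gives dL/dW = X.T @ Delta, so if every feature column of the values X is orthogonal over the batch to every column of the error Delta, the weight gradient vanishes exactly.

```python
import numpy as np

# For a linear layer y = X @ W (batch of B samples, no bias), backprop gives
# dL/dW = X.T @ Delta, where Delta = dL/dy is the error after the layer.
# Entry (i, j) of the gradient is the batch-correlation of feature i with
# error j, so if the values are orthogonal to the error over the batch,
# the gradient is exactly zero and this layer's weights cannot change.

rng = np.random.default_rng(0)
B, n_in, n_out = 8, 3, 2

# Construct values and errors supported on disjoint batch indices, so every
# feature column is orthogonal to every error column over the batch.
X = np.zeros((B, n_in))
X[::2] = rng.normal(size=(B // 2, n_in))        # values before the layer
Delta = np.zeros((B, n_out))
Delta[1::2] = rng.normal(size=(B // 2, n_out))  # error after the layer

dW = X.T @ Delta           # weight gradient
print(np.allclose(dW, 0))  # True: no weight update can reduce the error here
```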

  • @exoqqen
    @exoqqen 1 year ago +1

    amazing breakdown, thank you for making this paper accessible to me!

  • @vsiegel
    @vsiegel 1 year ago +3

    At 7:10, the first pronunciation of Muesli is right. It's German Müsli; Muesli may be the Swiss German spelling.

  • @zxgrizzly3401
    @zxgrizzly3401 1 year ago +1

    Thanks for your videos, but at 7:44, EfficientZero and MuZero do not reconstruct the raw observation/image. MuZero learns its latent representation from value equivalence only, while EfficientZero also cares about temporal consistency: it uses the next observation to supervise the representation and dynamics parts of the model in a self-supervised manner (SimSiam)
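
    A toy sketch of the temporal-consistency idea described above (my own hedged illustration with made-up function names, not EfficientZero's actual code): the dynamics model predicts the next latent from the current one and is supervised by the encoder's embedding of the real next observation, with a stop-gradient on the target branch, SimSiam-style, and no pixel reconstruction anywhere.

```python
import numpy as np

# Toy SimSiam-style temporal consistency (assumed names, not EfficientZero's
# real implementation). The representation is trained so the latent predicted
# by the dynamics model matches the encoding of the actual next observation.

def encode(obs, W_enc):
    return np.tanh(obs @ W_enc)   # representation network: observation -> latent

def dynamics(z, W_dyn):
    return np.tanh(z @ W_dyn)     # latent transition model

def consistency_loss(pred, target):
    # Negative cosine similarity; in a real framework `target` would be
    # wrapped in a stop-gradient so only the prediction branch learns.
    pred = pred / np.linalg.norm(pred)
    target = target / np.linalg.norm(target)
    return -float(pred @ target)

rng = np.random.default_rng(1)
W_enc, W_dyn = rng.normal(size=(4, 3)), rng.normal(size=(3, 3))
obs_t, obs_next = rng.normal(size=4), rng.normal(size=4)

z_pred = dynamics(encode(obs_t, W_enc), W_dyn)  # predicted next latent
z_target = encode(obs_next, W_enc)              # encoding of the real next obs
loss = consistency_loss(z_pred, z_target)       # in [-1, 1]; minimized at -1
```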

  • @billykotsos4642
    @billykotsos4642 1 year ago +3

    sounds like RL is progressing? maybe I should jump back in !

  • @chickenp7038
    @chickenp7038 1 year ago +4

    Since wandb doesn't work for me, I will actually try ClearML thanks to you

  • @dragossorin85
    @dragossorin85 1 year ago

    Been thinking about this for some time

  • @zigzag4273
    @zigzag4273 1 year ago +5

    My 2nd petition on this matter. Please make a video of how you read and implement papers. Thank you **kiss**

    • @EdanMeyer
      @EdanMeyer  1 year ago +1

      Still considering. Part of the issue is that every paper is just so different when it comes to this, and lots of the background is going to be dependent on the paper. Still might try as I guess maybe I can extract some general guidelines from my process

    • @_RMSG_
      @_RMSG_ 1 year ago

      @@EdanMeyer Where to start would be a pretty good help

  • @ch1n3du3
    @ch1n3du3 1 year ago +1

    Do you think the approaches here could be applied to Dreamer V3?

  • @afish5581
    @afish5581 1 year ago

    Coffee is culture too!

  • @Kram1032
    @Kram1032 1 year ago

    I wonder if there is any benefit to be had, across multiple full training iterations, from distilling a large model into a smaller one and then distilling the small one back into a larger one (vs. *just* repeatedly distilling a large model into a model of the same size)

  • @henrycook859
    @henrycook859 1 year ago +1

    22:55 uhh 5 x 300 isn't 1800 lmao

  • @kemalware4912
    @kemalware4912 1 year ago +1

    I really liked vscode theme on the clear ml section. Can you share it?

    • @kemalware4912
      @kemalware4912 1 year ago +1

      Community Material Theme Ocean High Contrast

  • @before7048
    @before7048 1 year ago +2

    7:10 Myu-slee. It's a quick, easy and tasty breakfast so that you too, can be reinforced!

    • @EdanMeyer
      @EdanMeyer  1 year ago +2

      Lmao I don’t think I could have been any further from the mark

    • @alexcai1320
      @alexcai1320 1 year ago

      @@EdanMeyer no worries -- it was incredibly entertaining XD

  • @wpgg5632
    @wpgg5632 1 year ago

    Really love it!

  • @Blacky372
    @Blacky372 1 year ago

    I wonder if you could train a model that could beat a human in Rock Paper Scissors, but with retained memory in a best of 7 or so. That would only require it to train on human behavior episodes, which would be hard to acquire. But if this was possible with synthetic games, this would be the best party trick ever.

  • @shadamethyst1258
    @shadamethyst1258 1 year ago +5

    Why did they have to choose the same name as the Ada programming language ._.
    They did the same thing with MLKit, which was a suite of tools for the ML (Standard ML) language, and which Google decided should instead be a machine learning kit

    • @EdanMeyer
      @EdanMeyer  1 year ago +2

      I’m pretty sure every short name in ML papers shares a name with something else at this point lol

  • @ChocolateMilkCultLeader
    @ChocolateMilkCultLeader 1 year ago

    If you're ever interested in collaborations, let me know. I'd love to have you on my newsletter to cover some of your most interesting ideas.

  • @pauljones9150
    @pauljones9150 1 year ago

    Good stuff

  • @angelowentzler9961
    @angelowentzler9961 1 year ago +7

    Muesli is pronounced "MEW-zlee" HTH

  • @user-kp7xs4rb3t
    @user-kp7xs4rb3t 1 year ago

    The hell, we have the same name!

  • @robertsimonuy9743
    @robertsimonuy9743 1 year ago +1

    "ADA" and "Muesli"
    Thought this was about the Cardano ecosystem. lol

  • @Ideagineer
    @Ideagineer 1 year ago

    An army of GPUs? Time to break open the piggy bank.

  • @sitrakaforler8696
    @sitrakaforler8696 1 year ago

    Wow x)

  • @JinKee
    @JinKee 1 year ago

    I worry that independent agents will make mistakes faster than we can realign their goals.

  • @polecat3
    @polecat3 1 year ago +3

    20:25 I laughed

  • @omeadpooladzandi9786
    @omeadpooladzandi9786 1 year ago

    I can't even train CIFAR-10 in 15 mins

  • @Sviktam
    @Sviktam 1 year ago

    Like for a cultured matcha enjoyer

  • @johnnylatenight
    @johnnylatenight 1 year ago +2

    first

  • @phoneticalballsack
    @phoneticalballsack 1 year ago +10

    AGI is easy. Just build a neural network that takes in input, and puts out an output.

    • @ShivaTD420
      @ShivaTD420 1 year ago +4

      Yup, it's just a bunch of keystrokes in the right order. So ez