How To Train an LLM With Diffusion From Scratch

How AI 'Understands' Images (CLIP) - Computerphile

Why Does Diffusion Work Better than Auto-Regression?

Валерий Ширяев о событиях в Курской области и их последствиях / Редакция. Интервью

🔥 Уся правда про українську СУДЖУ

ПОМОГЛА НАЗЫВАЕТСЯ😂

How Diffusion Works for Text

Oxen

Переглядів 1 298

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 19 сер 2024
Наука та технологія

КОМЕНТАРІ • 7

@BooleanDisorder 4 місяці тому ⁺²
Diffusion text noise could improve reasoning. A kind of overview of the problem, instead of trying to guess just the next token. If you make an oopsie at the start it can quickly compound later-on with autoregression. Being able to go back and forth must be a huge boost. I could see a model in the future where the question to an answer is put like "therefore answer to [question asked] must be" at the end of the noise to force it to answer.
It's also a step into the direction of explainability.
@DanielPramel 4 місяці тому ⁺²
Could this potentially improve function calling and adherence to certain output formats, e.g., JSON?
@oxen-ai 4 місяці тому
That's a great call out, the benefits of infilling could definitely help with certain output formats. IE put the curly braces at the start and end of the sequence.
@rogerc7960 4 місяці тому ⁺¹
Tesla diffusion model taught itself to read street signs.
@oxen-ai 4 місяці тому
Oh interesting, do you have a link?
@jensg8547 3 місяці тому
couldnt the diffusion pertubations happen on the embedding vector level - as suggested in one of the questions - and a nearest neighbor search be used to predict a vector that resembles an actual token?
@oxen-ai 3 місяці тому
Yes, I love this idea. I think someone should try it and see how well it works. We dived a little into the code in our next video as a jumping off point!

Наступне

Автоматичне відтворення

How To Train an LLM With Diffusion From Scratch

How To Train an LLM With Diffusion From Scratch

How AI 'Understands' Images (CLIP) - Computerphile

How AI 'Understands' Images (CLIP) - Computerphile

Why Does Diffusion Work Better than Auto-Regression?

Why Does Diffusion Work Better than Auto-Regression?

Валерий Ширяев о событиях в Курской области и их последствиях / Редакция. Интервью

Валерий Ширяев о событиях в Курской области и их последствиях / Редакция. Интервью

🔥 Уся правда про українську СУДЖУ

🔥 Уся правда про українську СУДЖУ

ПОМОГЛА НАЗЫВАЕТСЯ😂

ПОМОГЛА НАЗЫВАЕТСЯ😂

КУРСК в ОГНЕ, кадыровцы грызутся с ГЕНШТАБОМ РФ, а СКАБЕЕВУ послали

КУРСК в ОГНЕ, кадыровцы грызутся с ГЕНШТАБОМ РФ, а СКАБЕЕВУ послали

What are AI Agents?

What are AI Agents?

An update on DPO vs PPO for LLM alignment

An update on DPO vs PPO for LLM alignment

Has Generative AI Already Peaked? - Computerphile

Has Generative AI Already Peaked? - Computerphile

Beyond the Hype: A Realistic Look at Large Language Models • Jodie Burchell • GOTO 2024

Beyond the Hype: A Realistic Look at Large Language Models • Jodie Burchell • GOTO 2024

Denoising Diffusion Probabilistic Models | DDPM Explained

Denoising Diffusion Probabilistic Models | DDPM Explained

Sakana AI's Latest Release: Evolutionary Optimization of Model Merging Recipes

Sakana AI's Latest Release: Evolutionary Optimization of Model Merging Recipes

Stable Diffusion in Code (AI Image Generation) - Computerphile

Stable Diffusion in Code (AI Image Generation) - Computerphile

Diffusion Models | Paper Explanation | Math Explained

Diffusion Models | Paper Explanation | Math Explained

How I Understand Diffusion Models

How I Understand Diffusion Models