ImageGPT (Generative Pre-training from Pixels)
- Published 17 Jun 2024
- This video will explore the exciting new 6.8 Billion parameter ImageGPT model! The researchers show that better and larger generative models learn better representations for tasks like ImageNet classification!
Thanks for watching! Please Subscribe!
Paper Links:
ImageGPT (Blog Post): openai.com/blog/image-gpt/
ImageGPT (Paper): cdn.openai.com/papers/Generat...
A Survey of Long-term Context in Transformers: www.pragmatic.ml/a-survey-of-...
Google TPUs: cloud.google.com/tpu/docs/tpus
The Illustrated Transformer: jalammar.github.io/illustrated...
PixelCNN: keras.io/examples/generative/...
PixelCNN (Paper): arxiv.org/pdf/1606.05328.pdf
Contrastive Predictive Coding: arxiv.org/pdf/1905.09272.pdf
Big BiGAN: arxiv.org/pdf/1907.02544.pdf
BERT: arxiv.org/pdf/1810.04805.pdf
Rethinking Pre-training and Self-Training: arxiv.org/pdf/2006.06882.pdf
2:18 Auto-Regressive modeling of Pixels
4:18 Denoising Autoencoders: AR and BERT
5:40 GPT Architecture, No CNN Prior!
7:00 6.8 BILLION parameters!! Comparison with SimCLR, CPC, BigBiGAN
8:24 Generative Models and Representation Learning for Vision
10:30 Fine-Tuning with Linear Probes
11:50 Working around Quadratic Complexity of Self-Attention
12:50 Context Reduction
13:52 Results and Ablations
18:50 Promise of Longer Context Transformers and Visual Representation Learning
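The chapters on quadratic self-attention and context reduction can be made concrete with a little arithmetic. The sketch below is illustrative only: it assumes ImageGPT-style preprocessing (downsampling to 32×32 and mapping each pixel's RGB to one token from a learned 512-colour palette, i.e. a "9-bit" colour space) and simply counts pairwise attention interactions; it is not code from the paper.

```python
# Sketch: why pixel-level self-attention is expensive, and how
# ImageGPT-style context reduction helps. Numbers are illustrative.

def attention_pairs(seq_len: int) -> int:
    """Self-attention compares every token with every other token,
    so compute and memory grow quadratically with sequence length."""
    return seq_len * seq_len

# A 224x224 RGB image flattened to one token per channel value:
full_res = 224 * 224 * 3   # 150,528 tokens
# Context reduction (assumed ImageGPT-style): downsample to 32x32 and
# encode each pixel as a single token from a 512-colour palette:
reduced = 32 * 32          # 1,024 tokens

print(attention_pairs(full_res))   # ~2.3e10 pairwise interactions
print(attention_pairs(reduced))    # ~1.0e6
print(attention_pairs(full_res) // attention_pairs(reduced))  # ~21,609x cheaper
```

Since the cost ratio is the square of the sequence-length ratio (147²), even modest downsampling buys an enormous reduction, which is what makes a convolution-free GPT over raw pixels feasible at all.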
Yannic Kilcher sent me here. Good channel. Subbed!
ditto
Awesome stuff. Have to watch it a couple times to wrap my head around it.
That imageGPT result is crazy. It seems that you can replace inductive biases (translation invariance via convolutions) with just more data and compute.
The resolution is so low though - not sure it would scale as well even if memory was available for a larger size.
Awesome content! Thanks!
Awesome video!
Good job!
Great job. We need colab tutorials.
😩 too awesome i can't even process
Can you use plain English please? It still sounds complex for beginners.