At the end, when you bring the heads dimension out of the resulting relative_position_values matrix, shouldn't the operation be relative_position_values.transpose(1, -1).transpose(0, 1).unsqueeze(0), so we end up with (batch, heads, sequence, context) instead of (batch, heads, context, sequence)?
Good catch! Yes, the context and sequence are in the wrong order (and I've ignored the batch) - your solution puts things in the correct order. We switch to einops later as we put everything together so this will be corrected in later videos. Glad you're enjoying the series :)
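For anyone following along, here's a small sketch of the fix. The dimension sizes are made up for illustration; the only assumption (from working the transposes backwards) is that relative_position_values starts out with shape (sequence, context, heads):

```python
import torch

# Hypothetical sizes for illustration only
seq_len, context_len, n_heads = 4, 6, 2

# Stand-in for relative_position_values, assumed shape (sequence, context, heads)
relative_position_values = torch.randn(seq_len, context_len, n_heads)

# (seq, ctx, heads) -> (seq, heads, ctx) -> (heads, seq, ctx) -> (1, heads, seq, ctx)
out = relative_position_values.transpose(1, -1).transpose(0, 1).unsqueeze(0)

print(tuple(out.shape))  # (1, 2, 4, 6), i.e. (batch, heads, sequence, context)
```

With einops (as used later in the series), the same reordering can be written in one call, e.g. rearrange(x, 'seq ctx heads -> 1 heads seq ctx'), which makes the intended axis order explicit.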
Just commenting to say that this series is appreciated; I took the weekend to follow along! Time well spent. Hopefully I'll continue next weekend.
Amazing work!
awesome!
Thank you.