Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - 693

  • Published 22 Nov 2024

COMMENTS • 15

  • @first-thoughtgiver-of-will2456 2 months ago +2

    Thanks Albert and Sam! Surprisingly insightful for someone researching the Mamba architecture right now!

  • @l.halawani a month ago

    Super happy to see you on YT! Been missing you since Alphabet scrapped Google Podcasts! Awesome content.

  • @JonathanWong-u8g a month ago

    As of 2024, the latent diffusion paradigm has been very successful in these 'natural' modality tasks (sound, images, video), and the paradigm is now being applied to 3D spatial awareness. We've actually been in the post-transformer era for a while (1-2 years)! I am wondering where Gu's work fits in here; perhaps these Mamba models will produce better latents for extremely long-context video and spatial point cloud data? Will stay tuned. Thanks for the talk!

    • @mephilees7866 a month ago

      The problem with latent diffusion (something like DiT) is that it's too slow, especially with high-bandwidth data like images. Mamba will help in the encoder part, but I don't see how to benefit from it in the decoder part. I would suggest you check out VAR (visual autoregression): it works by regressing the next resolution instead of generating out of noise, and is around 20x faster with better performance. (A toy sketch of the next-scale idea follows this thread.)

    • @JonathanWong-u8g a month ago

      @mephilees7866 Excellent, thank you!
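
A minimal sketch of the next-scale idea mentioned in the reply above, not the official VAR implementation: instead of iteratively denoising from Gaussian noise, the model predicts a feature map at each successively finer resolution, conditioned on an upsampled version of the coarser scale. `ToyPredictor`, the scale schedule, and the channel count below are illustrative assumptions.

```python
# Toy sketch of next-scale ("next resolution") autoregression, in the spirit
# of VAR as described in the comment above. Not the official implementation:
# ToyPredictor, the scale schedule, and the channel count are all made up
# for illustration. The point is that generation proceeds scale by scale
# (one forward pass per scale), rather than step by step out of noise.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyPredictor(nn.Module):
    """Stand-in for the model that predicts the next, finer-scale feature map."""
    def __init__(self, channels: int = 16):
        super().__init__()
        self.net = nn.Conv2d(channels, channels, kernel_size=3, padding=1)

    def forward(self, coarse_upsampled: torch.Tensor) -> torch.Tensor:
        # Predict the finer-scale map from the upsampled coarser context.
        return self.net(coarse_upsampled)

def generate_coarse_to_fine(predictor: nn.Module,
                            scales=(1, 2, 4, 8, 16),
                            channels: int = 16) -> torch.Tensor:
    # Start from a constant 1x1 "seed" map instead of Gaussian noise.
    x = torch.zeros(1, channels, scales[0], scales[0])
    for size in scales[1:]:
        # Upsample the current (coarser) map to the next resolution...
        context = F.interpolate(x, size=(size, size), mode="nearest")
        # ...and predict the finer-scale map in a single forward pass,
        # instead of running many denoising steps as in diffusion.
        x = predictor(context)
    return x  # finest-scale latent map, to be decoded by a separate decoder

predictor = ToyPredictor()
latent = generate_coarse_to_fine(predictor)
print(latent.shape)  # torch.Size([1, 16, 16, 16])
```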

  • @lobovutare 3 months ago +2

    Interesting to hear that the author of Mamba feels that attention is indispensable. My initial thought was that Mamba is a full replacement for Transformers, but it seems that Gu believes attention layers are still necessary for the model to be able to reason at the level of tokens. Perhaps hybrid models like Jamba are the way to go.

    • @Noah-jz3gt 3 months ago +3

      Well, it seems like Gu tries to find theoretical relations between attention and SSMs in Mamba-2. To be honest, Mamba doesn't even look like an SSM to me anymore. (A toy illustration of that attention/SSM correspondence follows this thread.)
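
To make the relation mentioned in this thread concrete, here is a toy numerical sketch, my own simplification rather than code from the Mamba-2 paper, of the "state space duality" direction: a scalar selective SSM recurrence produces exactly the same outputs as multiplying the input sequence by a lower-triangular, attention-like matrix built from the SSM parameters. All tensors are random toy data.

```python
# Toy numerical check of the SSM/attention correspondence mentioned above
# (the "state space duality" direction of Mamba-2), using a scalar state.
# All values are random toy data, not trained Mamba parameters.

import torch

T = 6
a = torch.rand(T) * 0.9   # input-dependent decay (the "selective" A)
b = torch.randn(T)        # input projection (plays the role of keys)
c = torch.randn(T)        # output projection (plays the role of queries)
x = torch.randn(T)        # input sequence

# 1) Recurrent (linear-time) form:
#    h_t = a_t * h_{t-1} + b_t * x_t,   y_t = c_t * h_t
h = torch.tensor(0.0)
y_recurrent = []
for t in range(T):
    h = a[t] * h + b[t] * x[t]
    y_recurrent.append(c[t] * h)
y_recurrent = torch.stack(y_recurrent)

# 2) "Attention" (quadratic) form: the same map written as a lower-triangular
#    matrix M with M[t, s] = c_t * (a_{s+1} * ... * a_t) * b_s, applied to x.
M = torch.zeros(T, T)
for t in range(T):
    for s in range(t + 1):
        decay = torch.prod(a[s + 1:t + 1]) if s < t else torch.tensor(1.0)
        M[t, s] = c[t] * decay * b[s]
y_attention = M @ x

print(torch.allclose(y_recurrent, y_attention, atol=1e-5))  # True
```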

  • @wwkk4964 4 months ago +1

    Brilliant, the tokenizer ought to be a learned parameter that coevolves in response to the task.

  • @minshenlin127 18 days ago

    Hi, may I know how to add your channel to Apple Podcasts?

    • @twimlai 18 days ago

      Hi. You can follow our channel here: podcasts.apple.com/us/podcast/the-twiml-ai-podcast-formerly-this-week-in-machine/id1116303051

    • @minshenlin127 16 days ago

      @twimlai Thank you for your reply, but I cannot visit the site; the URL seems invalid.

    • @twimlai 16 days ago

      Strange. Works on my end. Try twimlai.com/podcast and look for the button on that page.

    • @minshenlin127 5 days ago

      @twimlai Thank you very much, but it's still not working, so I use Spotify now 😃

  • @chickenp7038 4 months ago

    Great interview.

  • @ps3301 3 months ago

    How about vision?