Computer Vision on NPU - all you need to know

Anton Maltsev

2,914 views

Published on Dec 21, 2024
Science & Technology

COMMENTS • 15

@wolpumba4099 7 months ago ⁺⁵
*Summary: Running Computer Vision Models on NPUs*
What is an NPU? (0:37)
- NPUs are specialized silicon chips optimized for running neural network computations, especially matrix multiplications.
- Unlike CPUs and GPUs, they can't run general-purpose programs, focusing purely on neural network inference.
- Many different names exist for these chips, including LPU, TPU, VPU, etc., but they share the core idea of accelerating neural network calculations.
Why Use NPUs? (2:29)
- Main advantages: Reduced power consumption, lower device cost, potential for significant speedups compared to CPU/GPU for specific tasks.
- Main disadvantages: Increased development complexity, limited choice of neural network architectures, more intricate deployment and testing processes.
Challenges of working with NPUs:
- Diverse Ecosystem: (7:42) A vast landscape of vendors, frameworks, and boards makes finding a perfect solution difficult. Each vendor typically offers its own custom framework.
- Model Export and Compatibility: (10:09)
  - Requires careful preparation, including specific patches and quantization, to adapt your model to the target NPU architecture.
  - Non-maximum suppression (NMS) (18:59) often needs to be handled outside the NPU, requiring separate code or fallback mechanisms.
- Memory Limitations: (20:54)
  - Limited memory size on NPUs restricts model size and complexity.
  - Memory access speed and structure significantly impact performance.
- Preprocessing: (22:46) May need to be performed separately on the CPU, GPU, or dedicated accelerator depending on the NPU and its capabilities.
- Transformer Support: (23:58) Limited or non-existent on many NPUs, often requiring model adjustments or alternative convolutional architectures.
- Layer Support: (25:23)
  - Advertised layer support can be misleading due to merged layers or limited functionalities.
  - Always verify compatibility and performance for your specific model layers.
- Quantization: (27:33)
  - Essential for many NPUs to reduce model size and accelerate inference.
  - Can be complex and lead to accuracy degradation, requiring careful fine-tuning and evaluation.
- Benchmarks: (30:30)
  - Often don't reflect real-world performance.
  - Always test on your target hardware and specific model for accurate results.
Additional considerations:
- CPUs play a vital role in data transfer, image decoding, preprocessing, and fallback mechanisms, impacting overall performance (36:43).
- C++ is the dominant language for inference on most NPUs, while Python prevails in model training and export (38:45).
- Training on NPUs is possible but involves a separate class of processors and different considerations (39:51).
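To illustrate the NMS point above: when the NPU returns raw boxes and scores, the suppression step usually runs on the CPU. Below is a minimal sketch in plain Python, assuming boxes in (x1, y1, x2, y2) format and an illustrative IoU threshold of 0.5. The names `iou` and `nms` are made up for this example and do not come from any vendor SDK; production code would more likely be C++ or a library routine.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # assumed layout: (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: Sequence[Box], scores: Sequence[float],
        iou_threshold: float = 0.5) -> List[int]:
    """Greedy NMS: returns indices of kept boxes, highest score first."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept = nms(boxes, scores)  # the second box overlaps the first and is dropped
```

This greedy version is quadratic in the number of boxes, which is usually acceptable after a score threshold has already cut the candidates down.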
I used Gemini 1.5 Pro.
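As a rough illustration of why the quantization step in the summary can cost accuracy, here is a sketch of textbook affine int8 quantization applied to a list of floats. It is not the scheme of any particular NPU toolchain, and the function names are hypothetical. For values inside the calibrated range, the round-trip error is roughly at most half a quantization step.

```python
from typing import List, Sequence, Tuple

QMIN, QMAX = -128, 127  # signed 8-bit range


def quant_params(values: Sequence[float]) -> Tuple[float, int]:
    """Pick scale and zero point so that [min, max] (including 0) maps to int8."""
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    scale = (hi - lo) / (QMAX - QMIN) or 1.0  # avoid a zero scale for all-zero input
    zero_point = round(QMIN - lo / scale)
    return scale, zero_point


def quantize(values: Sequence[float], scale: float, zero_point: int) -> List[int]:
    """Map floats to int8 codes, clamping anything outside the range."""
    return [max(QMIN, min(QMAX, round(v / scale) + zero_point)) for v in values]


def dequantize(codes: Sequence[int], scale: float, zero_point: int) -> List[float]:
    """Map int8 codes back to approximate floats."""
    return [(q - zero_point) * scale for q in codes]


weights = [-1.0, 0.0, 0.5, 2.0]
scale, zp = quant_params(weights)
codes = quantize(weights, scale, zp)
restored = dequantize(codes, scale, zp)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

A wide value range means a large `scale`, so a single outlier can coarsen the step for every other value. That is one reason calibration data and per-layer evaluation matter.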
@zorqis 3 months ago
Good summary and useful for passers-by. However, the video contains some small remarks that carry a lot of useful information, so I still recommend watching the whole video.
@boltvalley3076 27 days ago
Thank you.
@shakhizatnurgaliyev9355 7 months ago ⁺¹
good one!
@andreyl2705 7 months ago ⁺¹
awesome)
@diegosantos9757 7 months ago
Dear, tks for the content.
Which SBC would you recommend for someone just starting with computer vision?
@AntonMaltsev 7 months ago ⁺¹
Depends on your budget.
The smoothest experience is with Jetsons or Intel-based boards.
On a low budget, I recommend some Rockchip-based solutions.
@diegosantos9757 7 months ago
Tks mate, I will check the Rockchip!
@עינהרע 7 months ago
You gonna test the new Hailo GenAI M.2 board?
@AntonMaltsev 7 months ago ⁺²
It's difficult to buy a single unit for home use, and none of my friends or colleagues are using it right now, so I have no chance to borrow one.
So, it's not in the plans. But if there is a chance, I will try.
@AntonMaltsev 7 months ago ⁺¹
But the next video will probably be about my experience of using Hailo in production (more about the framework and Hailo-8)
@ДенисСлепцов-ь6п 7 months ago ⁺¹
Hello, I've been following your work for a long time. Please keep it up! It's very interesting. Could you tell me whether you have ever deployed a neural network on an FPGA? If so, could you please share your experience?
@AntonMaltsev 7 months ago
Good afternoon, thank you!
A couple of times I wanted to test the Xilinx Kria, but each time I was talked out of it and told that it is complete junk.
In general, a typical FPGA doesn't map all that well onto neural network architectures. So it's not even very clear what the point would be...
@ДенисСлепцов-ь6п 7 months ago
@AntonMaltsev Got it, thanks
@____________________________.x 7 months ago
Your jump cuts make this confusing
