How to Implement an FIR Filter in C++ [DSP #15]

SIMD and vectorization using AVX intrinsic functions (Tutorial)

Miguel Raz Guzmán Macedo - Portable SIMD tricks for fun and profit

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

The Witcher IV - Cinematic Reveal Trailer | The Game Awards 2024

The Security Guard Fell Into The Trap Of The Beauty #still #parkour #funny#skate

What Are SIMD Instructions? (With a Code Example) [DSP #14]

WolfSound

Переглядів 14 128

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 25 гру 2024

КОМЕНТАРІ • 21

@WolfSoundAudio 2 роки тому ⁺²
Have I helped you with this video? If yes, please, consider buying me a ☕ coffee at www.buymeacoffee.com/janwilczek
Thanks! 🙂
@niranjanm5942 Рік тому ⁺²
Thanks this was great intro on this topic. I wanted to get started on SIMD and this will put me in right way
@chen-kim9440 7 місяців тому ⁺¹
Thanks for your great introduction and lively demo! I really like your pace!
@auditiv0276 2 місяці тому
If you want to make sure to compile using SIMD instructions specific for the HostCPU you can use llvm bindings for the language of your choice and then compile through llvm. Interesting vid!
@moliver_xxii 2 роки тому ⁺²
hej, to jest trudny temat, nic nie można znaleść na Internet, cieli dziękuję ci Jan!
@WolfSoundAudio 2 роки тому ⁺²
Bardzo się cieszę, dzięki również!
@alldyallnite 2 роки тому ⁺¹
Thank you Jan!
@WolfSoundAudio 2 роки тому
Thanks for commenting! :)
@cliffmathew 2 роки тому
Great job explaining, and demonstrating. Thank you.
@theruisu21 Рік тому
great video!. looking forward the next one. for the next time, could include more on the arm and risc v case?
@NecdetSanli Рік тому
You made the concept easy to understand, thank you. Would like to see some C examples if it's possible too.
@ifnullreturn1 Рік тому ⁺³
Line 13 is killing me lol
@KeypleezerOfficial 2 роки тому
Nice video & nicely paced and clear. Just what I needed to get this topic a bit more. Just need some more examples of calculations actually taken care of by the SIMD extension sets, and perhaps some alternative SIMD/FFT libraries with info about what does what and how, that would be epic. Not many people teaching this in audio with such good phrasing! Keep up the great work! 👍
@KeypleezerOfficial 2 роки тому
I didn´t read the article about this topic you wrote before. It is great, much more info there giving more depth, thanks!
@davidminnix 2 місяці тому
many dsp algorithms contain single sample feedback. can anything be done to vectorize these algorithms? It seems like the feedback complicates any attempt to use block processing to vectorize.
@moisascholar Рік тому
Very helpful video. I was working on a particle system/simulation, and I use GL to draw the particles. Was wondering with SIMD and GL, how can I draw multiple particles at once? Or is this something more to do with GL buffers?
@przekladanki 2 роки тому ⁺²
Yes, you helped a lot ^_^
@WolfSoundAudio 2 роки тому
That's great ;)
@BalakrishnanIrudhayaraman Рік тому
I can understand the concept of simd. But, in the code I can see that you are adding each value when it is added to the register. I see that which is equivalent to scalar addition, I think inorder to avoid one more for loop to store the addition values into the result array which makes sense. This points me to ask whether the intrinsic function performs the addition, only when all the 256bits are filled with values or it can also perform otherwise?
@omnisepher Рік тому
Great job,
but didn't second for-loop killed the entire reason of using SIMD?
@corporalwill123 5 місяців тому
Late reply, you probably already have figured it out by now. Responding anyway for others with the same question.
That's like saying planes are pointless for traveling large distances, because you still need to walk the short distance to your destination from the airport.
SIMD will do a large portion of the work, in this case it will do it in multiples of 8, and the regular loop will finish the remaining amount
so for normal loop you are looking at:
N*scalar
while for SIMD you are getting:
floor(N/8)*SIMD + (N%8)*scalar
Since by design 1*SIMD will be faster than 8*scalar, for sizes greater or equal to 8, the second algorithm will be faster than just doing the first loop. Otherwise, for sizes smaller than 8, it will be the same as the first loop + some overhead because of the division by 8.

Наступне

Автоматичне відтворення

How to Implement an FIR Filter in C++ [DSP #15]

How to Implement an FIR Filter in C++ [DSP #15]

SIMD and vectorization using AVX intrinsic functions (Tutorial)

SIMD and vectorization using AVX intrinsic functions (Tutorial)

Miguel Raz Guzmán Macedo - Portable SIMD tricks for fun and profit

Miguel Raz Guzmán Macedo - Portable SIMD tricks for fun and profit

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

🤔Можно ли спастись от Ядерки в Холодильнике ? #shorts

The Witcher IV - Cinematic Reveal Trailer | The Game Awards 2024

The Witcher IV — Cinematic Reveal Trailer | The Game Awards 2024

The Security Guard Fell Into The Trap Of The Beauty #still #parkour #funny#skate

The Security Guard Fell Into The Trap Of The Beauty #still #parkour #funny#skate

Что-что Мурсдей говорит? 💭 #симбочка #симба #мурсдей

Что-что Мурсдей говорит? 💭 #симбочка #симба #мурсдей

Ivan Cohen - Fifty shades of distortion (ADC'17)

Ivan Cohen - Fifty shades of distortion (ADC'17)

Top 5 Languages For Audio Programming

Top 5 Languages For Audio Programming

Rust: When C Code Isn't Enough

Rust: When C Code Isn't Enough

Harder Than It Seems? 5 Minute Timer in C++

Harder Than It Seems? 5 Minute Timer in C++

Extreme SIMD: Optimized Collision Detection in Titanfall

Extreme SIMD: Optimized Collision Detection in Titanfall

What is SIMD? Abusing Vector Instructions Across Threads for Ray Tracing

What is SIMD? Abusing Vector Instructions Across Threads for Ray Tracing

Refterm Lecture Part 5 - Parsing with SIMD

Refterm Lecture Part 5 - Parsing with SIMD

Writing Code That Runs FAST on a GPU

Writing Code That Runs FAST on a GPU

why are switch statements so HECKIN fast?

why are switch statements so HECKIN fast?

до конца, там самая счастливая табалапка🐾🐾 #тикток #табалапка

до конца, там самая счастливая табалапка🐾🐾 #тикток #табалапка

Уличный боец с ДУХОМ воина

Уличный боец с ДУХОМ воина

Заява ЗАЛУЖНОГО ШОКУВАЛА увесь СВІТ😱ТРЕТЯ СВІТОВА ВІЙНА ПОЧАЛАСЬ?

Заява ЗАЛУЖНОГО ШОКУВАЛА увесь СВІТ😱ТРЕТЯ СВІТОВА ВІЙНА ПОЧАЛАСЬ?

«Я жити не хочу»: винесли «з нуля» пораненого побратима #shorts

«Я жити не хочу»: винесли «з нуля» пораненого побратима #shorts

Разобрался голыми руками 😎 #start #кино #фильм #сериал #молотведьм #полиция #пацаны

Разобрался голыми руками 😎 #start #кино #фильм #сериал #молотведьм #полиция #пацаны

Морпіх із Каліфорнії доєднався до лав ЗСУ #shorts

Морпіх із Каліфорнії доєднався до лав ЗСУ #shorts

ПРАНК НАД БОЯРСКИМ | КОНФЛИКТ НА ДОРОГЕ

ПРАНК НАД БОЯРСКИМ | КОНФЛИКТ НА ДОРОГЕ

СКАНДАЛЬНЫЙ бой Али, когда в ринге ему противостояли сразу ДВОЕ #shorts

СКАНДАЛЬНЫЙ бой Али, когда в ринге ему противостояли сразу ДВОЕ #shorts