C++ Weekly - Ep 125 - The Optimal Way To Return From A Function

C++ Weekly - Ep 456 - RVO + Trivial Types = Faster Code

C++ Weekly - Ep 460 - Why is GCC Better Than Clang?

МІША ЛЕБІГА і АНДРІЙ ЛУЗАН в СРАЧІ #32

😳Трамп ПОТІШИВ Скабєєву, але одразу РОЗЧАРУВАВ #shorts

Мама загинула у блокадному Чернігові, а тато у полоні РФ #війна #люди #україна #shorts #смерть

C++ Weekly - Ep 124 - ABM and BMI Instruction Sets

C++ Weekly With Jason Turner

Переглядів 3 838

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 27 січ 2025

КОМЕНТАРІ • 13

@HaraldAchitz 6 років тому ⁺¹
why should march=native add special instructions for a special CPU (family) , don't you need to specify the cpu architecture explicit to test if a instruction set is generated or not?
like march=amd10fam or newer, barcelona, or intel, haswell or broadwell or newer?
@RomanOrekhov 4 роки тому ⁺¹
Ofc you were not able to see blc* instructions emitted since they are only implemented for certain AMD processors which support TBM set of instructions.
parallel_bit_deposit from example should actually be named parallel_bit_extract. Why didn't it translate into PEXT? Probably because it's slower than what it translated into due to too simple selector (which only has one island of 1s). But wiki example of return ((x & 0xff000000) >> 12) | ((x & 0xfff0) >> 4); also doesn't compile into PEXT.
for those willing to play with code here's the link godbolt.org/z/JUhCaF
@willofirony 6 років тому
Excellent video, Jason. I have often wondered about the compilers' use of advanced instruction sets. I have used POPCNT (via intrinsic calls), it is very valuable when used with bitmaps (similarly to the FATs used in the disk OS) that enable the reuse of elements in containers such as vectors and as such cut down, on reallocation costs. The purists will, probably object on the basis of portability of code and there is some worth to such objections; because there is now less incentive to up grade to the latest technology (one can see that in the plethora of pentium and I3 machines currently being retailed). Nevertheless, would be valuable to see how and if these instruction sets are used by the various compilers. Thank you
@erichopper4979 6 років тому ⁺³
So, that's missing MMX, which is pretty old. I don't know if that somehow doesn't count for some reason or another.
And yes, there is probably a specific -march flag (or perhaps other -m flags) you could use to enable the instructions. -march=native is... going to get you random results on godbolt.org. You should specify a specific architeture you're interested in. I have a Ryzen 7, and in the output of /proc/cpuinfo I can see that abm, bmi1, and bmi2 are all supported. So you could've specified -march=znver1 and it should enable those instructions.
@Henrik0x7F 6 років тому ⁺¹
When march=native is set the compiler will try to use all features of the current instruction set, right? Is there a way to specifically enable certain extensions? For example my cpu has SSE2 but I only want to include SSE1 instructions. I always had the feeling that byte code languages have a small advantage there.
@xXH3ll5xB3llXx 6 років тому ⁺²
You can specify the exact cpu architecture you're targeting:
gcc.gnu.org/onlinedocs/gcc/x86-Options.html#x86-Options
e.g. to just enable SSE use '-msse'
@ahmeterdem9312 6 років тому
As long as you compile for x86-64(aka amd64), minimum of SSE2 is assumed since the ISA requires SSE2. Then if you want to reduce your userbase and include more advanced extensions, you can add -mavx, -mavx2, etc. to GCC as far as I know. Of course, if you do -march=native, your output binary will be strictly using whatever is available on your machine(that you are compiling). The reason it is not recommended on GodBolt is that march=native will generate binary assuming the server GodBolt is running. There are more options in GCC like specifying minimum x86 generation and supporting all the newer architectures and so on. Or using march=generic(default) and tune your implementation with 'mtune' for a specific uarch.
@Henrik0x7F 6 років тому
Ahmet Erdem Thanks for the detailed answer. I'll have a look :)
@slayer5171 6 років тому
Is C++ weekly videos about C++17? Or can i use C++11?
@erichopper4979 6 років тому ⁺¹
So, the guy who makes these videos generally uses the very latest C++ available. Sometimes he even uses C++ features that are not in any standard (yet). But, more often, the examples and ideas are broadly applicable to many different versions of C++.
But finding a C++17 compiler isn't hard. The latest versions of both clang and gcc fully support C++17, and these compilers are Open Source, so you can download them and compile them easily if your platform doesn't already have them.
@slayer5171 6 років тому
Eric Hopper thanks for the information friend, i will install visual studio 2017, and follow this video.
@slayer5171 6 років тому
Jason Turner thanks, i check it.
@emiliadaria 6 років тому
nice find ~ 💖

Наступне

Автоматичне відтворення

C++ Weekly - Ep 125 - The Optimal Way To Return From A Function

C++ Weekly - Ep 125 - The Optimal Way To Return From A Function

C++ Weekly - Ep 456 - RVO + Trivial Types = Faster Code

C++ Weekly - Ep 456 - RVO + Trivial Types = Faster Code

C++ Weekly - Ep 460 - Why is GCC Better Than Clang?

C++ Weekly - Ep 460 - Why is GCC Better Than Clang?

МІША ЛЕБІГА і АНДРІЙ ЛУЗАН в СРАЧІ #32

МІША ЛЕБІГА і АНДРІЙ ЛУЗАН в СРАЧІ #32

😳Трамп ПОТІШИВ Скабєєву, але одразу РОЗЧАРУВАВ #shorts

😳Трамп ПОТІШИВ Скабєєву, але одразу РОЗЧАРУВАВ #shorts

Мама загинула у блокадному Чернігові, а тато у полоні РФ #війна #люди #україна #shorts #смерть

Мама загинула у блокадному Чернігові, а тато у полоні РФ #війна #люди #україна #shorts #смерть

УКРАЇНСЬКИЙ ДЕТЕКТИВ | Стоматолог. ТОП СЕРІАЛ. 1,2 серія

УКРАЇНСЬКИЙ ДЕТЕКТИВ | Стоматолог. ТОП СЕРІАЛ. 1,2 серія

Master Pointers in C: 10X Your C Coding!

Master Pointers in C: 10X Your C Coding!

C++ Weekly - Ep 457 - I Read C++ Magazines (So you don't have to!)

C++ Weekly - Ep 457 - I Read C++ Magazines (So you don't have to!)

The Latest Celebrity Tech Scam…

The Latest Celebrity Tech Scam…

Stack vs Heap Memory in C++

Stack vs Heap Memory in C++

C++26's std::span Over initializer_list - C++ Weekly Ep 465

C++26's std::span Over initializer_list - C++ Weekly Ep 465

C++ Weekly - Ep 454 - std::apply vs std::invoke (and how they work!)

C++ Weekly - Ep 454 - std::apply vs std::invoke (and how they work!)

Зачем учить язык Си в 2024 году | Как выбрать между C или C++ или Rust | Podlodka Podcast #387

Зачем учить язык Си в 2024 году | Как выбрать между C или C++ или Rust | Podlodka Podcast #387

Easily Printing std::variant - C++ Weekly - Ep 464

Easily Printing std::variant - C++ Weekly - Ep 464

Running “Hello World!” in 10 VISUAL Programming Languages!

Running “Hello World!” in 10 VISUAL Programming Languages!

СОЛДАТ КНДР: ВТЕЧА/ВІЙНА В УКРАЇНІ/10 РОКІВ ШПИГУВАВ У ПІВНІЧНІЙ КОРЕЇ/ТОРГУЮТЬ НАРКОТИКАМИ І ЗБРОЄЮ

СОЛДАТ КНДР: ВТЕЧА/ВІЙНА В УКРАЇНІ/10 РОКІВ ШПИГУВАВ У ПІВНІЧНІЙ КОРЕЇ/ТОРГУЮТЬ НАРКОТИКАМИ І ЗБРОЄЮ

Cool Items!🥰 New Gadgets, Smart Appliances, Kitchen Tools Utensils, Home Cleaning, Beauty #shorts

Cool Items!🥰 New Gadgets, Smart Appliances, Kitchen Tools Utensils, Home Cleaning, Beauty #shorts

Пилот обманул смерть ракета пролетела рядом с ним #shorts

Пилот обманул смерть ракета пролетела рядом с ним #shorts

When you lose control of your Waboba Moon Ball. @TheWabobaTeam #wabobapartner

When you lose control of your Waboba Moon Ball. @TheWabobaTeam #wabobapartner

ДИЗЕЛЬ ШОУ 2024 🇺🇦 ❄️ ЗИМОВА ПРЕМ'ЄРА ❄️ 🇺🇦 ВИПУСК 154 на підтримку ЗСУ ⭐ Гумор ICTV від 13.12.2024

ДИЗЕЛЬ ШОУ 2024 🇺🇦 ❄️ ЗИМОВА ПРЕМ'ЄРА ❄️ 🇺🇦 ВИПУСК 154 на підтримку ЗСУ ⭐ Гумор ICTV від 13.12.2024

У ДЕТЕНЫША СТЕПЫ ИСЧЕЗ ГЛАЗИК

У ДЕТЕНЫША СТЕПЫ ИСЧЕЗ ГЛАЗИК

НА ЦЕ можна дивитись ВІЧНО! Такої ПАЛКОЇ зустрічі НІХТО НЕ ЧЕКАВ

НА ЦЕ можна дивитись ВІЧНО! Такої ПАЛКОЇ зустрічі НІХТО НЕ ЧЕКАВ

#JasonDeruloTV // Funny #GotPermissionToPost From @SofiManassyan #SlowLow

#JasonDeruloTV // Funny #GotPermissionToPost From @SofiManassyan #SlowLow