Characters, Symbols and the Unicode Miracle - Computerphile

Unicode Encoding! UTF-32, UCS-2, UTF-16, & UTF-8!

But, what is Virtual Memory?

How Strong Is Tape?

Психіатр Глузман УПЕРШЕ сканує Зеленського, Путіна й Трампа

Гениальное изобретение из обычного стаканчика!

What are UTF-8 and UTF-16? Working with Unicode encodings

Erik Wilde

Переглядів 24 312

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 24 гру 2024

КОМЕНТАРІ • 37

@lalpremi 11 місяців тому ⁺¹⁰
That is exactly what I want to know, showing some great tools. Thank you for sharing, and have a great day:-)
@ErikWilde 11 місяців тому ⁺²
Thank you!!
@vladislavkaras491 Місяць тому ⁺¹
That was great explanation!
Thanks!
@higiniofuentes2551 11 місяців тому ⁺²
Thank you for this very useful video!
@IndianaJoenz 4 місяці тому ⁺¹
Thank you for a great talk with useful visuals! I make a Unix program (durdraw) for drawing Unicode and other text art, and find myself working with different character encoding regularly. Perhaps I missed it, but Utf-8's backwards compatibility with ASCII is worth considering when choosing an encoding scheme. I also liked the useful "od" syntax. I rarely encounter Utf-16, but thanks to your video I will now be able to recognize it in a hex dump.
@qeetcode Рік тому ⁺²
Great explanation. Much appreciated.
@ErikWilde Рік тому
Thanks a lot, @qeetcode!
@brm901 2 роки тому ⁺⁷
great and informative video ; thanks
@ErikWilde 2 місяці тому
@@brm901, thanks for the kind words!
@nervocalm Рік тому ⁺¹
Excellent visual explanation! Couldn't be clearer! I didn't know that it would choose the correct length to each character. I thought it always has a fixed length. I really would like to know more, about this in general... Headers, BO, LE, etc. I also find it very interesting and very useful to work with ETL in data engineering. If you think of something else besides the links you already shared in the description please let me know. Thank you for making this video.
@VishwaMukh 2 місяці тому ⁺¹
Sir, Very well explained. Thanks.
@ErikWilde 2 місяці тому
@@VishwaMukh , thanks for the kind words!
@2bitsbyab Місяць тому ⁺¹
very good explanation thanks.
@flaviomelo7893 Рік тому ⁺¹
Hi Erik, congratulations on the video and thanks for sharing your knowledge. I am migrating an Oracle database on Solaris Sparc that is using UTF-16BE, while the destination uses UTF-8. In your opinion, what would be the best approach to converting the data source?
@ErikWilde Рік тому ⁺¹
Whatever migration tool you are using should really give you that option. If it does not give you that option I would look for a different tool.
@nournote Рік тому ⁺¹
Thanks. Very informative.
@Soupie62 6 місяців тому
If you have a CPU where every address is 16 bits wide, you may as well use UTF-16 as default. If memory is 8 bits wide, use UTF-8.
For 32 bit (or 64 bit) you can store multiple characters per RAM address, no matter what system you choose.
@ErikWilde 6 місяців тому
In the end, if you care about memory efficiency, UTF-8 may be the best choice if you mostly use ASCII characters. But there (sadly) is no generally best default choice.
@pazaresosset6348 4 місяці тому ⁺¹
thanks, very interesting video
@gersoncjunior 5 місяців тому
Thanks for sharing that!
@parsifal8232 Рік тому ⁺¹
6:29 please go into the details "byte order mark" in utf 16
@parsifal8232 Рік тому
or general into additional byte info for example in txt files, bom withaut bom, maby how to add additional info into a jpg file (without damaging it.) ..
@ErikWilde Рік тому ⁺²
A byte order mark depends on the format you are using. Specifically in Unicode the byte order mark talks about byte order in UTF-16. How to do it another day to four minutes is a very different question. For UTF-16, the byte order mark signals whether the Unicode file uses big endian or little endian format.
@akshardrashti 5 місяців тому
Please how do I find encoding of my file
@AshisRout-b4q Рік тому ⁺¹
you have a linkedin handle?
I find this very interesting
@ErikWilde Рік тому
www.linkedin.com/in/erikwilde
@LuisHernandez-dv4xu 2 роки тому ⁺¹
¡Muchas gracias!
@human4566vv Рік тому
Hi thanks man, thanks for the video
@gt10i 7 місяців тому
Danke!
@sabitkondakc9147 Рік тому
It seems that windows switched to utf8 either, speaking of win10 21H2 and later.
@ErikWilde Рік тому ⁺³
Nobody can escape globalization, sooner or later you have to support more than just ASCII or the fragmented ISO 8859 character sets. At that point, Unicode and very likely UTF-8 become your best friends.
@sabitkondakc9147 Рік тому
@@ErikWilde I'm having a hard time grasping the fact that native windows api only accepted utf-16 encoded strings up to day, such a rubbish decision!
This explains why windows takes up a huge RAM, not to mention that completely redundant cpu cost for the sake of utf transformation.
@MrJloa Рік тому
I wonder Microsoft's office still can't open files in utf8 😳
@صالحمحمد-ص2ك1ك Рік тому
Hi utf8.46
@RobertHernandez-t5q 2 місяці тому
Johnson Eric Thomas Jose Perez Elizabeth
@Tapajara Рік тому
UTF-16 should be abandoned because it is so problematical.
@ErikWilde Рік тому ⁺¹
Maybe it's problematic, but be prepared to have to deal with it for many years to come.

Наступне

Автоматичне відтворення

Characters, Symbols and the Unicode Miracle - Computerphile

Characters, Symbols and the Unicode Miracle - Computerphile

Unicode Encoding! UTF-32, UCS-2, UTF-16, & UTF-8!

Unicode Encoding! UTF-32, UCS-2, UTF-16, & UTF-8!

But, what is Virtual Memory?

But, what is Virtual Memory?

How Strong Is Tape?

How Strong Is Tape?

Психіатр Глузман УПЕРШЕ сканує Зеленського, Путіна й Трампа

Психіатр Глузман УПЕРШЕ сканує Зеленського, Путіна й Трампа

Гениальное изобретение из обычного стаканчика!

Гениальное изобретение из обычного стаканчика!

Анна Трінчер - Треш (Official Music Video)

Анна Трінчер - Треш (Official Music Video)

What is Unicode? How does it work and how do you use it?

What is Unicode? How does it work and how do you use it?

What is Data Mesh? Explained and easy to understand!

What is Data Mesh? Explained and easy to understand!

The 3 Laws of Writing Readable Code

The 3 Laws of Writing Readable Code

State of Webhooks: How Webhooks are used in APIs in 2023

State of Webhooks: How Webhooks are used in APIs in 2023

Ep 021: UTF-8 Encoding Examples

Ep 021: UTF-8 Encoding Examples

threading vs multiprocessing in python

threading vs multiprocessing in python

Password Storage Tier List: encryption, hashing, salting, bcrypt, and beyond

Password Storage Tier List: encryption, hashing, salting, bcrypt, and beyond

Unicode, in friendly terms: ASCII, UTF-8, code points, character encodings, and more

Unicode, in friendly terms: ASCII, UTF-8, code points, character encodings, and more

How computer processors run conditions and loops

How computer processors run conditions and loops

😯 Подарила сыну БМВ, но не ожидала такой реакции на машину! | Новостничок

😯 Подарила сыну БМВ, но не ожидала такой реакции на машину! | Новостничок

ふわふわシフォン大作戦🩷スイーツ戦隊のキラキラミッション✨【銀座コージーコーナー】 #shorts #シフォンケーキ #クリスマスケーキ #クリスマス #ケーキ #チョコケーキ #christmas

ふわふわシフォン大作戦🩷スイーツ戦隊のキラキラミッション✨【銀座コージーコーナー】 #shorts #シフォンケーキ #クリスマスケーキ #クリスマス #ケーキ #チョコケーキ #christmas

Рождение Немецкой Легенды - Mercedes 190E 2.3-16

Рождение Немецкой Легенды - Mercedes 190E 2.3-16

КТО НЕ ДВИНЕТСЯ, ПОЛУЧИТ МАШИНУ!

КТО НЕ ДВИНЕТСЯ, ПОЛУЧИТ МАШИНУ!

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

Cute Baby Ties Up Dad And Wants To Play With His Phone #funny #fatherhoodlove#cute#fatherhoodmoments

TOY STORY IN BRAWL STARS!?

TOY STORY IN BRAWL STARS!?

Lp. Сердце Вселенной #60 РОЖДЕНИЕ ЛОЛОЛОШКИ [Финал] • Майнкрафт

Lp. Сердце Вселенной #60 РОЖДЕНИЕ ЛОЛОЛОШКИ [Финал] • Майнкрафт

THE AMAZING DIGITAL CIRCUS - Ep 4: Fast Food Masquerade

THE AMAZING DIGITAL CIRCUS - Ep 4: Fast Food Masquerade