Thank you for the video, I really enjoyed watching it. I would, however, want to put an emphasis on the witchcraft that lets you rebuild the sorted array from a prefix sum, as this is the part that confused me for a bit, but I have finally figured it out and I want to help others as well. So, to the explanation. There are two major points to why radix sort works. The first is that on each iteration it sorts values by one digit. How does the prefix sum help? The idea is that the prefix sum (at index 7, exempli gratia) is the amount of numbers whose last digit is less than or equal to 7 (say this amount is 4). And the prefix sum at index 8 shows the amount of numbers whose last digit is less than or equal to 8 (let it be 6). So in an array sorted by last digit, the indices 0, 1, 2, 3 would be occupied by numbers with last digits less than or equal to 7, which clearly means that indices 4 and 5 would contain numbers with their last digit equal to 8. I really liked this trick, thank you for showing it to me. The second part of radix sort is its stability, but it's trivial compared to the witchcraft above.
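A minimal sketch of that witchcraft in C++, for one pass on the ones digit (names are illustrative, non-negative values assumed; the backward walk plus pre-decrement is what keeps equal digits from colliding):

```cpp
#include <array>
#include <vector>

// One stable pass of the counting trick described above, keyed on the
// ones digit. prefix[d] ends up as "how many elements have ones digit
// <= d", i.e. one past the last slot of digit d's bucket.
std::vector<int> sort_by_ones_digit(const std::vector<int>& in) {
    std::array<int, 10> prefix{};
    for (int v : in) ++prefix[v % 10];                       // digit counts
    for (int d = 1; d < 10; ++d) prefix[d] += prefix[d - 1]; // inclusive scan
    std::vector<int> out(in.size());
    for (std::size_t i = in.size(); i-- > 0;)                // backwards = stable
        out[--prefix[in[i] % 10]] = in[i];                   // pre-decrement gives the final index
    return out;
}
```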
Also, one thing worth noting: for input numbers with lots of digits, it takes some time and processor power to extract each digit. And for very long numbers like 32-bit integers, which can be as high as 4,294,967,295 in standard binary encoding, that is a lot of passes you need to make: 10 in total if you go decimal digit by decimal digit.
I think for 32-bit numbers, you'd usually only consider 8 bits at a time, so 256 counters and four passes would do it. Nobody would consider sorting 32-bit numbers in decimal. It just makes no sense when bytes are easily extracted.
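A sketch of that byte-at-a-time approach, under the stated assumptions (32-bit unsigned keys, 256 counters, four stable passes; names are illustrative):

```cpp
#include <cstdint>
#include <vector>

// LSD radix sort on unsigned 32-bit keys: one counting pass and one
// stable scatter per byte, digits extracted by shift-and-mask.
void radix_sort_u32(std::vector<std::uint32_t>& a) {
    std::vector<std::uint32_t> tmp(a.size());
    for (int shift = 0; shift < 32; shift += 8) {
        std::size_t count[256] = {};
        for (std::uint32_t v : a) ++count[(v >> shift) & 0xFF];  // byte counts
        for (int d = 1; d < 256; ++d) count[d] += count[d - 1];  // prefix sums
        for (std::size_t i = a.size(); i-- > 0;)                 // backward = stable
            tmp[--count[(a[i] >> shift) & 0xFF]] = a[i];
        a.swap(tmp);            // this pass's output becomes the next input
    }
}
```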
I did my master's thesis on this theme 6 years ago: a comparison of sorting algorithms and how parallel programming benefits them at different list sizes, so 1 thread vs 4 threads (I had an Intel i5). Radix sort (with buckets) was one of them :D
I'm old enough to remember how the old IBM punched card machines were used for sorting. It was a radix sort, and the "buckets" were actual buckets or slots that a card would fall into if, say, column 35 were punched with a 3. You'd then gather the 10 buckets up in order, and run the deck through again, sorting on column 36 (or whatever).
I feel like there's a better way to generate the prefix sums: A. Set the prefix for 0 to 0. B. Iterating n from 0 to N-1, set prefix[n+1]=prefix[n]+count[n]. Doing this yields 0-based indices to the location to store the first item of each n, instead of 1-based indices to where to store the last. Now you can iterate over the original data left to right again, storing each in the slot for its prefix, then increment that prefix. I know, both methods yield exactly the same result. This way just feels better to me.
@@jursamaj What are you talking about? Look at the next video, he is clearly decrementing counters for each number contained in the bucket... Otherwise how else would you know where to put 2 elements ending with the same digit in the output array? Without decrementing the counter each time you would put them in the same spot. Your method is totally redundant, since the only difference it would make is counter[i]-- vs --counter[i].
@@panjak323 Sorry, but you didn't understand what I said above. Creel has to fill the array backwards & pre-decrement the indices because his calculated indices point past the end of each bucket. My method points to the beginning of each bucket, so you fill the array forward and post-increment the indices.
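For anyone following this thread, a sketch of the forward-filling variant being described (one decimal pass; illustrative names, non-negative values assumed, and the output vector must already be sized to match the input):

```cpp
#include <array>
#include <vector>

// Forward variant: prefix[d] holds the index of the FIRST slot of digit
// d's bucket (an exclusive scan), so the input is scanned left to right
// and each bucket index is post-incremented after use.
void radix_pass_forward(const std::vector<int>& in, std::vector<int>& out,
                        int divisor) {
    std::array<std::size_t, 10> prefix{};
    for (int v : in) ++prefix[(v / divisor) % 10];    // digit counts
    std::size_t running = 0;
    for (auto& p : prefix) {                          // exclusive prefix scan
        std::size_t c = p;
        p = running;
        running += c;
    }
    for (int v : in)                                  // forward = still stable
        out[prefix[(v / divisor) % 10]++] = v;
}
```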
Amazing mini-series! Thanks a ton for taking the time and effort to make the beautiful presentations. Much appreciated. And, on a different note, is that a cat holding a cricket bat? I'm talking about the picture behind you.
Oh lol, I once implemented that by fiddling around with sorting algorithms, and I didn't know it was called radix sort; same idea, same outcome. Nice to know. Edit: you can just allocate the counting array on the stack, and do radix sort in place by swapping elements around. However, I am not sure that would work when starting with the lower digits - but it sure does if you start with the highest digits. Also, once the first iteration is done in that case, the problem is effectively localized and you only need to sort the patches between themselves. If you do it like this, you do not need any expensive allocations.
Well, strictly speaking, it's not O(n). It's O((n + b) · log k / log b), where k is the largest number you're sorting and b is the base of its integer representation. Because of this, it might be better to use a base-2 representation to minimize the b term, or to use base-10 or even something like base-256 representation to maximize the divisor log b. So assuming that k and b are constants is not always precise, because you might actually want to change them to make the algorithm run at maximum speed.
Right you are! The third video shows the code and performance of a base-256 Radix Sort! I agree that the performance is logarithmic when sorting data of arbitrary lengths. It's only O(n) when the length of the keys is fixed. Cheers for watching anywho, have a good one :)
I would call it O(N log R), where log R is the number of bits in the key (R being the range of values). No need to change base, as that is just a constant factor that gets simplified away by the definition of big O. But I agree that it's not fully honest to call it O(N).
Radix sort is exactly how punched card sorters worked in ancient times. The machine had a bin for each punch position. (There were 12 punch positions in each column.) The card deck was run through the sorter once for each digit position, and the bucket contents picked up in order, the deck restacked, and run through the sorter for the next punch position until the entire sort key had been covered. Sorting a couple of thousand cards could be done quite quickly.
astonishing!
Such equipment went by the name Unit Record Machines. This category of hardware included sorters, simple calculation tools (adders, multipliers, subtracts and dividers, for instance) and printers.
Bucket sort at work being efficient !
I worked in a clearing bank in London back in the '60s and we used the same technique to sort cheques by account number, by magnetising the digits on the cheques and reading them. Eight passes for the account number which also sorted in branch order. The cheques had already had anything other than those that belonged to the bank removed and sent to the other bank. London cheque clearing in action.
Those were the days!!!
This is literally the only channel on youtube about programming that i watch solely because the videos are made so engaging and fun, and not because i have to prepare for test.
Best channel
I’m just a random 20yo student and I discovered this channel out of YT’s algorithm
Loved it after only one vid, subbed and have been binge watching this on this rainy sunday afternoon, sipping coffee.
My life is pointless but at least I’m learning cool stuff
Hey mate, your life isn't pointless! I found this channel last night, and it's been great. I "graduated early" with a 2-year degree in electrical engineering a long while ago, never used it outside of school unfortunately.. and today I'm employed as a full stack dev with 7+ years of professional experience in that position. Life can be crazy, and I don't know much about it (yet!), but I know for sure, your life isn't pointless :) This is an old post, but hit me up if you wanna chat, or have any questions about anything. Much love.
if you "really" learn this, by which I mean you understand to the degree that you can explain it to others well by yourself, you are already on track to a great career as a good software engineer. In time, you'll build yourself a good career at a software company like Microsoft or Google. Regardless of whether you like the companies, you'll at least set yourself up for a good salary which'll enable you to do what you like. Keep it up!! - source : I'm a senior software engineer at Google myself.
Cringe
Ok, this just randomly turns up in my YT feed for no apparent reason, even though it's old and I haven't watched a coding video in literally years. But thanks, algorithm, for knowing me better than I know myself, because I loved it!
I've seen many youtube videos and papers on this matter and I must say that I never have seen sorting algorithms explained so simply and with such passion!
I really liked how you explained the logic behind comparison based sorts on part 1, it's really useful to understand sorting networks and their limitations.
You are an excellent teacher and hands down one of the best youtube channels on this matter!
Btw, just a note: to implement radix sort for signed integers you just need to do the prefix sum on the last digit (the one that includes the sign bit) in a different order: going from the middle of the array to the end, and then from the start to the middle again, so that the digits starting with 1 (the negatives) appear first.
For floats the idea is similar, but instead, you also have to do the first partition of the prefix sum in reverse order (going from the end of the array to the middle, and then from the start to the middle again) and then you adjust the scattering part so it decrements the pointers corresponding to numbers from the first partition. There are also bit flipping solutions, so you don't need an extra radix pass.
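A sketch of the bit-flipping solution mentioned at the end, assuming IEEE-754 floats and C++20's std::bit_cast (the helper name is illustrative; note NaNs end up at the extremes):

```cpp
#include <bit>
#include <cstdint>

// Map a float to an unsigned key whose plain unsigned order matches the
// float's numeric order, so an ordinary unsigned radix sort handles
// sign and magnitude for free.
std::uint32_t float_key(float f) {
    std::uint32_t u = std::bit_cast<std::uint32_t>(f);
    // Negatives: flip every bit (reverses their order and places them first).
    // Positives: flip only the sign bit (places them after the negatives).
    return (u & 0x80000000u) ? ~u : (u | 0x80000000u);
}
```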
Just have counts from -9 to 9, e.g. 19 counts.
@@Wyld1one It won't work: we start from the ones place, and if you want to do -9 to 9 it'll add some overhead, such as adding 9 to index your counts etc., and you would also have to check for the negative sign every digit cycle.
A few years ago I was working on a project to create a bounding volume hierarchy for collision detection, and as part of organizing the objects (along a space-filling curve for efficiency's sake) I followed some technical papers to create a GPU-based, parallel binary radix sort. It was fast, and the particles that were being collided never had to leave GPU memory. But it was a right beast.
That's why I'm here, octrees!
It takes a highly specific kind of nerd to call that awesome, but god damn, that is awesome!
Decrementing the prefix sum serves two purposes: 1) converting to a zero-based index, and 2) avoiding collisions in the output array. The latter is accomplished by updating the value stored in the prefix sum array. This is a crucial point that should be stated explicitly.
You are right! Thank you for watching :)
I was confused at first, but the visualisation made it really easy to realise you can just increment the counter!
MIND. BLOWN. Never have I been so sad that my sorting list is short. xD
This year has been really nerdy for me. I learned Python, Java, and i'm watching this awesome series!
That's awesome good for you
I code in C++ and my whole life is nerdy)
Awesome, you should try C, C++ and your CPU's assembly language to see what goes on under the hood
@@davidlynch4202 I am learning C and C++, but x86 assembly is way out of my scope right now.
@@bluesillybeard thats what i thought too before I started trying to learn it :P you could figure it out
Was blown away by count sort. Simple yet genious idea. Thanks for this video.
Seriously mate, this series of yours... I am a CS student in my 3rd semester. We haven't started Data Structures and Algorithms yet, but will soon (from the 15th of Sept). And I am feeling so lucky that you are making such a great series of videos on algorithms! This counting and radix sort got me curious to understand all types of sorting and other algorithms. All thanks to you, mate :)
linked lists take a lot of memory
me a python programmer:
i have never heard of that guy
me, a Java programmer {
you missed the braces and semicolons;
}
@@jacknguyen5220 I doubt you are a Java Programmer... There is neither a class declaration nor any method indicator ;)
Meanwhile, in Perl: !#$%&(%$%^#@#%&%^@#*
Haskell: wait, there's an alternative?
me, an assembly programmer: what's a list?
This is like the big interview you get after winning a championship, perfect!
Awesome stuff.
Around 15:30, when talking about how to apply the prefix sum, I was thinking of the interpretation that the prefix sum is the running total of numbers with lower digits. So in the first iteration, for 721, which ends with 1, you are looking up how many items end with 1 or lower. And for 779 you are looking up how many numbers end with 9 or lower, etc. For each number, that tells you how many elements need to be placed before that number, which also tells you the index of where to place it.
This has become one of my favourite comp sci channel. And this is not a small achievement, given the number of great resources on youtube.
I love the way you present yourself and talk about these subjects. I've learned great things on this channel that I didn't even ask to learn, but just watched for entertainment xD
Your enthusiasm for sorting algorithms and computing in general is really engaging.
Right after I graduated from university, I did some travelling and ended up doing odd jobs at a travel tour company. It happened to be the end of the fiscal year and given that this was more decades ago than I'd like to admit, they had a mountain of physical files that needed sorting. Usually they spent 2 whole weeks to sort the files. They asked me to sort it. I can't remember exactly how I set it up, but I took over several desks and made my "buckets". Bucket sort works *really* well with physical files because concatenating them back into the original pile is simple :-). I got it all done in an afternoon and after that they thought I was a genius :-D
seriously. when someone hands you several file cabinets of paper, the first step is to sit down and think.
absolutely beautiful, human ingenuity at its finest
Your energy and accent remind me of Steve Irwin!
"Crikey! It's a radix sort! Now, we'll just sneak up behind heeeem and grab heeeem behind the head....... "
Cheers from New Zealand!
That hit close to the heart . . . XD
I love watching those animated sort algorithm demonstrations, and Radix sort was always one that looked neat... but your explanation is fantastic! I finally understand it, and goddamn, is it freaking clever! :D
You're a natural! Nice and concise and clear! Kudos for that!
Tell me you're a high school, college, or university teacher!
Best channel on youtube hands down
the prefix sum part is SO COOL thank you for this great video mr creel
Very informative and easy to understand! I think it was well-paced for beginners, as well as for people to whom the particular topic is novel, but the concepts familiar.
It makes perfect sense as you explain it. It reminds me of when you sort by multiple fields, you do the most important last.
Man... the things YouTube wants me to learn are amazing... Thank you for taking the time to make this topic clear, understandable and interesting.
Radix sort is O(d * n), where d is the length of the elements. For sorting elements whose length is shorter than log(n), radix sort absolutely blazes through the list. If sorting a list whose elements' lengths are longer than log(n), radix sort actually performs worse than its contemporaries. Given that a randomly generated list of unique packed keys has a longest length of log(n) (you eventually run out of unique elements at any given length and have to add another digit), the d * n starts looking suspiciously similar to n log n.
However if you have a very long list of short, non-unique keys (if you wanted to sort a list of dice rolls or something I guess) this is absolutely the best way to do it!
If you have to sort a list of dice rolls, just count them in an array of 6 bins. This will most likely be faster than radix sort. If for some reason you want a long list instead of a compact array with the same information, you can always generate it from the bin array in O(n) time.
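A minimal sketch of that six-bin idea (illustrative names; assumes faces 1..6):

```cpp
#include <array>
#include <vector>

// One counting pass into six bins, then (only if a full list is wanted)
// regenerate the sorted list from the bins in O(n).
std::vector<int> sort_dice(const std::vector<int>& rolls) {
    std::array<std::size_t, 7> bins{};        // indices 1..6 used
    for (int r : rolls) ++bins[r];            // O(n) counting pass
    std::vector<int> sorted;
    sorted.reserve(rolls.size());
    for (int face = 1; face <= 6; ++face)
        sorted.insert(sorted.end(), bins[face], face);
    return sorted;
}
```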
10:36 “0 to 9!” That’s quite the range for the array 😉
That's a fact ...
... orial
some amps even go to 11 !
I discovered your channel like yesterday, and as a game programmer in training I love your videos, you always explain things perfectly. Keep up the good work
I was puzzled at first, but the final part of the video cleared it up for me. Brilliant algorithm.
I know what you mean! It's such a clever trick! Cheers for watching brus :)
I think I am in love with counting sort
Okay, this is awesome! I've been trying to figure out a smart way to implement Radix Sort for a while, and you just gave me a way! Thank you for the awesome videos, keep doing awesome!
No worries mate! Links to the original sources as well as my own version of the code will be provided in the final video. I hope to upload it later today. Thank you for watching :)
Brilliant, and such an awesome explanation. The only improvement would be to mention that you subtract 1 from the prefix sum *each* time you move an input to the output, not only once (all the talk of moving from 0- to 1-based counting confused that part for me). Slow-mo'ing the part where you go through the rest of the items real fast helped me see what you were doing to make it work. Thanks for sharing!
An alternative approach would be to shift all the counts to the right and set the count of 0 to zero. In this approach, you would iterate from left to right and increment the counts instead of decrementing them. You would not have to convert between base 0 and 1 in this approach.
Radix sort is truly a sort without comparison.
Get it? Because Radix doesn't do comparisons
Like discrete mathematics is the one that counts the most
Glad *that's* sorted!
I see what you did there
Now, there's a time and a space for this sort of thing ...
Great video series. Radix getting the attention it deserves!
I would point out though that I don't think that Radix is a compromise between speed and memory (when compared to Counting Sort). If k is (1 + max_element - min_element), then counting sort has O(k) space complexity **and** O(n+k) time complexity (auxiliary array requires linear traversal: first to initialise to 0, then to load the solutions). This is just as much of a time problem as a memory problem, so Radix helps to handle both of these.
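A sketch of the counting sort being discussed, with k = 1 + max_element - min_element (illustrative names; assumes the value range fits comfortably in memory):

```cpp
#include <algorithm>
#include <vector>

// Counting sort: O(k) extra space, O(n + k) time -- one pass to count,
// one pass over the k counters to rebuild the output.
void counting_sort(std::vector<int>& a) {
    if (a.empty()) return;
    auto [mn_it, mx_it] = std::minmax_element(a.begin(), a.end());
    const int lo = *mn_it, hi = *mx_it;
    std::vector<std::size_t> count(
        static_cast<std::size_t>(static_cast<long long>(hi) - lo) + 1, 0);
    for (int v : a) ++count[v - lo];               // O(n) counting pass
    std::size_t out = 0;
    for (std::size_t d = 0; d < count.size(); ++d) // O(n + k) rebuild
        for (std::size_t c = 0; c < count[d]; ++c)
            a[out++] = lo + static_cast<int>(d);
}
```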
Beautiful content, Thanks for the amazing explanation and visuals, it's clear that you love what you are teaching and that makes the videos more than perfect
Radix sort is plain magic, and you've explained it perfectly ö
Don't know why youtube recommended me this, but for once the algorithm got it right! Didn't know i wanted to know this.
Perfectly explained, great step-by-step examples. Thank you for the content
Back in the day, I encountered a radix-2 sort that was coded for a PDP-8 with two LincTapes (or DECtapes, which were just LINCtapes rebranded). Random access tapes - there was a timing track so that you could read and write anywhere, not just at the end of the tape. The buckets were accumulated in odd- and even-numbered sectors on the output tape. There was a weird trick where alternating passes through the tapes read the tapes backward. Nothing like sorting a megabyte of data on a machine with 16 kilobytes of core memory!
When I got a job as a mainframer, I encountered five-tape read-backward polyphase merge. Insane!
Thank you man.
I discovered radix sort in that crazy guy's videos of animated sorting algorithms with sound, and those are a lot of fun to watch, but now I finally understand how it works. It's a kind of magic ;).
An excellent explanation and demo of radix sort. Quality YouTube content this is.
I agree, very clearly explained. After watching this video (plus the previous video's reference to a mask and right shift), I wrote a working 16-bucket radix sort in 10 minutes.
thanks mate i appreciate your approach to these videos. you are very human and i can really get a feel for the worth / importance of each thing you mention. thanks man
You did such a good job explaining this! I’m just a self taught programmer but i had no problem understanding this! And it’s really interesting too!
This is the first time I've heard of counting and radix sorts. I've always assumed that quicksort and merge sort were the fastest sorts. But once you began describing counting sorts I quickly got the idea of how it worked.
Hehe, there are a lot of sorting algorithms that are way faster or have better worst cases than the usual default implementations, but they usually make assumptions about the data and may require more memory. Quicksort doesn't require any extra memory. Another example with a better worst case ("faster big O") that works with pretty much any comparable data, only takes up about twice the memory, and is easily implemented and understood, is tree sort. Just insert items into a self-balancing tree (O(log n) per insert), then traverse the tree in order, for O(n log n) overall. That's a very consistent sort: best, average and worst case are all the same.
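A minimal sketch of that tree sort, assuming std::multiset's usual red-black-tree implementation (names illustrative):

```cpp
#include <set>
#include <vector>

// Tree sort: n balanced-tree inserts at O(log n) each, then an in-order
// traversal in O(n) -- O(n log n) in the best, average and worst case.
std::vector<int> tree_sort(const std::vector<int>& in) {
    std::multiset<int> tree(in.begin(), in.end());     // n balanced inserts
    return std::vector<int>(tree.begin(), tree.end()); // in-order walk is sorted
}
```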
Great video!
I'd never looked at how radix sort works under the hood. I paused the video at 16:36 and I started prototyping a radix sort algorithm... And it works! It is even faster than std::sort!
Two points though:
- is there a clever way to know when the array is sorted to avoid unnecessary iterations?
- can this algorithm work with a user defined comparison function?
I love you my guy
Ha! You too
Same, this series is so interesting
What an underrated channel!
This was very good. I wish that more had been put into explaining the prefix sum - some of the mechanics were quite obvious and didn't need as much explanation as that did. I know it's a lot of work to do these things so thank you and at least it has given me a question to find an answer for.
ah, Ding! now I get it - after writing that...duh. :-)
So the overall lesson is: know your data.
The digits of a base-10 number, as well as the bytes or nibbles or even bits of a base-2 number, contain much more information than just x < y. Because every digit (or byte, bit...) is already classifying the number according to some power of 10 (or 2). If you can exploit that, you don't need to perform N log N comparisons. In fact, I think the same technique can be applied to fixed- or bounded-width strings, dates and times, tuples...
By the way, I didn't know N log N came from log N!, that's very interesting. Here's a simple proof I found online (not taking the credit)
Proof sketch that O(log n!) = O(n log n):
log n! = log(1 · 2 ⋯ n) = log 1 + log 2 + ⋯ + log n ≤ n · log n, so log n! ∊ O(n log n).
Conversely, log 1 + log 2 + ⋯ + log n ≥ log ⌈n/2⌉ + ⋯ + log n ≥ ⌊n/2⌋ · log ⌊n/2⌋ ∊ Ω(n log n).
Therefore O(log n!) = O(n log n) ∎
10:41: "We've got the input list just here... same elements as before, I think..." "Well, you wrote it, mate!"
=)) What's a Creel playing both the teacher and the student for our amusement.
Question: for integers, if you know your endianness, isn't it easier to simply build the counter array with a power-of-two number of buckets (like 8 or 16)? And then, instead of doing mod 10, simply read the relevant 3-4 bits, put the number in the relevant bucket, and so on. You also have to make sure to handle the sign bit in its own step. Also, would it help to construct all the count arrays at once, so you only traverse the array forward once? It uses a bit more memory, but arrays of 16 integers... doesn't sound that bad. The worst case for normal integers would be comparing 64-bit signed integers, and that would amount to 64/4 + 1 = 17 count arrays (the extra one is for the sign count array).
Cheers!
Sry, just realized I replied to the wrong comment yesterday, haha! My response didn't make any sense. Anyway, thank you for watching :)
@@WhatsACreel No, it made sense. Having ANDs instead of DIVs (well, mod by 10 I think would've been for taking the digits in base 10)
@@WhatsACreel Performing AND on the least-significant 4 bits, then AND with SHR on the most-significant 4 bits, feeds the bucket counting and sorting process used on byte-size numbers! If needed, scale up by groups of 4 bits!
Superb explanation! By far the most intuitive explanation of radix sort I've watched. Thank you!
One of the best sorting videos I have seen so far. Great work!
I haven't seen other videos from this channel yet, but this one is enough to make me subscribe! ))
This is a man who loves it.
G'day, matey! Thank you again, here's one for the algorithm ! Btw, loved the little joke between you and the presentation with the same different set of numbers, haha.
That was a really neat explanation of a smart sorting method. Thank you for sharing.
8.5k likes with zero dislikes is very impressive. Well done!
I was implementing radix sort on my own and I was surprised when my algorithm handled a total of 10GB of RAM (not at the same time, of course, but as the sum of all allocating and deallocating operations).
To improve it, I first tried a better approach for the bucket vector: instead of allocating 1 extra position per push operation, I would double the allocated size and use a second length field to measure the vector. With that, I reduced the 10GB of total handled RAM to 400MB.
Watching the video, an idea instantly clicked in my mind when I saw the digit counting array: "I CAN PREDICT THE BUCKET SIZE PER ITERATION!!!!" Then I stopped the video and did it. Reduced from 400MB to 100MB total handled memory.
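A sketch of that "predict the bucket size" trick (illustrative names and decimal digits; the counting pass tells each bucket its exact capacity, so the pushes never reallocate):

```cpp
#include <array>
#include <vector>

// Count digit occurrences first, then reserve each bucket's exact
// capacity before filling -- one allocation per bucket per pass.
void fill_buckets(const std::vector<int>& in, int divisor,
                  std::array<std::vector<int>, 10>& buckets) {
    std::array<std::size_t, 10> counts{};
    for (int v : in) ++counts[(v / divisor) % 10];  // predict the sizes
    for (int d = 0; d < 10; ++d) {
        buckets[d].clear();
        buckets[d].reserve(counts[d]);              // one exact allocation each
    }
    for (int v : in) buckets[(v / divisor) % 10].push_back(v);
}
```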
Just watched the first one, nice job posting the second part so quickly
You sorted it clearly in our brains !!!!!!!!
I would like to see a pinned comment about repeating digits when decreasing the prefix index... I did not realize that you do not reset the prefix index and instead "accumulate" the -1s, so that identical digits don't get the same spot... I had to go through many comments before I found the answer to that question...
The rest of the video is really informative, thank you!
Hearing "Good stuff!" takes me back to 2014-2015 :D
awesome stuff! also you're really good at explaining all this, it's all so simple to understand compared to some other explanations out there, awesome work!
Well done, loved the graphic work you put into your videos. Of course, some of the radix sorting can be done in parallel.
Indeed it can, yes! Cheers for watching mate, have a good one :)
13:24 The moment I realize the GENIUS of this technique.
This is AMAZING
Could you elaborate your insight please?
@@Darkev77 He was talking about the linked list approach, with the downside of the overhead. I was picturing how to lay out the data structure in an array, remembering how a heap is actually stored in an array. The problem is knowing where to start each of the buckets. That's when I noticed the sums are the indices that point to each bucket 🤯. So he solves the allocation problem.
Watched the first video and was going to propose what you called "counting sort"; my idea was to determine min/max and only use counts that fall in that range.
The complexity of the radix algorithm depends on the size of the largest number. So if you had just one maximum-value 64-bit integer, you would be running the counting algorithm 19 times.
Not really. If you do MSB radix sort it is perfectly possible to skip empty buckets and buckets of size one.
As usual, real performance depends on implementation and data distribution.
Nothing stopping you from using 100, 1000 or 10,000 counters. Gets you down to ten, seven or five passes. 22 upvotes for you, but not a single bright bulb in the crowd for two years - yikes!
No? Because you will quickly run out of memory
@@jacey0479 You are mistaken. Why do you think memory will be exhausted?
Your counters have to be < the size of the list. That is the problem with radix sort: trying to find a balance between memory allocation and speed. Obviously it also isn't good for small lists.
The "bucket" version of radix sort is exactly how you sorted cards in an old mechanical punch card sorter. It took as many passes as there are digits. So you had to pull the cards out of the output stacks and stack them up in the input.
Programs stored on punch cards would often be numbered in steps of ten in the last 8 columns. This way, to insert a line after line #1000 of a 100-card stack, you would number the new card 1001.
If you go with 16 counters (AKA 4 bit digits), the counters can be on the stack. On most machines, allocating 16 words of stack takes no extra time at the routine entry.
Sorting strings of text or other large things can often be more efficient if you make arrays of pointers.
That's right! I forgot about that... anyone who ever dropped their program card deck (or whose wisecracking buddy gave it a toss) but was wise enough to put line numbers on the cards knows all about this.
@@rhymereason3449
The new discussion of it is sort of an "everything old is new again" thing. The cards were invented for the US census before computers were machines. The mechanical "tabulator" let the government work out how many plumbers owned cats without any electronics needed.
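A minimal sketch of the arrays-of-pointers remark a few comments up: sort cheap pointers instead of moving the large strings themselves (std::sort is used here for brevity; the same pointer view works just as well under a radix pass over the pointed-to bytes):

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Build a sorted "view" of the strings without copying or moving them.
std::vector<const std::string*>
sorted_view(const std::vector<std::string>& items) {
    std::vector<const std::string*> view;
    view.reserve(items.size());
    for (const auto& s : items) view.push_back(&s);   // pointers, not copies
    std::sort(view.begin(), view.end(),
              [](const std::string* a, const std::string* b) { return *a < *b; });
    return view;
}
```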
The visualization in the video is very clear and easy to understand. Programmers may be good at algorithms, but not many of them are good at explaining.
Some clarification:
6:35 "Hey presto, we haven't got anything sorted at all" (after the first pass). This glosses over the important fact that at the end of the first pass, the last digits *are* sorted. Indeed, each pass sorts the whole list via the digit being scanned in that pass.
Another key feature that should be mentioned is that the method of stacking the buckets preserves the order the numbers were already in, and thus preserves the lower-digit ordering(s) imposed by the earlier passes even as you're sorting the new current digit. And so does the method of unstacking the buckets back to the output array. Without this, each new pass could scramble and lose the low-digit ordering imposed by the previous passes. But with it, the final (highest-digit) pass is guaranteed to leave everything in a fully sorted condition, since it preserves the already-ordered runs of the lower digits.
This property (of preserving the prior ordering of same/bucketed items during a new sort) is called "sort stability", and is very useful when constructing multipass algorithms. Meanwhile, most efficient sort algorithms are "unstable" and don't guarantee preserving any prior ordering. For example, if you quicksort a list of employees by name, and then quicksort that by sex, the resulting list will have all the males together, but their names will likely end up scrambled again. But if you use a stable sort algorithm and do the name sort followed by sex sort two-pass process, the guys on the final list will still have their names alphabetized.
Incidentally, you can do a "stable" sort even with an "unstable" sorting algorithm by including the item index number in the comparison key. In the above example, don't just sort employees by "sex" on the second pass, generate temporary "line numbers" for the alphabetically sorted list (1 for first item, 2 for second item, etc.) and then sort by "sex"+"line number".
(In this simplified example, you could of course just do a one-pass sort on the key "sex"+"name" to get the job done, but most cases of multipass sorting aren't so easily collapsed, and that's where stable sorting and the line-number trick become quite useful.)
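A minimal sketch of that two-pass stable sort, with an illustrative Employee record (std::stable_sort guarantees exactly the order-preserving property described above):

```cpp
#include <algorithm>
#include <string>
#include <vector>

struct Employee { std::string name; char sex; };  // illustrative record

// Alphabetize first, then stably group by sex: names stay alphabetized
// within each group, because the second pass preserves the first.
void group_by_sex_keep_names_sorted(std::vector<Employee>& staff) {
    std::stable_sort(staff.begin(), staff.end(),
        [](const Employee& a, const Employee& b) { return a.name < b.name; });
    std::stable_sort(staff.begin(), staff.end(),
        [](const Employee& a, const Employee& b) { return a.sex < b.sex; });
}
```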
I have a large array that's been waiting for this sort
Seems like radix-sorting strings would work too, as long as they are made from a defined alphabet?
Indeed it does! A viewer from the last video mentioned he did a talk on implementing Radix Sort for strings at JuliaCon last year! His talk can be viewed at: ua-cam.com/video/C65vYWeN7-k/v-deo.html
It's actually potentially much better with strings because comparisons can take a lot of time with strings, but accessing a single character is fast.
The bad news is you probably can't make this work with unicode easily. A string comparison in unicode is already an unsolved problem so you may not care about that.
@@meneldal You don't need your string comparison to be particularly meaningful to use it in a sort. You just need it to be consistent.
@@vylbird8014 The problem in Unicode is two string can look identical when you render them but don't have the same representation. Even normalization doesn't deal with every subtle thing.
So while it will be consistent, it may give you results you don't want.
@@meneldal You just need to transform your Unicode strings into byte strings such that the desired ordering is preserved. For example, the C standard library includes strxfrm() and wcsxfrm() for exactly that purpose.
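A rough sketch of that approach in C++, assuming the process has set a collation locale first (e.g. std::setlocale(LC_COLLATE, "") at startup). strxfrm produces a byte key whose plain byte-by-byte order matches the locale's collation order, so a byte-wise radix sort on the keys orders the original strings:

```cpp
#include <clocale>
#include <cstddef>
#include <cstring>
#include <string>

// Turn a locale-aware string into a byte key: comparing (or radix-sorting)
// these keys byte-by-byte gives the same order that strcoll() would give
// on the original strings, under the current LC_COLLATE locale.
std::string collationKey(const std::string& s) {
    // A null destination asks strxfrm how large the transformed key will be.
    std::size_t n = std::strxfrm(nullptr, s.c_str(), 0) + 1;
    std::string key(n, '\0');
    std::strxfrm(&key[0], s.c_str(), n);
    key.resize(n - 1);  // drop the trailing NUL
    return key;
}
```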
For people starting out: the first algorithm as expressed does do comparisons, just not between the elements; it compares each number's digit against the bucket values, as in "is it a 1?". The reason he can correctly say there are no comparisons is that he has the implementation in his head: something like m[n] += 1, where m is the count vector and n is the element from the input array. So yes, it's linear. Notice that this works because we are exploiting the nature of the data.
Thank you youtube algorithm, it has decided that I needed to watch this today, and it was right.
Mate, unreal content. I've loved your channel since your C++ beginner videos, but this stuff is top!
You can iterate the input array from the left if you use an exclusive prefix scan instead (add up all the elements before, but not including, the current slot).
Great explanation! Thank you! One small remark: unfortunately, by adding some offset to a float and then subtracting the same value, you will not necessarily end up with the original float value. Worse still, adding an offset to some float values can make them indistinguishable.
When building the prefix sum, if you offset it by one index (the sum at index 0 is 0, the sum at index 1 is the count of 0s, and so on), then you can iterate from left to right when building the output array: place each element at the index specified by the prefix sum, then increment that entry.
This is absolutely genius. And a great video, congrats!
For computers, it might be more sensible to use a binary (power-of-two) base for the bucketing, so you can use faster bit-shift and bitwise operations to extract each digit. This works directly for unsigned numbers; signed values in two's complement need extra handling.
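For instance, with base 256 each "digit" is one byte, so extracting it is a shift and a mask rather than a divide. A small sketch, including the usual sign-bit flip (assuming two's complement keys) that maps signed order onto unsigned order:

```cpp
#include <cstdint>

// Digit 'pass' of a 32-bit key in base 256: a shift and a mask, no division.
inline unsigned digit(std::uint32_t x, int pass) {   // pass = 0..3
    return (x >> (8 * pass)) & 0xFFu;
}

// For signed two's complement values, flipping the sign bit first makes
// plain unsigned ordering match the signed ordering.
inline std::uint32_t signedKey(std::int32_t x) {
    return static_cast<std::uint32_t>(x) ^ 0x80000000u;
}
```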
It can be used on variable-length strings too. Numbers are just strings over an alphabet of two symbols (0 and 1).
A shortcut, too: if you run out of digits, keep an extra count to indicate "already sorted" and treat it like -1. Then you can skip all the items in the next round (bin) that were -1. (Character strings go left to right, numbers right to left.)
Beautiful; a work of art. Thank you Creel!
If you choose the memory addressing before reading the input data into the computer, you do not need any sorting algorithm to get sorted output. That is the fastest way of getting sorted data without any sorting algorithm, and it is possible because memory addresses are already ordered. I implemented it in C, which is how I know it works, and it simplifies many other things too (comparing numbers and strings). For instance, you do not need a mathematical formula to decide which number or string is greater; that is determined by which string or number is read from memory first. There is also a very nice information overlap in memory which does not lose information and makes memory much more efficient without compression.
I think you're describing an insertion sort there.
If you subtract one from the first element when building the prefix sum (making sure to use a wrapping subtract to avoid overflow, unless you're using signed numbers), it should remove the awkward -1 at each step.
It wasn't explained very well in the video, but you are actually subtracting 1 from that digit's prefix *each* time you copy a value from the source array to the destination array. Let's say your original array has three "5"s, and the starting prefix for that digit is 9. The first time you add the "5" to the destination array, you subtract 1 from the prefix, placing it at index 8, but the 2nd time you add the "5" to the destination array, you subtract 1 AGAIN from the prefix, making it index 7, and so on.
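In code, one decimal pass might look like the following sketch (the divisor parameter selects the digit: 1, 10, 100, ...); note the pre-decrement that the reply above describes:

```cpp
#include <cstddef>
#include <vector>

// One base-10 pass: histogram, inclusive prefix sum, then scatter from the
// right, pre-decrementing the digit's prefix entry for every value placed.
// 'dst' must already be the same size as 'src'.
void countingPass(const std::vector<unsigned>& src, std::vector<unsigned>& dst,
                  unsigned divisor) {
    std::size_t count[10] = {};
    for (unsigned v : src) ++count[(v / divisor) % 10];      // count digits
    for (int d = 1; d < 10; ++d) count[d] += count[d - 1];   // prefix sums
    for (std::size_t i = src.size(); i-- > 0; ) {            // right to left
        unsigned d = (src[i] / divisor) % 10;
        dst[--count[d]] = src[i];  // pre-decrement, exactly as described above
    }
}
```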
Thank you for the video, I really enjoyed watching it.
I would, however, like to put an emphasis on the witchcraft that lets you rebuild the sorted array from a prefix sum, as this is the part that confused me for a bit. I have finally figured it out and want to help others as well.
So, to the explanation. There are two major reasons why radix sort works. The first is that on each iteration it sorts the values by one digit. How does the prefix sum help with that? The idea is that the prefix sum at index 7, for example, is the count of numbers whose last digit is less than or equal to 7 (say this count is 4), and the prefix sum at index 8 is the count of numbers whose last digit is less than or equal to 8 (say it is 6). So, in an array sorted by last digit, indices 0, 1, 2, 3 would be occupied by numbers with last digit less than or equal to 7, which clearly means that indices 4 and 5 must contain the numbers whose last digit is 8. I really liked this trick, thank you for showing it to me.
The second ingredient of radix sort is its stability, but that is trivial compared to the witchcraft above.
Wow, this makes a lot of sense. Thanks!
Also, one thing worth noting: for input numbers with lots of digits, it takes some time and processor power to extract each digit. And for wide types like 32-bit integers, which can be as high as 4,294,967,295, that is a lot of passes over the data: 10 in total, one per decimal digit.
I think for 32-bit numbers you'd usually consider 8 bits at a time, so 256 counters and four passes would do it. Nobody would sort 32-bit numbers in decimal; it makes no sense when bytes are so easily extracted.
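A sketch of that byte-at-a-time version, for unsigned 32-bit keys only: four passes, 256 counters per pass, ping-ponging between the input and one scratch buffer:

```cpp
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// LSD radix sort in base 256: each pass is count / prefix-sum / scatter,
// and the output of one pass is the input of the next.
void radixSort256(std::vector<std::uint32_t>& a) {
    std::vector<std::uint32_t> tmp(a.size());
    std::uint32_t *src = a.data(), *dst = tmp.data();
    for (int pass = 0; pass < 4; ++pass) {
        std::size_t count[256] = {};
        for (std::size_t i = 0; i < a.size(); ++i)
            ++count[(src[i] >> (8 * pass)) & 0xFF];
        for (int d = 1; d < 256; ++d) count[d] += count[d - 1];
        for (std::size_t i = a.size(); i-- > 0; )
            dst[--count[(src[i] >> (8 * pass)) & 0xFF]] = src[i];
        std::swap(src, dst);  // this pass's output feeds the next pass
    }
    // After an even number of passes (4), the sorted data is back in 'a'.
}
```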
I did my master's thesis on this topic 6 years ago: a comparison of sorting algorithms and how parallel programming benefits them at different list sizes, running 1 thread vs 4 threads (on an Intel i5). Radix sort (with buckets) was one of them :D
You're a bloody legend mate, cheers.
Excellent as always. I absolutely love your style. So fresh and educative 👍
I'm old enough to remember how the old IBM punched card machines were used for sorting. It was a radix sort, and the "buckets" were actual bins or slots that a card would fall into if, say, column 35 were punched with a 3. You'd then gather the 10 buckets up in order and run the deck through again, sorting on column 36 (or whatever).
Watched the whole series, this stuff is amazing!
I feel like there's a better way to generate the prefix sums:
A. Set the prefix for 0 to 0.
B. Iterating n from 0 to N-1, set prefix[n+1]=prefix[n]+count[n].
Doing this yields 0-based indices to the location where the first item of each digit goes, instead of 1-based indices to where the last one goes. Now you can iterate over the original data left to right again, storing each item at the slot given by its prefix, then incrementing that prefix.
I know, both methods yield exactly the same result. This way just feels better to me.
How do you handle collisions? You have to subtract 1 anyway for each repeated element.
@@panjak323 There are no collisions. And you don't have to decrement, you have to increment. Notice I said go left to right, not his right to left.
@@jursamaj What are you talking about? Look at the next video: he is clearly decrementing counters for each number that goes into the bucket... Otherwise how else would you know where to put two elements ending with the same digit in the output array? Without decrementing the counter each time, you would put them in the same spot. Your method is totally redundant, since the only difference it would make is counter[i]-- vs --counter[i].
@@panjak323 Sorry, but you didn't understand what I said above. Creel has to fill the array backwards and pre-decrement the indices because his calculated indices point past the end of each bucket. My method points to the beginning of each bucket, so you fill the array forward and post-increment the indices (see the sketch below).
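For comparison with the backward pass sketched earlier, here is the forward variant being described, as a sketch: the exclusive prefix sum (steps A and B above) makes each entry point at the FIRST slot of its bucket, so the scatter walks left to right and post-increments. The output is identical either way:

```cpp
#include <cstddef>
#include <vector>

// Forward counting pass: exclusive prefix sum, then left-to-right scatter.
void countingPassForward(const std::vector<unsigned>& src,
                         std::vector<unsigned>& dst, unsigned divisor) {
    std::size_t count[10] = {}, prefix[10] = {};
    for (unsigned v : src) ++count[(v / divisor) % 10];
    for (int d = 0; d < 9; ++d) prefix[d + 1] = prefix[d] + count[d]; // step B
    for (unsigned v : src)
        dst[prefix[(v / divisor) % 10]++] = v;   // store, then bump the slot
}
```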
Amazing mini-series! Thanks a ton for taking the time and effort to make the beautiful presentations. Much appreciated. And, on a different note, is that a cat holding a cricket bat? I'm talking about the picture behind you.
Love your videos, man. Just found you here, great work!
Great job at explaining this. I absolutely love it.
Oh lol, I once implemented this by fiddling around with sorting algorithms and didn't know it was called radix sort; same idea, same outcome. Nice to know.
Edit: you can just allocate the counting array on the stack and do radix sort in place by swapping elements around. I'm not sure that works when starting with the lower digits, but it certainly does if you start with the highest digits. In that case, once the first pass is done, the problem is effectively localized and you only need to sort within each partition.
If you do it like this, you do not need any expensive allocations.
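One classic way to do that highest-digit-first, in-place idea is binary "radix exchange": partition on the top bit by swapping, then recurse on each half with the next bit. A rough sketch follows; note that the swapping partition is not stable, which matches the suspicion above that starting from the lower digits wouldn't work this way:

```cpp
#include <cstdint>
#include <utility>
#include <vector>

// In-place MSD binary radix sort ("radix exchange") on [lo, hi).
void radixExchange(std::vector<std::uint32_t>& a,
                   std::size_t lo, std::size_t hi, int bit) {
    if (bit < 0 || hi - lo < 2) return;
    std::size_t i = lo, j = hi;
    while (i < j) {                      // partition on the current bit
        if ((a[i] >> bit) & 1u)
            std::swap(a[i], a[--j]);     // 1-bits move to the back
        else
            ++i;
    }
    radixExchange(a, lo, i, bit - 1);    // recurse on the 0-bit partition
    radixExchange(a, i, hi, bit - 1);    // recurse on the 1-bit partition
}
// Call as: radixExchange(v, 0, v.size(), 31);
```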
Brilliant! Cheers for sharing :)
Well, strictly speaking, it's not O(n). It's O((n+b) · log k / log b), where k is the largest number you're sorting and b is the base of its integer representation. Because of this, it might be better to use a base-2 representation to minimize the b term, or base-10 or even something like base-256 to maximize the log b divisor.
So assuming that k and b are constants is not always accurate, because you might actually want to change them to make the algorithm run at maximum speed.
Right you are! The third video shows the code and performance of a base-256 Radix Sort! I agree that the performance is logarithmic when sorting data of arbitrary lengths; it's only O(n) when the length of the keys is fixed. Cheers for watching anywho, have a good one :)
I would call it O(N log R), where R is the range of the keys (log R being the number of bits). No need to change base, as that is just a constant factor that gets simplified away by the definition of Big-O.
But I agree that it's not fully honest to call it O(N).
This video was so good I just became a Patreon. Thanks mate!
Legend! Cheers for watching mate, and cheers for the support :)
Very smart algorithm, I love this stuff.