How Auto-Tune Works
Вставка
- Опубліковано 1 жов 2024
- Featuring the Gregory Brothers! Go subscribe to them: / schmoyoho
Pitch correction: it can make terrible singers sound decent, brilliant singer sound mediocre, or Cher sound like a robot. But how does it work? And is it possible to explain that without actually trying to understand Fourier transforms?
🟥 MORE FROM TOM: www.tomscott.com/
(you can find contact details and social links there too)
📰 WEEKLY NEWSLETTER with good stuff from the rest of the internet: www.tomscott.c...
❓ LATERAL, free weekly podcast: lateralcast.com/ / lateralcast
➕ TOM SCOTT PLUS: / tomscottplus
👥 THE TECHNICAL DIFFICULTIES: / techdif
Tom's accidental singing is so beautiful that someone could sneak this into pop radio airplay & no one would BAT AN EYE 💝💖💗
Thanks, folks! This was a lot of fun.
6 days ago?
How can this comment be 6 days ago?
6 days
6 days.... How? What is this sorcery
2:13 That damn fly in your room at 3 am
Where???
@@usteg.4818 he's taking about the buzzing sound.
IM WHEEZINN😂😂😂😂
A mosquito
2:36 fuooo.... hurt-hurt i hurt
Tom singing? Tom... are you a hostage? Should we send help?
Blink twice
Thrice
@Ezequiel Ciamparella MANUAL BLINKING AND BREATHING ACTIVATED
@Ezequiel Ciamparella git
Send priority help
0:40
Hello
_Hello_
*Hello*
*_HELLOOOOOO_*
*steps on mine turtle*
oh no
*boom*
Muffin: Somebody please kill me
XD
*DeSmoNd thE mOoN bEar*
THE END
Asdf movies tomska...
I like trains..............
That 's' at 'maths' at 1:30 was really sneakily placed
I'd totally not spotted that! Shifty sneaky chameleon!
That was Quick
The "i" is missing from "frequencies" in the following sentence at 1:34.
I thought my app glitched
wow! thanks, Mr. hindsight!
GUYS. WE DID IT. WE FINALLY GOT TOM TO SING ON UA-cam. BEEN WAITING FOR THIS SINCE THE "MUSIC VIDEO"
I assume then that you do not know "We flew a kite in a public space"
@@FreyasArts Was that from the "Illegal Things to do in London" video??? Completely forgot about it haha
@@Jonic_P yup 😁
Someone needs to edit the song from this video on top of that one
We flew a kite in a public place
*Waiting for a Tom Scott album full of educational music like this ...*
The educational music video would be the album opener
Yes
Still waiting for the full version of We Flew A Kite In A Public Place
Me too
Please
Awesome collaboration with awesome people! Fourier transforms ftw. The idea of pulling a single string out of a guitar chord and tuning it is crazy math. Surely that only works for certain combinations? If they were off-tune harmonics would they be hard to isolate?
Fourier transforms are not as amazing for signal processing as one might think, there are some caveats, mainly a certain amount of uncertainty between frequency resolution and time resolution. So either you know exactly which frequency is there or when it started exactly, which is what you've touched on with overtones not being exactly harmonics. IIRC it also has to do with the Heisenberg Uncertainty Principle, 3blue1brown has a video on it. Anyway, due to the limitations of the fourier transform, basically all algorithms for pitch detection (recognizing the note being played) are not frequency-based, like this video migh suggest, but time-based. This means that most algorithms don't even use the fourier transform for pitch detection (e.g. Yin or McLeod algorithm) but a process called auto-correlation. Though, admittedly, I don't know how that would work for chords, so software like Melodyne that can also recognize chords may use a combination of both time-based and frequency-based pitch detection algrithms. I haven't looked into real-time pitch shifting (the tuning part of auto-tune) but I'll guess that it is also time-based. Finally, the fourier transform is mainly used for analyzing a signal so you can see the different frequencies in an analyzer (like those on many music videos from Monstercat etc.) and for a kind of filtering (FIR filter) that is much faster in the frequency domain than in the time domain, where the other kind of filters (IIR filters) are carried out.
@Tom9358 I think 3blue1brown might have used that as an analogy, but it's been a while since I watched that (really good) video so I can't be sure.
@Tom9358 the uncertainty principle applies to any wave. It doesn't matter if it's a sound wave or the wave function of a quantum particle.
@Tom9358 Either Sixty Symbols or Computerphile has a really good video on exactly how and why it *does* apply to sound waves
@Tom9358 False as well. Nowadays "Heisenberg's uncertainty principle" and "uncertainty principle" fall together. It does not specifically refer to the quantum principle any more.
And here I thought that they were just slicing the audio into a bunch of tiny slices, then repeating or removing some of them
hi
hi
That is indeed what it sounds like. Makes one wonder if all the extra trickery is necessary.
@@ThreadBomb That is indeed what it sounds like when someone does a poor job of it. When someone does a good job of it... well, it wouldn't be trickery if you could tell, would it?
@@ThreadBomb Yes, that's because Tom is just speaking. When it's done to a talented singer, it's barely noticeable. Also, some musicians deliberately use that to get a "robotic", vocoder effect - see Cher, Kanye West, etc
"but it can't write a song, some talent's still required"
_Ghost writers have entered the chat_
_Kanye has left the chat_
Apart from those two-sentence songs that are around...
you still need a voice
@@Hoch134
Gotta pick the right sentences, though.
Pen-pineapple, apple-pen.
Edit:
nmnmnmnmnmnmnm
Cool, ghost writers... They're not actual ghosts, aren't they?
Tom, try to wear a bit darker red shirt every video until it’s black
He's been doing that for a while. Haven't you noticed? ;)
What??
I see a red shirt and I want it painted black
@@sphealingit222 no colours anymore I want them to turn black
Inb4 someone starts plotting the color of his shirt
You know, I quite like how Tom shakes up the usual format every once in a while.
Speaking of which, when is the album coming out and will it feature the hit single "We flew a kite in a public place"?
Cool, now I won't sound like someone hitting a refrigerator with a saucepan
Better than hitting a saucepan with a refrigerator ._.
@@TheSam1902 ...or getting hit by a saucepan wielding refrigerator...
@@Spikehead777 ...or really a refrigerator wielding saucepan...
Eueu
@@judebedessem4101 Just use a Nokia as a shield
0:52 I totaly expected _Never gonna give you up, never gonna let you down_ after this beat ^^
Reverse rickroll.
I was going to say the same thing
It felt familiar and my body thought it knew where it was going but I didn't know why. Now I'm less confused
exactly it's the rickroll toms
0:52 is the most awkward smile I've ever seen
Nah, this is his normal smile.
Can someone tell me to like when this comment doesn't have 69 likes anymore?
@@hlfan Already.
Looked too much like Mark Zucc
Can anyone please give me the name of the background music?
If Tom says something might as well be magic, it must be the closest thing we've got to _actually_ being magic
yep
Well as a software developer, coding is kind of like weaving spells so....
Ah, music, a magic beyond all we do here!
@@MasonC2K Seconded. Working on code *someone else wrote* is frequently like being in a Lovecraft story, studying indecipherable, madness-inducing texts, using arcane and sometimes ancient incantations, knowing what they do but not how they do it, to achieve some goal, hoping desperately there won't be any disastrous consequences (bugs) *this* time.
"Any sufficiently advanced technology is indistinguishable from magic." - Arthur C. Clarke
If music videos were edited like educational videos
okay
More like if educational videos were edited like music videos
@@joshgiannini1804 "been there, seen that"
-Okey-
This needs to be funded
Plot twist: Tom talked normally and autotune made him sing.
I believe that's what they did, except the end
@@FAB1150 r/woooosh
@@theturtle3891 You're the woooosh
@@FAB1150 then it would sound horrible, not possible
Plot twist? That's not a plot twist at all. That's what describes many singers carrers nowadays actually.
"How does Auto-Tune work?"
"Math. Here's a song about it"
I love this
600 likes no comment
@@goldbysilver36942 likes and no response
@@tadeassoucek6214 no like 1 response
Meh, I loved Tom Scott's singing voice since his debut song 'We Flew A Kite In A Public Place'.
everyone talks about we flew a kite, but everyone forgets Forget You
Now we need an autotune version of the hit single 'We flew a kite in a public place'
But you can't fix perfection
PLEASE
This
This
This
Step 1: Build a time machine
Step 2: Go to the 90's
Step 3: Start a band with the Gregory brothers
Step 4: Profit
90s *
They already did it themselves
or just sell the time machine
@@Planehazza * '90s
Well done with keeping this format fun and creative, Tom :)
1:37 This is my favorite part of the whole video especially the "isolates them"
2:36 fuooo.... hurt-hurt i hurt
Last time, we got an education video in a music video. Now, we got a music video in an education video!
What else? A music video by Tom?
I was waiting for a pitch corrected “OnE tAKe” at the end.
I love how your videos are straight to the point and not 10:01 minutes long explaining a topic that takes less then 5 minutes to present.
I mean, with how advanced Auto-Tune is, it probably takes more than ten minutes to explain it fully.
At 1:31, I love the flickering “s” on math, gotta internationalis/ze
2:36 fuooo.... hurt-hurt i hurt
Maths. Tom is English. Maths.
@@BarryChumbles And the greatest mathematicians spelled it "math". Because "mathematics" is singular, not plural.
I'm sure they do, and I'm sure you are right, but the British English vernacular is 'maths'.
@@BarryChumbles So? The majority of English speakers call it "math". Therefore, "math" is the objectively correct form.
"Wow, this sounds like schmoyoho's style."
*Had no idea Schmoyoho was another name for the "Gregory Brothers" until checking the description*
Seriously though, I'd love to see a normal Tom Scott video on Songify This.
• No nudity
• No autotune
Best artist ever.
Love you [Insert artist and cringey emojis]
Van Gogh 😍😍
An exception to the former is Bicycle Race by Queen
@@AlphaCore_ da vinky
When is this getting on Spotify?
Right after "You Can't Download This Because it's Not a Song".
I actually thought that Tom's degree was in something else. I can't believe his degree is in linguistics
I mean... Makes sense
Autotune is like windows movie maker - if you can't tell that it was used, it was used well.
Except daft punk. And glados.
Daft punk uses a vocoder, which makes a far more robotic sounding voice; exactly what they want
Daft punk uses analog vocoders
and T-pain
and Imogen Heap
Glados's best singer
"Some talents still required"
Song writing bots: imma bout to start this mans whole career
Can't wait for the video that explains how auto-tune works.
I beleive 3 blue 1 brown has something at least close to that, check out his videos on the fourier transform
It’s really hard to explain if you don’t have enough acoustical physics concepts clear in your mind. It’s actually a bit more complicated because Fourier covers a little part of the process and includes only the transposition of the formant (the root pitch) and its harmonics. The hard part is actually how changing the pitch without affecting the SOUND of the source works.
Simone Salvucci Do you know where one could find an explanation to formant shifting? I don’t know how it could work without some sort of reference
Short Answer: I don't know.
Long Answer: 🎵I DON'T KNOOOOOOOOW!🎵
2:36 fuooo.... hurt-hurt i hurt
For those who did not quite get that:
When doing pitch correction or pitch change, one can simply strech or shrink the audio wave. This will change the frequency but the length also. To make it the same length again you simply extend the audio wave. However this works only with sine waves (fundamental waves). Every other sound is made up of these fundamental waves in different volumes, frequencies and a thing called phase shift. To extract these fundamentals you can apply the Fourier transform which extracts all that information. Now you have the various fundamental waves wich you can strech and then simply add up. Because the software changes the pitch perfectly it sounds odd and has the typical kind of sound, especially if the pitch changes rapidly. But in normal converation or singing we expect a certain amount of variation and especially a small timeframe to change the pitch as humans can not change pitch instantly.
:)
I actually know a little bit about how this works! Kinda. The core concept which Tom forgot to mention is that *any* waveform, whether or not its periodic, can be represented as *the sum of a (potentially) infinite number of sine-waves* (which all have varying frequency/offset/amplitude). The Fourier Transform is the way to go from normal-land to sum-of-sines land, and the Inverse Fourier Transform does the opposite. (Don't worry about how they work, like Tom said it may as well be magic.) Once you're in sum-of-sines land, you can tweak those sine waves in specific ways, then go back to normal-land and end up with a modified sound wave. Badda bing badda boom!
I would have loved to hear him talk about that!
But Fourier transform only keeps the frequency information and loses the phase.
@@ThatSilentGuy nope, Fourier keeps everything, including phase, but 99% of spectrum analyser software that uses FFT just chose not to show the phase component. Which doesn't mean it's not there.
3blue1brown has a very simple explanation of the Fourier transform that completely demystifies it.
@@MsZiomallo the fun part is, humans can't hear the phase
just finished my exam on fourier transform / series (among other things) I really thought I was over this for today ...
It's amazing how often one learns about something they keep seeing it in places they didn't expect
So you now know a practical application for all that theory you were about to let slide into the depths of memory...
There's worse to come. Fast Fourier transforms. LaPlace transforms. And Z transforms.
What, you thought you were going to get away with just Fourier transforms?
@@vlanoikIt's called the Frequency Illusion, and it's more general than school. We often don't notice things that are all around us until after we learn about them, and suddenly they seem to be everywhere :)
@@bdf2718 "Name a more iconic trio, you can't" - every engineer
> But it can't write a song
*laughs in artificial intelligence*
Is that a GPT3 reference?
“Blue jeans, and bloody tears~”
0:46 the "and all inbetween" made me so happy :'
but why are you gae?
@@amiqai who sais I'm gea?
accent on the "yo"
hi
Now I know why Tom singing was a distraction in battle of the beer
😂😂😂
Goddamn Fourier Transform, I take a break from my signals courses and it follows me to UA-cam
I sometimes wonder if I'm actually a masochist, because I keep toying with the idea of going back and revisiting my signals coursework after a decade+ The high level concepts are so interesting, but I know that getting into the details will melt my brain all over again.
2:05 Some talent's still requireeeeeed
🤣🤣🤣🤣🤣
I know why you recommended this to me youtube, Tom sang silent night and i watched it whole so now this .
1:30 "The secret it math(s).."
2:00 Oooh yes! Thanks for dispelling that myth!
We act like autotune is this magical tool that makes everyone a great singer, but that's not true. If you're a terrible singer, autotune isn't going to fix that. It just fixes the imperfections of otherwise already brilliant singers. As much as I hate the overuse of autotune, we should cut some slack to those who do use it.
Thank you Tom for another great video. Musical clip and now this, are you hinting at a career change? I wouldn't mind.
Also everyone in the industry uses auto tune even subtly, it’s like cgi where you won’t even notice it half the time
Oh I don't know Robbie Williams gets away with it. 😜
Needless to say sometimes autotuning pieces in certain places and making it sound quite obvious can give an extremely cool sound effect, most notably for some rappers. Just autotuning speech can sometimes sound really cool.
I don't like autotune when it's used as a crutch, 'cause bad singing, even on pitch, will sound bad. But I do like it when it's _obviously_ overused, 'cause then it's just another effect, like reverb, distortion, and what-have-you.
@@Finkelfunk not to mention "Believe" by Cher.
Love that non binary inclucivity
Can we get this song on iTunes? It’s brilliant!
Downloaded and cut it for me to play when I drive tomorrow
I won't lie, I'm kinda disappointed you didn't explain how a Fourier transformation works 😜
3blue1brown channel has the best videos for that. Check it out!
It's not something that can be crammed into a short video. And also, the ways it is used are much more interesting than how to do it. Fun side fact: There were analog machines for doing some fourier transforms, before electronic computers existed.
@@GamesFromSpace Analog machines for doing some fourier transforms have existed for a very long time. If you send a signal down a spiral path at a constant linear velocity, and sum the measurements from points on adjacent turns of the spiral path in the wall in between and have each sum decay over time while being reinforced by the input, the wall will read out the frequency component at each point whose wavelength is the distance along the path between its sides, which varies continuously along the wall because it's a spiral. That's the structure of the cochlea in the ear, which people are believed to have had for hundreds of years.
@@macronencer i hit reply to mention 3b1b channel and found you already shared it 🙂
@@iabervon I'd love it if that were true. However, wikipedia disagrees with you, if I've read it correctly. It's the stiffness of the basilar membrane that determines the frequency response of different portions of the cochlea. The membrane is actually stiffer near the start of the cochlea, and higher frequencies are picked up there, lower frequencies nearer the centre. I was surprised to learn this myself, as I've never studied it properly before. Thanks for prompting a learning experience! By the way, you wrote 'hundreds'. I think you meant 'millions'. :-\
"Any sufficiently analyzed magic is indistinguishable from science!"
-Agatha Heterodyne
Videos like this is when I miss the Park Bench for behind-the-scenes stuff.
Interesting that I get this recommendation after Tom sang in his latest video on his secondary channel.
YT gave me this after Tom's latest video "Can auto-tune make me a good singer"
Finally, a Tom Scott music video that we needed!
0:40 I feel bad for the guys laying down fr at least a solid 40 seconds XD
Guys, gals and non-binary pals.
no such thing as “non binary’ 🤦♂️
who else is here after that tom Scott plus video
Luckily Tom was taught How to sing now so i expect a rerelease any day now!
this is literally schoolhouse rock and i'm here for it
How have I not seen this until now? I guess Tom's Plus channel kicked the algorithm in the backside for me.
Good job UA-cam algorithm. Well done.
The music could be in rewind 2019
I really really like the part at 1:30 - 1:46 It has been stuck in my head since you released the video.
Marvel: "Infinity War is the most ambitious crossover event in history!"
Me:
You?
"Ladys, gentlemen, and all inbetween"
I love you tom
1:02 🅱️ASS
Thank you for adding "and all in between", though a small additional comment it really made me feel valid and reminded me why you're my favourite youtuber. thank you!
This reminds me so much of like, melodysheep's symphony of science songs and I'm vibing with Tom's autotune singing
I love Symphony of Science.
So, when's your solo album dropping?
It makes sense that it works through a fourier transform.
You mean a FAST fourier transform
@@lordofthecats6397 But of course
I can't believe how good this video is. It's got everything: good explanation, good music, good science, good visuals, good humour
How did you fit all of that into 2 min 40 ??
I'm still waiting for the full version of "We Flew a Kite in a Public Place".
"and everyone in between"
Ikr ;v;
This guy reminds me of hbomberguy for some reason.
Well he's blond and British so basically the same person
Gazelle true that
Who now?
@@gazelle1467 they fused after they'd met eachother
@@gazelle1467 His voice is more similar to Ollie from Philosophy Tube.
I love the part where it goes “splits them up, identified them, isolates them”
ikr it's like real music
Missed the fourth Brother, but Tom did a good job of stepping in
Fourier Transform is probably one of my favorite topics. It's used for an insane amount of stuff from audio to graphics. Its a very popular way to get real time oceans with believable variance and was used in the days of SNES and before to synchronize sound waves allowing three sound bites to seemingly last forever.
I was gonna comment “That note wasn’t bad actually...” and then the dude said if for me🤣🤣
daddy tom spitting pure fire
I believe there was a park bench where you said we would never hear you sing. What happened to that?
Nah that was Matt. There are several videos of Tom singing.
Entertaining but also horrifying! Next stop UK Top 40.
This just might be my favorite Tom Scott video.
A Fourier transform is basically like a bunch of stacked filters. Sort of like those coin sorters where the small dimes fall through the hole, while the quarters slide over top. What you end up with is a set of bins that each contain the "amount" of that corresponding pure frequency that was in the original audio clip. The width or "selectivity" of each filter/bin can be changed, but that effects the frequency and temporal resolution of the resulting transform. For example, if you have an audio clip that is sampled at 2 kHz, and your sample width is 100 samples, you'll get 50 bins at 40 Hz resolution, but it will take 50 ms to collect all the bins. If you use 10 samples you'll get 5 bins, each with a width of 400 Hz, but it will only take 5 ms to collect the bins.
I think you've officially won UA-cam, or at least UA-cam crossovers.
This is the crossover I didn't know I needed until now
0:52 Why does Michael look so much like Jay Foreman?
Why do we 519k views on this?
Why don't we have like 50 million? Send this stuff to your friends ...NOW!
0:51 was it rick roll?
I can't believe this was only 2 and a half minutes long! This was awesome!
No, you are awesome!
(maybe)
@@SuviTuuliAllan awww uwu
Your avatar or whatever it's called is lovely.
It would seem Tom has finally discovered Pogo.
0:51 I thought you rick rolled me..
Now ChatGPT can write a song so no talent is requaired
This is the last thing I expected but I'm glad it's what I got.
Also, I noticed that "and all in between" part and I appreciate it a lot :3
I noticed it too :D such a small thing, but it can mean so much to some people
I find it kind of annoying when people call it Autotune when they're actually using Melodyne or even just a vocoder. They don't do the same thing.
Eh, same as any other generic trademark.
It's the same as calling every tissue kleenex
I always figured the process itself was called "Autotuning" as that basically describes what the process is. It's tuning the audio automatically.
What other word would you use for it?
Melodyne is very similar to Autotune. However, a vocoder is extremely different from both of them, in my opinion. It's not a pitch-correction tool, it's a spectral analyser and VCA. Big difference in technology, big difference in sound.
The technical term is "fellating a broken robot".
0:45 Mine Turtle: Hello!
💥💥💥
TomSkatt
Auto-tune on records is kinda cheating, but no more so that getting a singer to sing the same line 10 times until they get it right, which was how they used to do it.
Now the video "If Educational Videos Were Recorded Like Music Videos" makes a hole lot more sense
Plot Twist: Tom Scott is Travis Scott's brother. Both use autotune.