I made an AI listen to 10,000 hours of DUBSTEP

Поділитися
Вставка
  • Опубліковано 10 вер 2024

КОМЕНТАРІ • 724

  • @faselblaDer3te
    @faselblaDer3te 4 роки тому +2151

    Humans: "AI will enslave all of humanity!"
    Meanwhile, AI: "Skrillex is a jazz"

    • @violahero4life
      @violahero4life 4 роки тому +4

      @@w花b Lmfao

    • @konraddomanski242
      @konraddomanski242 4 роки тому +8

      Missing first half of your comment being
      "2025: AI destroyed the humanity"
      2020:"

    • @Lol-os2bo
      @Lol-os2bo 4 роки тому +1

      lol

    • @aiexzs
      @aiexzs 4 роки тому

      @@Lol-os2bo lol

    • @aerius4479
      @aerius4479 4 роки тому

      hey sorry for irrelevance but do you know how to recreate a sound similiar to the one at 0:09 in this song - ua-cam.com/video/-VXIcgf8biU/v-deo.html&ab_channel=IDJVideos.TV thankss

  • @mikedevey2748
    @mikedevey2748 4 роки тому +1505

    "i made a DAW in Excel but thats it...."

    • @ProdBerGotti
      @ProdBerGotti 4 роки тому +55

      lmaooo the classic undersell

    • @matthewhodge4215
      @matthewhodge4215 4 роки тому +6

      I am the 1000th like
      Good day

    • @Tommy3D
      @Tommy3D 3 роки тому

      Yeah man it’s not like it’s a huge accomplishment. I can’t even use patcher to make a basic effect

    • @Dujjack
      @Dujjack 6 місяців тому

      *taking notes*
      - being Humble while doing something impressive is impressive .

  • @excelli3971
    @excelli3971 4 роки тому +1375

    AI listens to Dylan's music for 24 hours and replaces him

    • @MzHeather-904
      @MzHeather-904 4 роки тому +3

      🤣

    • @XeMusic
      @XeMusic 4 роки тому +2

      wait I think I know you??

    • @lime2226
      @lime2226 4 роки тому +2

      More like for eternity because he makes all kinds of music

    • @nyranstanton203
      @nyranstanton203 4 роки тому +1

      and replaces his channel

    • @disastermidi1990
      @disastermidi1990 4 роки тому +1

      I would pay to have an AI do that to my music

  • @user-ob9tp3jf1j
    @user-ob9tp3jf1j 4 роки тому +903

    AI: This kick is a kick
    Also AI: Skrillex is a dubstep

    • @Zer0Spinn
      @Zer0Spinn 4 роки тому +12

      Scary, I know.

    • @broland115
      @broland115 4 роки тому +5

      this debate is closed now. :))

    • @pipoper101
      @pipoper101 4 роки тому +1

      @@broland115 there was never a debate bruv, just a lil' bunch of retards that said shit like "skrillex is brostep" (which is not actually even a genre of music or anything) without actually cringing

    • @NoNamer123456789
      @NoNamer123456789 4 роки тому +1

      Why isn't Dubstep called Wubstep?

    • @leugim8435
      @leugim8435 4 роки тому

      Hmmm yes the floor is made out of floor

  • @Fuego.017
    @Fuego.017 4 роки тому +336

    My mans just threw a whole college study in 14 minutes

  • @DinkyPattern1
    @DinkyPattern1 4 роки тому +522

    Skrillex is Jazz
    Skrillex: opening Serum

  • @jej_gay
    @jej_gay 4 роки тому +1394

    "i'm not a programmer"
    mfw you're more passionate about code than 99% of compsci graduates
    u love to see it

    • @itsPonkulz
      @itsPonkulz 4 роки тому +45

      99% more skilled

    • @wakanzeee46
      @wakanzeee46 4 роки тому +34

      mi then you’ve never met a compsci graduate. He is more passionate but certainly not twice more skilled than people who does this for a living.

    • @wakanzeee46
      @wakanzeee46 4 роки тому +11

      mi then you’ve never met a compsci graduate. He is more passionate but certainly not twice more skilled than people who does this for a living.

    • @Nerevaar
      @Nerevaar 4 роки тому +30

      @@wakanzeee46 I am a programmer for a living and have been for 6 years and he is at least more skilled than I am

    • @itsPonkulz
      @itsPonkulz 4 роки тому +4

      @@wakanzeee46 You sure?

  • @Mansch
    @Mansch 4 роки тому +1123

    _SKRILLEX IS A DUBSTEP_

  • @JeffreyMMVIII
    @JeffreyMMVIII 4 роки тому +1498

    thumbs up if you thought it was gonna make dubstep

    • @Dev1nci
      @Dev1nci 4 роки тому +65

      Yeah I’m a little disappointed. This was more like teaching a computer what dubstep is and then not using it to produce dubstep and knowing that it will also not appreciate the education 😂

    • @IQuick143cz
      @IQuick143cz 4 роки тому +12

      To be fair generative tasks like making music are really difficult. You might be interested in the OpenAI jukebox openai.com/blog/jukebox/ which does a pretty good job. But it still has a lot of audio artifacts even if it's made by really smart people who know a lot more about AI.

    • @d-rockanomaly9243
      @d-rockanomaly9243 4 роки тому +2

      @@IQuick143cz **edit** I just realized that your link is something similar to what I'm describing.**
      - Check out the video AI makes a Nirvana song. It was kind of cool. Buuut, it was basically just a mish mash of rearranged Nirvana riffs, pitched up and down to match and put in a song like structure. Like one moment it would vaguely sound like In Bloom, the next part would sound vaguely like Polly. But the lyrics were strangely Cobain-esque. I think AI will be good at recreating the style of established music one day, but... I don't think it will be able to create anything original AND good. It won't be able to write anything of enough quality that it's actually worth listening to, or be able to replace artists. To do that it would have to have the creative, original thought programmed into it, which ultimately is a human. At least, it won't be able to do that anytime remotely soon.

    • @chromebookacer7289
      @chromebookacer7289 4 роки тому

      @@d-rockanomaly9243 please God please please be right i am sooooo scared

    • @markovcd
      @markovcd 4 роки тому

      Check out OpenAI Jukebox. It does just that.

  • @awkwerddubz
    @awkwerddubz 4 роки тому +260

    Dylan Tallchief tortures a robot for who knows how long

  • @AnimalzyNL
    @AnimalzyNL 4 роки тому +146

    Damn bro, clicked on this video to watch you torture a robot, and now you're making me learn stuff? I didn't agree to this 😤

    • @DylanTallchief
      @DylanTallchief  4 роки тому +55

      the viewers were the ones actually getting tortured the whole time!

    • @AnimalzyNL
      @AnimalzyNL 4 роки тому +15

      @@DylanTallchief Shit, we got played lol

  • @TalesGrimm
    @TalesGrimm 4 роки тому +28

    RIP to all the people who thought we were gonna hear the AI make its own dubstep at the end

  • @Smetvrees
    @Smetvrees 4 роки тому +267

    Honestly I fully expected this to be in excel

  • @TGKawikachu
    @TGKawikachu 4 роки тому +48

    everybody gansta until dylan tallchief tricks you into taking a math lesson

  • @GhostSamaritan
    @GhostSamaritan 4 роки тому +159

    School: Here's a Summer break!
    UA-cam: Yes, but actually no.

  • @Maartwo
    @Maartwo 4 роки тому +148

    In 25 years some revisionist is going to find this video and charge you with torturing and exploiting AI and you'll be cancelled.

    • @perigee9281
      @perigee9281 4 роки тому

      Lmao

    • @TallicaMan1986
      @TallicaMan1986 4 роки тому +1

      Rokos Basilisk. He's already committed his crime as did a lot of us.

    • @khem3275
      @khem3275 3 роки тому

      @@perigee9281 Lmao

  • @giantneuralnetwork
    @giantneuralnetwork 3 роки тому +6

    Thanks for the shoutout! 100% agree that knowing the math is good but not required to make something amazing. Also agreed you should up the model complexity. Try adding more layers, neurons per layer, and different activation functions (ReLU/tanh/sigmoid). Nice work! Subscribed.

  • @cyxo_o
    @cyxo_o 4 роки тому +58

    Next time, I'll try telling my math teacher that "it does some equations"

  • @jay50lane
    @jay50lane 4 роки тому +160

    Elon will wake XÆA-12 up with dubstep

    • @isaygg.butitwasntgg.itwasb4958
      @isaygg.butitwasntgg.itwasb4958 4 роки тому +13

      Funfact: the "a" in his/her name stands for the dubstep track archangel by burial.

    • @jay50lane
      @jay50lane 4 роки тому +4

      @@isaygg.butitwasntgg.itwasb4958 that makes it even better

    • @marcelkonnerth2634
      @marcelkonnerth2634 4 роки тому +1

      He' actually liking some content of Dubtep producers on different platforms.

    • @Keidon1337
      @Keidon1337 4 роки тому +1

      Waking up and walking to school with bangarang playing in the background.

    • @hansenchrisw
      @hansenchrisw 4 роки тому +1

      XÆA-12 is a dubstep

  • @Mokey
    @Mokey 4 роки тому +55

    What i learned today: Music is just waves that go like wo-wo-wowowowo up and down. up and down. But just very many of them together. Nice :)

    • @goos_bumps
      @goos_bumps 4 роки тому +10

      The tighter the waves, the higher the pitch is

    • @josephstowell1995
      @josephstowell1995 4 роки тому +8

      everything you have ever heard is just a collection of sine waves

    • @kneecapp9186
      @kneecapp9186 4 роки тому +4

      Fancy seeing a grapple god here, just wanted to say thanks for your Apex series of videos. They made me passionate about the movement system in that game

    • @Chris-cf2kp
      @Chris-cf2kp 4 роки тому +2

      Ha ha sound go brrr

  • @mrpay4444AYypIgEDLbwfZm4kjaQk
    @mrpay4444AYypIgEDLbwfZm4kjaQk 4 роки тому +270

    In the future there will be no human voice actors.

    • @exosproudmamabear558
      @exosproudmamabear558 4 роки тому +7

      Not soon enough

    • @Gabriel-mw5ro
      @Gabriel-mw5ro 4 роки тому +41

      That'd be cool. Imagine developing your own game or animation, designing each character's voice and not have to record anything

    • @gewoonpatatmayonais
      @gewoonpatatmayonais 4 роки тому +11

      @@Gabriel-mw5ro That is already possible actually

    • @socially_mute7086
      @socially_mute7086 4 роки тому +15

      go away you postshumanist technocrat
      this post was made by anprim tribe

    • @fokoji666
      @fokoji666 4 роки тому +4

      That's scary

  • @djwillcaine
    @djwillcaine 4 роки тому +7

    I recently wrote my dissertation on BPM analysis using neural networks and used a lot of similar techniques. Dylan did a surprisingly good job of understanding and explaining everything here. Good job!
    Some improvements you could make:
    - You definitely overfit by training for so long, that was a lot of wasted time. A common technique is to use an early stopping callback to stop training after a certain number of epochs without improvement.
    - In my research I found that longer clips worked better, although that might not be the case for this network it could be worth experimenting with 5 and 10 second clips.
    - There is very little publicly available training data for this stuff and so creating your own could help expand the dataset. I did this by exporting my rekordbox collection to XML format and parsing it with a python script to produce training data.
    - There's some very interesting research that suggests using 1D convolutional layers oriented along the frequency axis could drastically improve a model of this type.

  • @mmmoshi_
    @mmmoshi_ 4 роки тому +131

    when u think that playing drums in DOOM is too much , he gives u this

  • @MOONBOY
    @MOONBOY 4 роки тому +3

    IM SO READY TO HEAR AI MADE DUBSTEP OMG

  • @ronanharris8216
    @ronanharris8216 4 роки тому +67

    EDIT: Thanks to @@UCFpUx-4O2zgsOM0Wp0HRTqw for the help.
    I would guess the problem is not in the model. Considering Spleeter actually does better at processing non-electronic music, it seems that those songs tend to be harder to nail at close to perfect accuracy. Probably because the genre of electronic music itself is full of external influences. Clap samples can be especially hard, since they can have unique characteristics in certain genres, which can make them sound close to snares in other genres.
    Note : I'm just a regular CS student.

    • @braznem
      @braznem 4 роки тому

      english intensifies

    • @ronanharris8216
      @ronanharris8216 4 роки тому +2

      @@braznem Well I'm sorry if I couldn't have write it better. Any suggestions?

    • @braznem
      @braznem 4 роки тому

      @@ronanharris8216 too long, got other things to do D:, sorry.

    • @dreadformer
      @dreadformer 4 роки тому +1

      @@ronanharris8216 I would say it'd be best if it was written like: "I would guess the problem is not in the model. Considering Spleeter actually does better at processing non-electronic music, it seems that those songs tend to be harder to nail at close to perfect accuracy. Probably because the genre of electronic music itself is full of external influences. Clap samples can be especially hard, since they can have unique characteristics in certain genres, which can make them sound close to snares in other genres.
      Note : I'm just a regular CS student." i guess

  • @StarOnCheek
    @StarOnCheek 4 роки тому +44

    That ProQ visualizer really justifies the video being 60fps

  • @MrBearProduction
    @MrBearProduction 4 роки тому +16

    The interesting thing about MFCC and cepstrums, quefrencies and all of that mixed up letters jazz, is that the transformation that MFCC makes with the Mel filter-banks brings the sound in this specific domain that isn't frequency nor time domain. And that's why they decided to call them funky names. Scientists are fun aren't they? xD

  • @bleach.princess
    @bleach.princess 4 роки тому +1

    YESS I love how in depth you go with your videos. They're well edited and fun to watch but I'm also actually learning a ton of new shit at the same time.

  • @TheWebgecko
    @TheWebgecko 4 роки тому

    Thank you so much for this!! :) great video. Even with a basic understanding of generic NN’s it’s intimidating to try to apply it to music imo

  • @One_Deeper_
    @One_Deeper_ 4 роки тому +1

    This is dope, super interesting stuff. Not to many people are making content like this and I dig it

  • @ecabanero5776
    @ecabanero5776 4 роки тому +122

    The problem sometimes in neural networks it’s the overfit. So be careful with the data and validation sets.
    1 - Overfitting happens because sometimes the train value and the test value match too much.
    On Neural Networks if you select a high epoch values could give you a bad prediction. Maybe not on the first ones but on the last ones it will be a mess.
    It all depends on the dataset and how much you train your network.
    REMEMBER. MORE COMPLEX DOESN’T MEAN MORE EFFICIENCY
    2 - I am going to kill Dylan. Wtf you explain on the cepstrum part bro. It’s not even close. The cepstrum gives you on time the repetition period of the signal. And the first part always represents the Harmonic response.
    It’s on milliseconds because it’s the absolute inverse of Fourier Transform =Time
    (Fourier Transform = Frequency)
    PD: Thanks for the intention of the video. Telecommunications Engineers appreciate the effort.
    If you want to applied more projects like this visit AudiasLab of University Autónoma Madrid, EPS.

    • @josephstowell1995
      @josephstowell1995 4 роки тому +3

      regularization!

    • @shapshooter7769
      @shapshooter7769 3 роки тому +1

      I thought the inverse of a FFT is an IFFT, and that cepstrum is an FFT of a spectrum.

    • @ecabanero5776
      @ecabanero5776 3 роки тому +1

      dagambler999 as you say the inverse of FFT is IFFT and that gives you the signal on time.
      Signal on time
      FFT(signal) = signal on frequency
      IFFT(signal on freq)= signal on time.
      But cepstrum is something apart is a representation of periodicity and the harmonics on the first part.
      Hope you understand better :)

  • @comedyclub333
    @comedyclub333 4 роки тому +7

    The quefrency is in ms because a Fourier transformation transforms time (s) to frequency (1/s = Hz). So another Fourier transformation (to the cepstrum) would transform the new "time" domain (which is in 1/s) into the quefrency domain (1/[1/s] = s).

    • @DylanTallchief
      @DylanTallchief  4 роки тому

      Ah yes, I think I understand. Thanks!

    • @tune_m
      @tune_m 4 роки тому

      I think this might be wrong. Correct me if you find a better explanation. Remember that the inverse Fourier transform (F^-1) is not equal to the Fourier transform (F) itself. Therefore, the second applied Fourier transform does not transform the frequency signal back to the original time domain.
      Quoting Wikipedia: "The independent variable of a cepstral graph is called the quefrency. The quefrency is a measure of time, though not in the sense of a signal in the time domain."

    • @comedyclub333
      @comedyclub333 4 роки тому +1

      @@tune_m that's what I said. The inverse Fourier transformtion is of the same type but not exactly the same as the Fourier transformation. It is the time domain as in "same unit of time" not as in "same meaning like time".

  • @SuperPolentaman
    @SuperPolentaman 4 роки тому

    Your visual way of explaining the cepstrum is actually amazing. I finally understood it intuitively after two Comp Sci university courses where I didn't get it.

  • @toyuyn
    @toyuyn 4 роки тому +16

    It is not uncommon for validation accuracy to go higher than training accuracy, because the training process is random.
    The model might have randomly found a configuration that happens to perform better on the validation set.
    A sub-100% training accuracy is not necessarily a bad thing, maybe your data is not perfectly clean (like the skrillex example you mention at 12:43).
    Also, getting the training accuracy higher is not necessarily a good thing because you might be forcing the model to overfit (which you also mention at 13:28), and you might find the validation accuracy gets even worse.

  • @octopath480
    @octopath480 4 роки тому +12

    AI: Welcome back to a new video *sips tea*

  • @cakes43
    @cakes43 4 роки тому +25

    How did you train a 2-d linear classifier in excel ?? YOu're like an excel wizard-man.

    • @josephstowell1995
      @josephstowell1995 4 роки тому

      im doing this now lol

    • @oligarchymusic
      @oligarchymusic 4 роки тому +1

      Well, Excel is scriptable in VBA and VBA is turing-complete, so, technically, this is possible. However, it will probably be too slow for anything with a large feature space. However, simpler algorithms like K-nearest-neighbour or the C4.5 decision tree algorithm would probably work.

  • @OrangeC7
    @OrangeC7 4 роки тому +4

    Now I want to hear what Hardstyle Jazz Trance would sound like

  • @goos_bumps
    @goos_bumps 4 роки тому +2

    You know, this sparked my interest into A.I research or at least the basics of it, thank you

  • @daveymwas8408
    @daveymwas8408 4 роки тому +2

    but seriously this thing was vey educative.I HAD TO LITERALLY PAUSE EVERY 2 SECONDS TO COMPREHEND WHAT Dylan was really talking about and i have to say it was worth it.It's amazing how you can turn hours of learning into minutes.

  • @ademkarakaya834
    @ademkarakaya834 4 роки тому +1

    It’s actually funny how you’ve stumbled upon AI and signal processing. I’ve been messing around on FL since I was 15 and once I started an engineering course in uni I quickly got into signal processing. Towards the end of the course stream they naturally lead onto the introduction of ML. It’s really fun, and I don’t think I would have gotten into it if I hadn’t invested time into music production.

  • @cyberhighdiver
    @cyberhighdiver 4 роки тому

    You are a very entertaining person my dude..thanks for the effort you put into your ideas and videos.

  • @davidmatthew7470
    @davidmatthew7470 4 роки тому +1

    Wow, that is some serious dedication. Perhaps one day you will be able to create a style transfer application, to turn a song into a different genre, or even automatically master a song.

  • @rudolphdandelion6840
    @rudolphdandelion6840 4 роки тому +2

    Can't wait for your AI to produce a dubstep track!

  • @AlexBerish
    @AlexBerish 4 роки тому +2

    My boy Dylan out here tricking people into loving maths

  • @MK-Masters
    @MK-Masters 4 роки тому +5

    Probably the first time I've seen the words "Skrillex Is a Jazz"

  • @dscreations1534
    @dscreations1534 4 роки тому +4

    Me : Hey Dylan you create music, code or AI ?
    Dylan : *YES*

  • @kfshradio
    @kfshradio 4 роки тому +1

    I'm a PhD student studying auditory neuroscience and was not expecting to get a lecture on MFCCs from this channel 😂
    but you did a great job explaining and yes i agree the naming convention on cepstrum/etc is dumb

  • @woofcaptain8212
    @woofcaptain8212 4 роки тому +1

    This would be really useful in the rythm game I've been developing

  • @vincentpollack
    @vincentpollack 4 роки тому

    Really interesting. This video deserves more views. Seems like a lot of work was put into this

  • @jaimerojas6578
    @jaimerojas6578 4 роки тому +2

    I've been hesitant into getting into AI because it seems to hard, but damn it I going in, thanks Dylan

  • @rs10269
    @rs10269 4 роки тому +23

    AI

  • @sosavoa
    @sosavoa 4 роки тому +1

    this is EXACTLY WHAT I WAS LOOKING FOR

  • @user-yk7mp8yp8x
    @user-yk7mp8yp8x 4 роки тому +30

    haha ai go *brrrr*

    • @syndice
      @syndice 4 роки тому +4

      NO! You can't just feed AI 1000 epochs of dubstep!

    • @jasonmiller6181
      @jasonmiller6181 4 роки тому +1

      I laughed harder at this than I should've. xD

  • @IndoorBradster
    @IndoorBradster 3 роки тому +1

    I think a part of the accuracy problem stems from sample diversity, but another factor that would prevent getting higher accuracy would be the fact that music has periods of rest, when sound is absent. If an interval occurs during a period of rest, it would probably mess up analysis.

  • @Norbert.Gardonyi
    @Norbert.Gardonyi 4 роки тому

    This video is what the internet should look like. Funny, personal, but still informative, full of references, well-edited video- and audio-wise, well-explained, and not full of sponsors, VPNs and Raids shadow legends. Thank you Dylan.

  • @akzentprod
    @akzentprod 4 роки тому

    I love this type of stuff, keep it up!

  • @atomic7680
    @atomic7680 4 роки тому +1

    For a sample classifier you could use check out computer vision techniques, such as the OpenL3 model. It directly uses the log-mel spectrogram images and is pretty powerful! This particular network looks at spectrograms of 1s though, so for songs probably use MFCC and other features extracted from the song (the things shazam uses)

    • @atomic7680
      @atomic7680 4 роки тому +1

      Also, training for more epochs does not necessarily help :P better would be to actually collect more data :)

    • @atomic7680
      @atomic7680 4 роки тому +1

      As to why the validation performance doesn't necessarily go higher if training increases: this is something called overfitting. If your network has too many parameters, it learns to fit the training data so well, that having a slightly different input (your validation set) messes it up.
      This is the problem of generalization. You want to prevent overfitting by using techniques as dropout and weight normalization. Also make sure that your datasets have a similar distribution of classes. If the network sees 90% dubstep and 10% hardstyle during training, and you validate it on 10% dubstep and 90% hardstyle, it will for sure not work as well as if both were 50/50. The weights have been tuned more specifically to the dubstep features

  • @SpitfyaUK
    @SpitfyaUK 4 роки тому +1

    This was pretty cool actually. Imagine feeding an AI a bunch of Skrillex tracks and getting it to auto generate random Skrillex sounding dubstep. That's an idea I've had for a while but am too stupid to try :C

  • @TripleTSingt
    @TripleTSingt 4 роки тому +2

    One thing I noticed: Some samples in my sample libraries are kinda wrongly named… there are claps that sound more like snares and vice versa, and some closed hi hats have longer tails than open ones. So, a program training on these samples might get them wrong because of this weird naming and not because the program is bad.

    • @DylanTallchief
      @DylanTallchief  4 роки тому

      Yes I have that exact same thing! Especially with some of the clap/snare vengeance samples. While I could have gone through each sample individually to make a better split, I also didn't use that many samples (about 350 samples for each class). If I had way more samples to train with, it would probably do a lot better too.

  • @Pandahydra
    @Pandahydra 4 роки тому +21

    Dylan is like the Thomas Edison of EDM at this point.
    Edit : I mean a better version.

    • @tortture3519
      @tortture3519 4 роки тому +2

      Stealing a bunch of stuff from others and claiming it's his?

  • @Firequell
    @Firequell 4 роки тому +1

    This was especially interesting as I wrote my Bachelor thesis in computer science on this exact subject; genre classification of music. But we used other models than neural networks. Also we tried a bunch more features than MFCC.

  • @threeMetreJim
    @threeMetreJim 4 роки тому

    Spectrum analysis is the best way to go, how ears work (evolution usually homes in on the simplest method). Can't wait to see you do an automatic music decomposer / re-composer (with instrument type determination?). If you can do it in excel, I'd be gobsmacked. As for AI replacing humans, not any time soon, AI has no soul (yet).

  • @bcoda
    @bcoda 4 роки тому

    You and Sebastian are two of my favorite tubers
    funny that you found each other

  • @Kortrexx
    @Kortrexx 4 роки тому +1

    Alright now we need an AI that can create random professional dubstep.

  • @ayha4115
    @ayha4115 4 роки тому +1

    This guy learned programming for music to get ideas for youtube!

  • @Hamza-xm3ro
    @Hamza-xm3ro 4 роки тому +1

    i think the next step would be to train a model on learning music theroy : scales,modes,chords etc...and see what the results would be.This might generate interesting chord progressions or even come up with it's own scales and modes.That could be the starting point,this can be expended to other tuning systems and so on

  • @cyborganic
    @cyborganic 4 роки тому

    this is so cool. excited to learn more about this

  • @MatthewKirchhof
    @MatthewKirchhof 4 роки тому

    If you haven't looked into them already, convolutional neural networks are often used for continuous or variable sized input data. These networks are made up of kernels (or windows) that slide across your input domain and output some value. Often times, these windows tend to learn patterns within your data (say you ran a CNN over numerical images, one window might have learned to recognize vertical lines and output a large value when it sees one, indicative of the number 1,4,7, etc...). The models typically contain many of these convolutional windows/kernels which learn different things.
    In your case, librosa has processed your wav files into the beautiful graph you show at 6:50. This is essentially a big picture that you can input into your CNN.
    Loved watching your video, you are doing some cool stuff. Feel free to shoot me a message if you'd like to talk a bit more about AI and music! I work at the company behind rave.dj, an AI that tries to mash up any two songs of your choice!

  • @realSethMeyers
    @realSethMeyers 4 роки тому

    Dylan you are a legend. This was highly entertaining.

  • @Revenant3D
    @Revenant3D 4 роки тому +1

    Nice, now Excision has a way to make his concerts even more crazy

  • @linkedwinters
    @linkedwinters 4 роки тому +1

    Damn, I was hoping the ai would generate a dubstep track based on what it had heard.

  • @woofcaptain8212
    @woofcaptain8212 4 роки тому +1

    As a computer science enthusiast and a producer this video is amazing.

  • @fray4life407
    @fray4life407 3 роки тому +1

    What a great video to watch with dyscalculia

  • @frostyfingers9282
    @frostyfingers9282 4 роки тому +1

    I totally thought this was going to be about getting an AI to generate new dubstep tracks based on its learnings and I'm slightly disappointed that it isn't

    • @DylanTallchief
      @DylanTallchief  4 роки тому +1

      Not yet, but this is the first step towards that goal

  • @GarviHere
    @GarviHere 4 роки тому

    I have been procrastinating about this music player that uses machine learning and haven't.even tried yet, and here you are making music and doing this stuff...

  • @StoneCut
    @StoneCut 4 роки тому

    Awesome. I‘m interested in more information about this.

  • @Threadox
    @Threadox 4 роки тому +3

    As a CS student this is awesome

  • @justrick3613
    @justrick3613 4 роки тому

    Man, you did really well for a beginner! There were a few things that you could have done to improve the model. For example, make the convolutions span the whole frequency domain. (Kernel of number of frequencies by 3). Some one at Spotify made an article about it.

  • @d-rockanomaly9243
    @d-rockanomaly9243 4 роки тому +1

    You're totally gonna get an A+ on this homework assignment.

  • @user-hv8xc7wc5b
    @user-hv8xc7wc5b 4 роки тому

    Some recommendations:
    Jukebox from OpenAI, 2020 - AI generates music from a given style & even lyric.
    DDSP from Magenta (Google), 2020 - They trained additive synth to sound like a violin, but you can also do other kinds of stuff such as speech synthesis, dereverberation & reverberation transfer, timbre transfer ... etc.
    FlowSynth from IRCAM, 2019 - Neural network was trained to learn the sound of u-he Diva synth and make macro parameters that are 'perceptually continuous.' You can even download and try it with their M4L devices if you have Diva installed.

  • @MCMACHINE
    @MCMACHINE 4 роки тому +1

    comment for the algorithm, this video is incredible

  • @BernardFrey
    @BernardFrey 4 роки тому

    so you spent a year an a third part of it making an AI to listen dubstep, STONKS

  • @egglion7931
    @egglion7931 4 роки тому +1

    “Hard style, jazz, and trance. The perfect combination.” FINALLY. SOMEONE ELSE UNDERSTANDS.

  • @jackhammer8061
    @jackhammer8061 4 роки тому +1

    3:15 That is so intriguing. It’s basically turned technology is art. ...I wonder if the robot eats mushrooms if the geometric patterns it sees is actually just a reflection of his own neuro-network haha

  • @vegetablescankill
    @vegetablescankill 4 роки тому

    this was a banger video Dylan

  • @HerrKichern
    @HerrKichern 4 роки тому

    increasing network complexity will increase your training accuracy, but it will also lead to memorization, and may not translate to better validation performance. Finding the right network topology is difficult, and we really don't have a better way to go about it than guess and check. If you train with neuroevolution instead of SGD using a technique such as NEAT, you CAN train topology at the same time as weights, which is why NEAT has seen an increase in popularity lately. You might also consider looking at regularization techniques such as dropout, this can improve your results.

  • @Twtching
    @Twtching 4 роки тому +9

    Spotify and other digital platforms will eventually be able to use AI to sort songs into genres. Because they’re staff are ass at it

    • @TUTOSANDROIDtutorialesymas
      @TUTOSANDROIDtutorialesymas 4 роки тому

      They're

    • @jamal_rahman
      @jamal_rahman 4 роки тому +3

      My friend, they already do.
      Spotify, Google, and Apple are the leading researchers into AI for music/audio.
      Top researchers & professors get nabbed by those companies to develop their AI.

    • @Twtching
      @Twtching 4 роки тому

      I forgot to mention I live in the 90s fml where have I been

  • @mrmeowbeats7842
    @mrmeowbeats7842 4 роки тому

    12:32 ah yes, my favorite jazz musician: skrillex

  • @PierZeNation
    @PierZeNation 4 роки тому +1

    I would like to see this taken more in depth to where AI will learn to produce a track or even just a beat or drop

  • @neu3478
    @neu3478 4 роки тому +6

    2:42 you watched a 3blue1brown video and didn't mention it 😔😔
    He's so amazing people should check him out

    • @DylanTallchief
      @DylanTallchief  4 роки тому +3

      true! although his video on neural network didn't really actually help me in the beginning as much as the others.
      but they are beautifully made and good to recap when you kinda understand a bit more

    • @josephstowell1995
      @josephstowell1995 4 роки тому

      @@DylanTallchief I recommend andrew ng's!

  • @garymedia7551
    @garymedia7551 4 роки тому

    This is the most awesomely nerdy thing I've seen all day. I thank you

  • @emrazum
    @emrazum 4 роки тому +2

    I want more about how you actually code stuff like this, a tutorial from you is better than a tutorial from AI phd's

  • @definitelynotmatthw
    @definitelynotmatthw 4 роки тому +1

    Unrelated story, a year ago maybe I was listening to dubstep with a crappy EarPods so the sound leaks out and my mom heard it and was like “are you listening to house music” damn mom, it was Excision

  • @AZaqZaqProduction
    @AZaqZaqProduction 4 роки тому +2

    I was hoping the video would include AI-generated dubstep, kind of like the OpenAI Jukebox project.

  • @TheAkiller101
    @TheAkiller101 3 роки тому

    "Im not a great coder" , creates a DAW in excel , bro your one of the best

  • @merseyviking
    @merseyviking 4 роки тому +1

    Cepstrum: Yo dawg, I heard you liked Fourier transforms. So I did a Fourier transform on your Fourier transform.

  • @ICODE72
    @ICODE72 4 роки тому

    The difference between a clap and a snare is rather subjective as our perception of it is based on our knowledge of how the sounds are (naturally) made. It makes sense as the aI is trained based on only the alien examples given and no context that could easily be fed to the algorithm.

  • @0pvo0
    @0pvo0 3 роки тому +1

    I am not a gifted coder:…..
    Dylan: Makes a whole site generating music patterns

    • @DylanTallchief
      @DylanTallchief  3 роки тому +1

      also dylan: uncomments a comment which breaks the site for 20 minutes

  • @shapshooter7769
    @shapshooter7769 3 роки тому

    Ahh.. gradient descent and back propagation. Had to do those manually in MATLAB.

  • @ChrisFarfan
    @ChrisFarfan 4 роки тому +1

    To increase accuracy you could increase the length of each sample, say to 4/8 seconds, you'd probably need more training material though.