Realtime AI Voice Changer Using RVC (Retrieval-based Voice Conversion w./ w-okada)

Поділитися
Вставка

КОМЕНТАРІ • 2 тис.

  • @Jarods_Journey
    @Jarods_Journey  Рік тому +352

    README!! Not downloading? 👇The VC is continually being updated so the version showed off in the video is no longer available. If you run into errors, you may have to try out the other versions to see if that resolves issues.
    Latest version as of this update: 1.5.3.8a
    I expect that I'll have to make a follow-up video.
    If the google drive link is down, use hugging face website. Look at the names there and determine what to download based on:
    cuda - Nvidia
    directml - AMD
    mac - MAC

    • @nokinirus
      @nokinirus Рік тому +10

      Ah yes. The Mac uses mac, with a side of mac.

    • @giggy_rook
      @giggy_rook Рік тому +10

      i downloaded it but it wont show my gpu it only shows my cpu and yes i use amd

    • @nokinirus
      @nokinirus Рік тому

      @@giggy_rook bro check if you dl-ed the cpu version. Because the first one says it'll work with your cpu, the mid one is cuda, and the last one's amd.
      Edit: if you're downloading from the archive...

    • @kaoruofficialtv
      @kaoruofficialtv Рік тому +3

      @@nokinirus i also have amd and it doesnt show the gpu only cpu. and yes i downloaded the directml one. I even tried downloading other versions and different types

    • @giggy_rook
      @giggy_rook Рік тому

      @@nokinirus still doesnt work

  • @NoztrozeR
    @NoztrozeR Рік тому +4363

    I have a creeping suspicion that the vtuber market is going to get a whole lot weirder with this tech improving.

    • @Everfalling
      @Everfalling Рік тому +441

      those voice changer jokes are gonna be legit now

    • @Reydriel
      @Reydriel Рік тому +141

      TBF there's a popular one that literally all AI atm, though her creator has arguably become even more popular lmao, it's kinda nuts what he's been able to make

    • @harrytsang1501
      @harrytsang1501 Рік тому +17

      Have always been

    • @CyberMonkey03
      @CyberMonkey03 Рік тому +42

      @@Reydriel Neuro

    • @VallenChaosValiant
      @VallenChaosValiant Рік тому +37

      In reality there are no shortage of women willing to do the job. Although many of them are self concious and still use a voice changer just to make themselves sound cuter/younger/whatever they felt inadequate about. Don't forget that 50% of the world are women and plenty find vtubing appealing.
      In real life the highest Youtubing earners are MEN, so if anything you lose money by voice changing into a female.

  • @Phoon1G
    @Phoon1G Рік тому +1344

    Under Audio options i found that choosing Server instead of Client, makes you sound a lot more realistic and takes away most of the robotic features

    • @zak_facts2676
      @zak_facts2676 Рік тому +5

      where do i find that
      ?

    • @noobio5510
      @noobio5510 Рік тому +27

      @@zak_facts2676 pretty sure its right under the S.Thresh, there a option next to AUDIO: for client or server

    • @Mrstan45
      @Mrstan45 Рік тому +3

      yes but it messed with my audio setup though

    • @aspen-
      @aspen- Рік тому +4

      @sorasong6780 under the download there is a "huggingface" button if you click that it works :)

    • @elpis8784
      @elpis8784 Рік тому

      What should I put the audio output to? There's MME but idk what that does

  • @amruzaky4939
    @amruzaky4939 Рік тому +720

    I'm not ready for fluently English speaking Marine-senchou.

  • @sitearm
    @sitearm Рік тому +38

    I very much like hearing the actual before and after effects and the detailed walkthrough. Thank you for posting!

  • @ElemXCR
    @ElemXCR Рік тому +481

    Some japanese VAs agencies are hammering down on their VAs' AI voices.
    It's gonna be interesting to see where this will go. While I hope I'd be able to use some of my favorite JP VAs voices and have it for english content, I'm betting there will be much more worse abusers and signing it off as their own.

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +64

      Rules and regulation are gonna be needed for sure, but there's no legal precedent for this yet so it's all up in the air for how it's gonna be dealt with in the courts. As with all things, there will be bad actors and despite me looking, I am yet to find any good resources or detection tools that can keep up with these advances.

    • @SamiTheAnxiousBean
      @SamiTheAnxiousBean Рік тому +21

      and I mean...somewhat rightfully
      If someone isn't comfortable with their voice being replicated, you shouldn't do so/leave a replication up

    • @bendover9620
      @bendover9620 Рік тому +12

      Always remember when dealjng with strict japanese laws:
      They can't touch you if you're outside their country or if your country has no reciprocation laws to back them up.
      If you get copystriked, just make another account ad infinitum, assuming you're anonymous.
      When all hope is lost, the worst-case scenario is to upload on BiliBili. Let's just say China and Japan aren't really on speaking terms.

    • @ridervtb
      @ridervtb Рік тому +4

      @@Jarods_Journey how do you get more ai models? i cant seem to find other voices to download

    • @TheMastertbc
      @TheMastertbc Рік тому +2

      imagine buying license for gura voice

  • @VIPPyroTM
    @VIPPyroTM Рік тому +789

    Holy shit, the amount of power of turning into a Hololive Girl is getting closer!
    Also, that Marine voice when she’s speaking English fluently is just so damn uncanny to imagine that there’s a timeline where Marine learned English SUPER WELL.
    I hope there’s a program that compiles all the complicated setup into an easier way of setting up since I’m tech savvy 😂

    • @DunceInAwhile
      @DunceInAwhile Рік тому

      True. Now people are going to view Hololive creators a little differently... Especially since most of the Hololive girls go to great lengths to hide their true identity. Makes you wonder...

    • @DoffDoffinson
      @DoffDoffinson Рік тому

      @user-xp9kq7xb6p You'll do it to me >:)

    • @niahonjou1933
      @niahonjou1933 Рік тому

      help,i have too much errors

  • @sabereaseera1384
    @sabereaseera1384 Рік тому +15

    Recommended to me randomly. You are super underrated.

  • @0AThijs
    @0AThijs Рік тому +61

    2:25 for anyone wondering why it's stuck at
    Booting PHASE :__main__
    Voice Changerを起動しています。
    please wait, it may take a few minutes.

    • @shat01j
      @shat01j Рік тому

      Great help Thanks!

    • @shinigamiwolfen
      @shinigamiwolfen Рік тому +1

      日本語上手

    • @Ryyza7
      @Ryyza7 Рік тому

      @@shinigamiwolfen hahah nani kore got jozued

    • @ミカ-m9p
      @ミカ-m9p Рік тому

      thanks literally after reading this it finally did it haha

    • @frits4061
      @frits4061 Рік тому

      Thnx for the tip I was searching for!

  • @chazington2
    @chazington2 Рік тому +6

    i did not know a tutorial video can be this pleasent and nice, i know this is a weird compliment but you're really good at making tutorial videos

  • @rolfathan
    @rolfathan Рік тому +22

    This is going to be so great for online role playing games. Being able to make a custom voice that you can use that matches your avatar will really increase immersion.

  • @cartoonhyperfixated
    @cartoonhyperfixated Рік тому +95

    This is insane 😭 crazy how people can replicate voice’s by using AI in real time

    • @lonelybookworm
      @lonelybookworm Рік тому

      ​@@freedomofwordbruh 2 seconds is real-time for most purposes

  • @PaxPolaris-kt1vr
    @PaxPolaris-kt1vr Рік тому +36

    This was incredibly helpful! I seen your video on TikTok and came here right away. Thank you so much for making this video; I couldn't of figured out that program without it!

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +4

      Appreciate it. I'm surprised at how much traction it gained haha.

  • @Skiedragon
    @Skiedragon Рік тому +112

    If anyone is experiencing very choppy sound, like your voice cutting off after every 'chunk', you can try changing the AUDIO to "server" instead of "client". Eliminated all choppiness for me.

    • @ge2719
      @ge2719 Рік тому +4

      Doesn't that mean your using server somewhere, and likely giving them all your audio data you're creating?

    • @Tryharding69
      @Tryharding69 Рік тому +1

      @@boombattlefields9123 ☠ bro... let's go

    • @tripleheadedmonkey420
      @tripleheadedmonkey420 Рік тому +2

      Just don't worry about it. If you've ever mysteriously had an advert for a product you just mentioned, outloud near an active device, pop up in your feed, you're already having everything you say parsed by some sort of analytical algorithm. This, while an additional outgoing stream of data from you, is at least one that you are aware of and have some control over.
      The only thing I could really do to guarantee my phone isn't listening to me type, even this sentence right now, is to put it in the microwave to block any outgoing signals. At least all you have to do is shut off the program and they aren't able to parse your data anymore.

    • @denjiaisaka2186
      @denjiaisaka2186 Рік тому

      can you help i cant hear my voice in program

    • @Kopie0830
      @Kopie0830 Рік тому

      Tried this and there seems to be no change in the voice even after changing the tunes hmm...

  • @DjTonioRoffo
    @DjTonioRoffo Рік тому +3

    Your Chopping is because of threshold set all the way up. It does a cut off of the input under a certain volume. Make it a lot lower (almost completely at the other side actually)

  • @Fahad-21
    @Fahad-21 Рік тому +243

    It works but latency is pretty high. Lowering chunks improves that but you lose a lot of the content of what you are speaking. One thing to note is to make it most natural sounding, always tune it to a number that is closest to the voices natural sound. Like around 22 for that first model. Also any idea how to turn off real time playback? It's easier to use the record and then playback for any projects.

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +43

      If you don't need the realtime functionality of it, you might be better off recording audio and then converting them in the RVC interface. You could always increase chunk size and there is a record fucntion on the client.

    • @_Chessa_
      @_Chessa_ Рік тому +3

      @@Jarods_Journey this is very helpful knowledge thanks for this.
      And thanks for asking this question also.

  • @unedited12
    @unedited12 Рік тому +4

    Your stuff sounds SO much cleaner than mine, and I even try to use a very clear voice

  • @piplupsuper0
    @piplupsuper0 Рік тому +31

    Jarod thanks for these videos!
    you've really helped me out a lot appreciate your content man it's fun keeping up with the new stuff you showcase!

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +2

      Appreciate it man! It's all wild and crazy tech and it's an adventure everyday checking these things out!

  • @infalogger9697
    @infalogger9697 Рік тому +8

    the reason it says smartscreen protected you is because the dev hasent signed the app with microsoft, but thats because doing that costs 300 a year

  • @RuTo94
    @RuTo94 Рік тому +38

    It’s crazy to believe that there’s actually people that design voices for these Vtubers to design to there preference on how they want it to sound. More power to them.

    • @tripleheadedmonkey420
      @tripleheadedmonkey420 Рік тому +13

      It will be used for this purpose, yes. However, it actually exists so that Horny men can privately moan at themselves in Waifu-speak.

    • @SirGlazer
      @SirGlazer Рік тому +9

      @@tripleheadedmonkey420bro why did you put this idea in their head

    • @tripleheadedmonkey420
      @tripleheadedmonkey420 Рік тому +8

      @@SirGlazer "Their head" he says while desperately trying to hold back the tears as his Tsundere anime waifu life begins anonymously.

    • @SirGlazer
      @SirGlazer Рік тому +4

      @@tripleheadedmonkey420 😭

  • @hellfrozen3678
    @hellfrozen3678 Рік тому +126

    I swear github is like the holly grail,I just learned about it recently but now I realise that every kind of software can be obtained from there and for free

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +24

      Yuppppp, hometown of lots of open source and many, many awesome things on there.

    • @tripleheadedmonkey420
      @tripleheadedmonkey420 Рік тому +3

      Is it weird I've seen this exact comment, word for word, pop up on almost every video related to AI in the last few weeks? xD

  • @celestraic
    @celestraic Рік тому +8

    Tsukuyomi-chan's project & her creator are so inspiring! She is a free voice project across a whole number of engines, mostly any free Japanese speech & singing synthesis programs. I definitely recommend that people check out some of her other resources & samples because she is really a treasure of a voice!

  • @Chrispyy__
    @Chrispyy__ Рік тому +18

    For those of you watching this and you cant see your GPU listed under the GPU tab this is what you do. Where the Audio section is where it says "Client or server" click on server, go back to the GPU tab to make sure your GPU shows in the drop down list, and then you can click back to client or leave it on server. It worked for me.

    • @jamesduke151
      @jamesduke151 Рік тому

      what gpu do u have? AMD or Nvidia

    • @Chrispyy__
      @Chrispyy__ Рік тому +1

      @@jamesduke151 AMD Ryzen 5 3600

    • @jamesduke151
      @jamesduke151 Рік тому

      @@Chrispyy__ does a drop down menu for the GPU appear like in the video for you? On mine there is a 0 1 2 3 instead

    • @Chrispyy__
      @Chrispyy__ Рік тому

      @@jamesduke151 mine just shows my GPU name. I don’t have any numbers

    • @jamesduke151
      @jamesduke151 Рік тому +1

      @@Chrispyy__ ok thanks. Are you using the latest version?

  • @gallanomarkandrea.1787
    @gallanomarkandrea.1787 Рік тому +1

    Sloppy Walrus your a MENACE to society for setting this up for your video XD

  • @rommix0
    @rommix0 Рік тому +8

    I've started using RVC for some of my videos. I was able to change Clint's voice (Clint from LGR) to Duke Nukem's voice for a Duke Nukem review he did some years ago.

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +1

      Haha that's awesome. RVC is quite good so I can see it being used in a lot of places.

    • @rommix0
      @rommix0 Рік тому

      @@Jarods_Journey Definitely. Compared to SVC, it's the best in regards to replicating consonants with the least amount of smearing.

  • @ResmondSam
    @ResmondSam Рік тому +51

    Just wondering, are there any resources online where people can post their own trained voice weights? It'd be convenient as you won't have to keep training your own voice for the voice changer in case somebody else already happened to do so.

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +13

      Someone let me know of one called AIhub discord group

    • @SynFuZe
      @SynFuZe Рік тому +1

      @@Jarods_Journey is there a quick invite link anywhere? I can't seem to find the group anywhere

    • @literailly
      @literailly Рік тому

      Anything on huggingface?

  • @realjgerard
    @realjgerard Рік тому +186

    As a highly trained vocalist, I have been waiting for this to be a reality so that I can create cover songs that one could only dream to hear, like Freddy Mercury, Curt Cobain, and Steve Perry singing on the same ballad!! If anybody seeing this has the capability to train voices and would like to collab on a project, let me know! I’ve yet to figure out the training but I have an entire professional quality studio set up and I’m ready to get to sangin!! Let’s GOOO!!!! 🚀🚀🚀🚀

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +22

      Funny you comment this because a video I'm gonna be releasing is talking about the potential to turn my untrained voice into something that is bearable... simply by using a trained model xD. Another use case is say you throw a bunch of filters onto a voice and don't want to do post-processing ever again. Well, if you just get enough voice samples... you could essentially just "sing" and then BOOM, it's all edited. Still a little bit of issues ofc, but.......... it's super exciting lol.

    • @kylespevak6781
      @kylespevak6781 Рік тому +11

      People have already been doing similar with rap. It's definitely cool!

    • @GNR_Fan
      @GNR_Fan Рік тому

      @@Jarods_Journeycount me in… I am chasing the real time to help how song like AXL ROSE… how can I help to make real time effect a reality?

    • @neek01
      @neek01 Рік тому +1

      Personally for me i’d use it for music production, so so much easier to draft a song when you can hear fitting voice with it for an actual artist to sing later. I usually sing a bit myself but having a fairly low male voice, i can never do a female voice

    • @krakentren7988
      @krakentren7988 Рік тому

      Hey, i am a music producer, where can I contact you? This is my first private account

  • @MartHommes
    @MartHommes Рік тому +17

    I went along with this tutorial and everything went smoothly until i opened the program. Above I only have the "clear settings" button when there should also be "reload" and "select vc". My screen looks like 3:07 without those buttons and without the voices to choose from. The "edit" section for the voices is completely empty for me and I'm now stuck and don't know what to do since I'm not too advanced when it comes to computers. Does anyone know how I could fix this?
    EDIT: Nevermind I fixed it! If anyone else ran into this it's easy to fix. Under "NOISE" you have the "F0 DET." thing. It's on "dio" by default and when you switch it to one of the other modes the different voice models will appear.

  • @neo7538
    @neo7538 Рік тому +1

    hearing Hoshio Marine speaking fluent english is something I did not think my brain could comprehend, holy shyet

  • @akiodemon
    @akiodemon Рік тому +44

    The catfishing is gonna be wild..
    Anyways, thank you for uploading this video for others like me to see, It is gonna be cool to try out.

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +16

      Immediately one of the first things I thought about lol, it's gonna get wild. But also, the more you're in the know, the less likely you're to fall for any types of these things as well.

    • @SDT493
      @SDT493 Рік тому +1

      LOL ME

    • @Rinno-sempai
      @Rinno-sempai Рік тому +2

      so more males are gonna be applying to be part of a middle range agency (that cannotnmake too much background check( using these filters lol
      Catfishing and also contract breaking (one female can work at 2 or 3 agencies without her being voice recognized lmao)

    • @MaxKrovenOfficial
      @MaxKrovenOfficial Рік тому +2

      This is gold for the Vtubing community, actually.

    • @dra6o0n
      @dra6o0n Рік тому +2

      @@MaxKrovenOfficial It's also bad because agencies and companies wants to see you literally in person in order to setup any sort of contracts or deals, but it also opens up to scams and such because you can impersonate other people very easily...
      For instance it might hide the indian scammer's bad accent and fool a lot more people who are usually aware of these people and their bad voices.

  • @nihilvt
    @nihilvt Рік тому +8

    I thought it would be REALLY good, but I didn't realize it takes more resources than Chrome and Photoshop. As soon as anything else needed some GPU, it started stuttering and became unusable. I hope it gets good enough to not need more than 12GB of VRAM.

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +1

      You might be able to offload it to run on CPU instead of GPU, but yeah, most of these AI projects are pretty compute hungry.

    • @forest1605
      @forest1605 Рік тому

      @@Jarods_Journey how

  • @RandomGuy0987
    @RandomGuy0987 Рік тому +3

    WOAHHH 6:25 that's totally Marine's voice speaking fluent English. Crazy.

  • @nkozifraser2331
    @nkozifraser2331 Рік тому

    whoa! good to see you finally get the views you deserve brutha!!

  • @gamecreator7214
    @gamecreator7214 3 місяці тому +3

    If you don't have the voice actor icons, you downloaded a past version or server ( I am incompetent, don't ask me). You need to download a client version and it is at the same page in the start and currently will direct you to download it from hugging face. It is 2.+ version. It helps if you choose english on the git page... Took me a day to figure it out, never again.

  • @AlvinTheLAW
    @AlvinTheLAW Рік тому +18

    That was so weird hearing Senchou speaking native-like english

  • @espae_
    @espae_ Рік тому +6

    can you make a tutorial or is there already one of how to make your own model for real time talking? i know there's ones for singing but if I want a better talking model how much data should I use? podcasts maybe?

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +2

      I do have to link it here: ua-cam.com/play/PLknlHTKYxuNshtQQQ0uyfulwfWYRA6TGn.html
      The same models used to train in RVC can be either singing or talking models, just depends on what audio data you curate and train it with. I recommend start with 10 minutes of super, high quality data that is clear and then increase it if the model isn't good enough.

    • @Alice_Fumo
      @Alice_Fumo Рік тому +1

      It's a bit of a crapshoot. I've somehow had amazing results with something like 2 minutes of ludicrously high quality audio data and not quite as good results with several hours of also very high quality data.
      It seems that there are a few types of voice which just happen to work better.
      Whatever you do, make sure to only use data which is as good as you can get.

  • @YurgenGrimwood
    @YurgenGrimwood Рік тому +6

    I give it 5-10 years and we can just prompt a website to generate media to consume. At least that means I can finally get a second season for all those shows that didn't get one...

  • @itsonlyjurko4080
    @itsonlyjurko4080 Рік тому +2

    2:24 when i open the bat file, for some reason the download you mentioned isnt starting, is there any way to fix?

    • @mtnocap7114
      @mtnocap7114 Рік тому

      This is only a problem with the new file, download the old version and everything will work

  • @RubySapior
    @RubySapior Рік тому +3

    Both Crepe and Harvest seem to be both cpu dependent.
    CPU runs at like 100% Ryzen 7 5700G
    While gpu is at like 38% gtx 1080 ti

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +1

      It's an odd thing, but CPU draw seems to go up when using it selected on my 2070 super I've noticed. Dunno why, but it doesn't show GPU usage. Might be something to raise to the author eventually though.

  • @KebabTM
    @KebabTM Рік тому +18

    For the index option, it will improve your quality. It wanted you to choose the file starting with added_IVF5870 rather than the npy file.

  • @YonasanErihhi
    @YonasanErihhi Рік тому

    Such a great video, Thank You very much bro!

  • @KyotosEnd
    @KyotosEnd 4 місяці тому +1

    why dont the characters pop up for me

  • @Reydriel
    @Reydriel Рік тому +3

    Still has a very distinguishable "robotic quality" to it, but that will probably improve

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +2

      This is the worse the technology will ever be... so yeah, a bit spooky.

  • @greenish16
    @greenish16 Рік тому +4

    Cool video!! Try to make more often longer videos, more fun and exciting! ❤️

  • @jackyisking
    @jackyisking Рік тому +54

    For the song covering maybe, but this seems crossing the line into creepy. 😂

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +9

      It was much, much better than I thought 😅, but mind-blowing tech nonetheless.

    • @Tilt_TM
      @Tilt_TM Рік тому +4

      This would be hilarious for messing with people in VOIP games like Battlebit Remastered

  • @Smith0j
    @Smith0j Рік тому +1

    I've done everything but when I get 2:22 here the black menu doesn't show up

  • @Danzhu
    @Danzhu Рік тому +1

    Until now can't use AMD GPU, I've followed the method * 2 still can't, it still takes the source from the processor, not from video graphics 😢

  • @jaymosupreme
    @jaymosupreme 9 місяців тому +2

    4:27 Sounds like an old lady who just finished giving a toothless deepthroat gum job to a BBC.

    • @1mortar1
      @1mortar1 8 місяців тому +2

      thats a little specific

    • @nnoossiirr
      @nnoossiirr 6 місяців тому +1

      what the fuck

  • @Adorablybadmemes
    @Adorablybadmemes Рік тому +3

    For some reason, whenever I try to use any of the voices there's a lot of background noise/static, that seems to be coming from nowhere.

    • @ItsMeCharkey
      @ItsMeCharkey Рік тому +1

      Yeah I either hear nothing or just static for me as well

    • @CertifiedAsher
      @CertifiedAsher Рік тому

      Dont use cpu, be sure to have good gpu : (

    • @Adorablybadmemes
      @Adorablybadmemes Рік тому

      @@CertifiedAsher I'm using the GPU version, and my GPU should be plenty to process it at lower bitrates at least, but it always ends up sounding staticy.

    • @sxteya
      @sxteya 11 місяців тому

      ​@@CertifiedAshercope

  • @qwerty9567
    @qwerty9567 Рік тому +3

    For some reason my client doesn't have the "Select VC" button to select RVC. Does anyone know how to fix this? I can see the deafult models downloaded in the files but they don't appear on the client as RVC isn't selected. I've also realised that it doesnm't seem to be detecting my GPU as the CPU selection is the only on in the list

  • @JDizon849
    @JDizon849 Рік тому +4

    Any ideas on how to get this to output as a virtual microphone? This could be really fun in discord.

    • @Fs3i
      @Fs3i Рік тому +1

      VB Cable / Virtual Audio Cable - should be easy to find with google

    • @ociones
      @ociones Рік тому

      LOL

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +4

      :) ua-cam.com/video/IS_SPQVv5iY/v-deo.html

    • @marufranco5281
      @marufranco5281 Рік тому +1

      maybe set your recording device as Stereo Mixer , i don't know, it might work

  • @quelidle3772
    @quelidle3772 6 місяців тому +2

    having a problem, after starting start_http for the first time it said it failed because it could not find win.api or something like that. Now when i try to run start_http it opens for 1 second and immediately closes

  • @fl0wera_1
    @fl0wera_1 Рік тому +1

    Nothing of this works, I followed all directions perfectly and my mic isn't working. When i change the input it goes to some "No error message" and "Intialize"

    • @redzeroo6068
      @redzeroo6068 Рік тому

      You're not alone. I don't know if it was the update I did because it was working fine this morning. Now gpu and server giving me a "ERR_CONNECTION_REFUSED" error.

  • @Cheqipeqi
    @Cheqipeqi Рік тому +4

    Where do I find the voices like Botan and Marine? Would love to get em! (aswell as ur settings with em)

    • @mecchamina
      @mecchamina Рік тому +4

      Exactly what I was about to ask!

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +1

      Appreciate it guys, but I can't distribute the models unfortunately! However, I can share the knowledge required to train the models and I have those videos on my channel. I'm working to get it all a bit more organized, but you'll have to gather audio data on your own (thought there are plenty of tutorials on how to get audio data out there).

    • @Boredness90
      @Boredness90 Рік тому +1

      @@Jarods_Journey if you cant distribute it why even make the video at all or even showcase it LMFAO

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +2

      @@Boredness90 It's educational content and falls under fair use. Distribution does not, falls under more murky waters.

  • @cambeckett
    @cambeckett Рік тому +3

    this is super cool!! is there a way to use this as an input for a discord call or something?

  • @graysonnguyennzz2903
    @graysonnguyennzz2903 Рік тому +3

    Hi Jarods, I have tried to download this voice changer multiple time. But everytime i click at the drive (normal) it is not working. It say that too many people downloading this file which lead to failure when downloading. Please help me of how to download this if this way is not working. Thank you,

  • @StarAllKungfu
    @StarAllKungfu Рік тому +1

    This would be awesome for online TTRPG's. As a deep voiced male, the best I can do is an intimidating Hag. I'd like to get some other female voices.

  • @pepadrs
    @pepadrs Рік тому

    love how it upgraded and now you can download it from huging face

  • @nyanbrox5418
    @nyanbrox5418 Рік тому +6

    one interesting thing is this could theoretically be combined with a translator first, though that would take a whole new, probably larger model,
    As hardware and software improves, this is just the beginning!

  • @kenstar222
    @kenstar222 Рік тому +4

    This is truly an amazing find and piece of software, I would be very interested in messing around with it but unfortunately I have an AMD build and I cant find a way to use my 5700XT gpu to process the sounds, and it doesnt seem to be fairing well with my Ryzen 7 2700X cpu :( Any potential help would be greatly appreciated!

    • @GondoMan21
      @GondoMan21 Рік тому +4

      i have the exact same build let me know if you find anything ahaha

    • @kenstar222
      @kenstar222 Рік тому

      @@GondoMan21 will do, likewise -so far no luck but I will do some checking each day and come back with anything I learn

    • @Jarods_Journey
      @Jarods_Journey  Рік тому

      Might have to adjust the settings in the client to try and help it out, but it doesn't run too well on CPU unfortunately. Did you download the directml version? That would should support AMD.

    • @Diamartin
      @Diamartin Рік тому +1

      @@Jarods_Journey well, it doesn't

  • @LovelyNyx7
    @LovelyNyx7 Рік тому +2

    Whenever I speak into it. I can hear the voice quite well the only issue is that after I stop speaking a second later it will play a very quiet voice of it back to me. It only does it with one voice tho so I'm assuming it's just something to do with that voice.

  • @crusader_gaming8273
    @crusader_gaming8273 8 місяців тому +7

    Discord nitro here I come

    • @Ozzy622
      @Ozzy622 2 місяці тому

      Free money, here i come!

  • @ivaniousivanious6234
    @ivaniousivanious6234 Рік тому +6

    Hey guys, I wonder, can you just use an audio input instead of real time voice so that it still mimics your intonations? Or maybe there is some other software that can help adjust intonations?

    • @Margen67
      @Margen67 Рік тому

      Owls need HUGS

    • @IelmaoUfo-lp9bd
      @IelmaoUfo-lp9bd Рік тому

      Use the standard rvc, so Vita inference in python or in a ui like rvc GUI.

    • @Glutzz
      @Glutzz 11 місяців тому

      can you explain that more @@IelmaoUfo-lp9bd

  • @sharryboy88
    @sharryboy88 Рік тому +1

    if i speak and hear it through my headphones i get an echo and the model says what i said many more times... creepy... how can i fix this???????

  • @laya4884
    @laya4884 6 місяців тому +1

    the "crepe" is unvailable for me, it says "cepe (N/A)"

  • @stnhndg
    @stnhndg Рік тому +6

    Yep, Japanese voice models work better with Japanese. I mean... they were trained on Japanese speakers ))
    This is most noticable with consonants, since those are usually treated differently in those programms (e.g. unvoiced consonants don't have pitch). For example, I couldn't make an English model to pronunce 's' from my language, or any variations of it actually (regarding tongue position being more forward/backward). But with vowels it followed my speach pretty close even if those vowels were not typical for English phonetics... with some exceptions on high vowels (pretty decent though). Tough the last problem might be due to relative lack of palatalized consonants in English.

    • @wargreysama
      @wargreysama Рік тому

      I tried speaking Turkish with it and it works just fine, probably due to the fact that Turkish is pretty similar to Japanese when it comes to pronounciations and stuff.

    • @stnhndg
      @stnhndg Рік тому

      @@wargreysama To be honest Turkish is close to Japanese even at grammar (up to some degree) ))
      It works pretty decent with many languages. I was just curious about possible limitations and since I'm a bit into languages I tried to -make poor anime girl suffer- to play with different sounds non-typical for Japanese.
      As for now - my favorite thing is a word 'tractor'. Those voices make it more like 'toractor' which is adorable )

  • @onlydistant
    @onlydistant Рік тому +3

    Is it possible to use this with audio files, in terms of converting the audio file to the respective voice?

    • @trent-po8qm
      @trent-po8qm Рік тому

      yes, in the input, you can select file. for me it errored the 1st time but after reloading, it let me select a file from my computer as the input, and record the converted audio by clicking the record / save button on the bottom to get the converted output

  • @T4EKO
    @T4EKO Рік тому +5

    Figured out the link, but do you have any advice for getting clearer audio? When I speak it chirps and distorts pretty constantly (sort of like when you lowered the chunk down really low, but I get that effect in all chunks)

    • @T4EKO
      @T4EKO Рік тому

      this is both with custom pth files and the provides stock ones

    • @kennethnathantagalog5597
      @kennethnathantagalog5597 Рік тому

      it has to be your gpu

    • @T4EKO
      @T4EKO Рік тому

      @@kennethnathantagalog5597 its set to my GPU (seems to be putting the load on my CPU anyway)

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +2

      Might be hardware specs, I'll be going over this a little bit more in a vid

  • @wandychandrawijaya5867
    @wandychandrawijaya5867 Місяць тому +1

    Hello, i followed all the instructions but mine still doesnt work. Now i want to delete everything, do i just delete the file in file manager? How to uninstall the one downloaded in the cmd program?
    Sorry for my bad english, please let me know @anyone.

  • @naifbashin3099
    @naifbashin3099 Рік тому +1

    FINALLY I can do Jack Sparrow vs Barbosa Standoff in Red Dead Online

  • @nikosurfingYT
    @nikosurfingYT Рік тому +4

    Wow thank you for making this tutorial. I'm wondering can I add more models? If so, where to find it?
    Subscribed!!!

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +2

      Appreciate it! I recommend you train models, but there's a discord group called AI Hub where you can go find some models

    • @nikosurfingYT
      @nikosurfingYT Рік тому +1

      @@Jarods_Journey thanks, super excited about this

  • @oscarreyes4511
    @oscarreyes4511 Рік тому +8

    This is the main reason why I rejected my banks offer to secure my bank account using my voice over the phone! AI is freaking scary in the wrong hands!

    • @Jarods_Journey
      @Jarods_Journey  Рік тому

      I have yet to try it on voice security systems... but that'll be an interesting topic to explore.

    • @oscarreyes4511
      @oscarreyes4511 Рік тому

      @@Jarods_Journey You can use it to change your voice live and make a phonecall. That is how a Chinese investor got tricked and lost a ton of money. He thought he was talking to his business partner and sent him money for a business deal. The crook even facetimed the victim using a deepfake of the business partner face!

  • @9a8szmf79g9
    @9a8szmf79g9 Рік тому +20

    That certainly gives V-Tubers a break. I don't believe there's any way they could do the same thing everyday and not become even a little tired of it. I don't usually watch most of them exclusively, sometimes clips; but for example, I watched from the last 2.5 hours of Mumei's livestream karaoke and she was already tired and bored after the 1st hour when I joined their stream; not that I'd know what she's like but it definitely seemed like it was possible that it was someone else filling in for the night using such a voice changing program.

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +9

      Ah, it's not that good to be at that level yet, you can definitely tell when someone is trying to use an AI voice still but to this point, I think V-tubers are youtubers so even if you're an IRL streamer, it's not like you have someone else sub in for you when you're tired xD

  • @SKYGGEMUSIC
    @SKYGGEMUSIC Рік тому

    It works very well but the latency is very high (more than 2 seconds!) even with collabs pro when using the notebook page. Any idea to help solve this? Thx!

    • @slysxc1263
      @slysxc1263 Рік тому

      Im so lost on how to even get it to work lol

    • @slysxc1263
      @slysxc1263 Рік тому

      it just says in the cmd promt thing "warming up... generating sola buffer." and thats after i have it turned on but it doesnt do anything from there and my mic stay sounding default

    • @Jarods_Journey
      @Jarods_Journey  Рік тому

      Tbh, I'm wasn't sure if the Collab worked but if it does, the delay would come from having to use googles servers to host the client instead of your local device

    • @SKYGGEMUSIC
      @SKYGGEMUSIC Рік тому

      @Jarods_Journey, that's what I expected (I've subscribed to Google Collab Pro) but I still have 2 sec latency. That's frustrating! I would LOVE to use your great tool for my next live perfomance in Taiwan! Can you help? That would be fantastic

  • @bombadt-yt9818
    @bombadt-yt9818 Рік тому +1

    Awesome, I'm sure this will be put to a very very very.. Very good use.

  • @patrickdailgarcia2500
    @patrickdailgarcia2500 Рік тому +4

    Hey Jarods! A fellow Mechatronics Engineering Grad here, you make a lot of quality content and I wish to message you regarding some technicalities of AI voice cloning. And maybe some career advice for degree holders in Mech? haha
    Where can I reach you?

    • @Jarods_Journey
      @Jarods_Journey  Рік тому +1

      Hey Patrick, always great to see a fellow mecha :D! I would say linkedin is going to be the best bet for professional stuff, if not, there, then the next best bet is discord as I'll generally respond on there. I do get a lot of PMs but it shouldn't be a problem if you pmed me from my group.

    • @patrickdailgarcia2500
      @patrickdailgarcia2500 Рік тому

      Gotcha! How do I find you on Linked In btw hehehe

  • @titomo5854
    @titomo5854 8 місяців тому +4

    DUDE RTX 4090 🙂

  • @NoobsPit
    @NoobsPit Рік тому +3

    Everytime I try to launch the voice changer it shows this error and doesn't work Failed to load URL: localhost:18888/ with error: ERR_CONNECTION_REFUSED or when it does load I click on a voice and it says Cannot read properties of null (reading 'enableServerAudio')

  • @csolisr
    @csolisr Рік тому +14

    Welp, this is going to put old voice actors out of a business, but on the other hand it's also going to allow VAs to be easily replaced in case of illness, death or jail sentence (yes that last one has happened)

    • @billionaeris1183
      @billionaeris1183 Рік тому +2

      AI will erase many jobs

    • @forest1605
      @forest1605 Рік тому +5

      i mean real life people can still say a vowel for a long time without fail so

    • @muzz4355
      @muzz4355 Рік тому +1

      they still need the datasets to train the ai with which will need VAs to make so they will still have jobs just making datasets rather than the exact lines

    • @csolisr
      @csolisr Рік тому

      @@muzz4355 Which is why I specified *old* actors are out of a business - they have plenty of recorded voice to train their doppelgangers on. Newer actors are safer in virtue of having less data to train on.

    • @muzz4355
      @muzz4355 Рік тому +1

      @@csolisr its less the VAs that are in danger but rather the specific characters they voice. a VA is always changing tone, accents etc between characters . Old actors will still be wanted to come in for new characters but less likely to return to their existing characters.

  • @Marin_Mewz
    @Marin_Mewz Рік тому

    It's cool, I really admire someone like you❤

  • @OneroomBeatz
    @OneroomBeatz Рік тому +3

    Don't let these Nigerian dating scammers know about this

  • @3eeway
    @3eeway Рік тому +5

    RVC is amazing, but the latency is a huge problem

  • @harurosech.4848
    @harurosech.4848 4 місяці тому

    I just used the voice of this video on my phone to configure my settings. Thanks

  • @ThornOfSociety
    @ThornOfSociety Рік тому +1

    Followed along, did the same settings, clicked start and nothing....

  • @funnys9161
    @funnys9161 Рік тому +1

    The google drive says "Sorry, you can't view or download this file at this time."

  • @kitsune-ame92
    @kitsune-ame92 Рік тому +2

    I have a question, how to you get the trained voice material? I mean where you download the Marine's Voice(But I'm not finding Marine)

  • @Dennis-qh1sr
    @Dennis-qh1sr Рік тому

    For those who have an AMD graphics card, when you set up the whole software and have a model selected, open your Task Manager and test the said model while switching from GPU1, GPU2 etc.. One of those is your graphics card, so it won't have to use your CPU. I have an RX6800XT and still couldn't find my GPU and it was lagging due to it using my CPU. Following the steps above will sort that out, at least it did for me. GPU1 for example had my CPU at 80%. GPU0 on the other hand had my CPU at 20%, which means that GPU0 is actually my RX6800XT.

  • @B4rtek1
    @B4rtek1 Рік тому +1

    i don't hear anything, how to fix pls
    edit: also i have a white screen when im opening the voice changer app help pls

  • @turn-out1
    @turn-out1 Рік тому +2

    I don't know why, but on the input sound test feature I have quite severe noise, even though my room is quiet.
    It was quite annoying and affected the voice changer results. help :')

  • @Faze_booger
    @Faze_booger Рік тому +1

    I’m having a problem when I try to use the voice changer when I talk I hear a static sound and when I get it to work sometimes it has a very long delay

  • @XrafyTheTylo
    @XrafyTheTylo Рік тому

    this is great, i can finally act like im some online characters on video games

  • @aishams3
    @aishams3 Рік тому

    Awesome!
    Is there a colab notebook for "realtime" voice changing? I saw repo of so-vits-svc-fork but this is not work for "realtime" voice changing.

  • @defialy
    @defialy Рік тому

    omg i loved that u used houshou marines voice LMAO

  • @FurasCebulowy
    @FurasCebulowy Рік тому +1

    When I open the Google Drive the file does not show up, there is an error.

  • @XXDanXX
    @XXDanXX 3 місяці тому

    I have got the strangest problem. When I choose my microphone, RVC is hearing everything. If I play a UA-cam video it starts audio changing that. In Discord it changes my friend's voices too. How the heck do I make RVC only hear my microphone?

  • @daniloman1564
    @daniloman1564 Рік тому

    lol, i'm about to call my friend just to say "Uh, Hello! Hello Hello? Eh, i've a menssage for you, Freddy Fazbear's Pizza sends u an Happy Birthday Johnn," with phoneguy model, lol, thats crazy

  • @silverelvien6046
    @silverelvien6046 10 місяців тому +2

    I have a really good computer, so I don't understand why the voice has a 3 second delay :(
    If anyone can help me with this, please give me some advice.

  • @joewinfield244
    @joewinfield244 Рік тому

    You mentioned training your own models but I don't really understand the process like that file type I'm not familiar with. Maybe I need to watch some of your other videos🤷🏾‍♂️

  • @TheLazyDudeTV
    @TheLazyDudeTV Рік тому +1

    why mine echoes, like it repeats the word i say many times

    • @helistia01
      @helistia01 Рік тому

      Same did u find a solution ?

  • @bozydarwyrwicipka3744
    @bozydarwyrwicipka3744 11 місяців тому +1

    I tried it, but voice lags and comes with lags like I had 1990 pc. It speaks with pauses, so...me...thing... ... like... th-this