How to Clone Any Voice With AI | Tortoise-TTS Tutorial

Prompt Engineering

Published 17 Nov 2024

COMMENTS • 217

@engineerprompt 1 year ago ⁺¹⁷
If you liked the video, you should check out the video on how to create your own AI Avatars here: ua-cam.com/video/V2efVSXSlqc/v-deo.html
@pacovazquez665 1 year ago
Where can I find help installing it in my Anaconda Prompt window? It's displaying errors.
@ROM_2OO3 1 year ago ⁺⁹
It is incredible. I saw some comments saying that the accent is totally lost, but I tried it and the accent is the same!!!! Thank you very much for this; it is what I had been looking for for a long time. It's just perfect ❤
@engineerprompt 1 year ago
Thank you and glad you found it useful.
@prakash.pathak 1 year ago ⁺⁵³
I have seen this problem with many YouTubers who say the AI clone of their voice does not match their original voice. The output created above sounds "exactly" like you. But you can't tell, because we hear our own voice differently from how others hear it!
@ChristianIce 1 year ago ⁺⁴
The timbre is OK, but the inflections and the accent got totally lost.
Last year, this technique would have been incredibly good.
Today there are much better options.
AI is evolving at the speed of light.
@MyTobirama 1 year ago ⁺²
@ChristianIce Can you link some of them?
@engineerprompt 1 year ago ⁺³
Interesting point, that could actually be the case.
@engineerprompt 1 year ago ⁺¹
@Mutual induction It's absolutely free. Watch the video for the number of audio samples :)
@ProVideoScribe 1 year ago
@engineerprompt What if we want to render a long paragraph? Should we cut it into sentences and render them one by one, or is there a way to render all of it at once?
@postwhateverwhenever 1 year ago ⁺¹¹
Thank youuu~~ I'm gonna use this for my favorite video game characters 😎
@MrTalhakamran2006 1 year ago ⁺⁹
Always love your videos... no nonsense... straight to the topic
@engineerprompt 1 year ago
Thank you!
@Nabuuug 1 year ago ⁺⁶
It does sound EXACTLY like you, it's crazy. I guess it's the "hearing our own voice is weird" phenomenon that is at play here.
@engineerprompt 1 year ago
That seems to be the case.
@Spozinbro 1 year ago
Not exactly, the replicated voice is still missing some of the accent.
@robbieweld7928 1 year ago ⁺¹
@engineerprompt I disagree, it sounds as if it gave you an American accent.
@MyOtherworldlyLove 1 year ago ⁺²
Could you please make a video with fully detailed instructions on how to install Tortoise and get it working? Like, instructions for total beginners? 😅 You skipped the part of the process between finding it on GitHub and adding a new voice, and that's the part that's the biggest mystery for me. I'd love to use voice cloning, but I've never used Python and basically all I know about it is the fact that it exists. So detailed step-by-step instructions for those of us who know nothing about coding would be very appreciated! 😅
@JukeBoxDestroyer 1 year ago
The voice sounds just like your voice, minus the accent. Sounds very good.
@user-op3yd5gi9p 11 months ago
Good job! It sounds like you. I know we hear our own voices differently from audio recordings: we hear ourselves through our own bodies/bones.
@georgezlei 11 months ago
LOL. The generated voice really sounds like you. I thought it was you talking, then I noticed you had already clicked the button.
@DamienRourke 1 year ago ⁺²⁶
Great walk-through, thanks! BTW, the HQ sample did sound like you. It lost a bit of your accent, but overall it did sound like you.
@engineerprompt 1 year ago ⁺⁴
Some others have pointed out the same. I guess I am not used to hearing myself like that :)
@saifshaikh8828 1 year ago
Hey brother, is it safe to install on my PC? My Windows is showing a harmful and dangerous risk warning.
@MrDaydreamer1584 1 year ago ⁺¹
"It lost a bit of your accent, but overall it did sound like you."
It lost all of the accent, not just 'a bit'.
@ozzy1987mr 1 year ago ⁺⁶
Thank you very much, I was looking for something like this... excellent material and channel content. Even though I don't speak English very well and the translator is bad, your videos are understandable and very clear.
@engineerprompt 1 year ago ⁺³
Thank you, I am glad you found it helpful. Consider subscribing to the channel; I have something big planned for the Spanish audience in the near future 😀
@usagiracha 1 year ago ⁺³
"ModuleNotFoundError: No module named 'einops'" Any idea how to solve this?
@flowsolo 1 year ago ⁺²
Mine made a crazy demon noise in the middle of a sentence.... SWEET. XD
@engineerprompt 1 year ago ⁺¹
Haha, it can be unpredictable sometimes. Hope you had fun with it.
@basictutorial88 1 year ago ⁺²
I love it bro, thanks for sharing 👍
@starcitycreations 1 year ago
This tutorial is awesome! Thank you SO much!
@WhyHelloReader2Me2You-wc2br 10 months ago ⁺¹
Can you download that model to run it locally on your machine? Is the resulting file a .pth?
@weluvtech 1 year ago ⁺³
Awesome tutorial, thanks. I was looking for a good Google Colab of Tortoise-TTS. By the way, I found your generated samples sounded just like you. It was hard to tell them apart when you were playing them.
@cybergigafactory 1 year ago ⁺⁷
Great video, thanks.
Is there a limit on how much it can generate at once?
@engineerprompt 1 year ago ⁺⁴
If you use it locally, then I think it will be limited by your RAM.
@planetgamecommunity817 1 year ago ⁺²
No... it's limited by the price of GPU specs.. hehe
@lalaland322 3 months ago
Great video!
@ajitkumar15 1 year ago ⁺⁴
Can we use this for other languages too, or is it limited to English? Thanks in advance.
@adrianaagresta 1 year ago
Same question I was about to ask.
@xs6819 1 year ago ⁺¹
Thanks for sharing this. Is there any way to make it read 1,000 words at a time?
@DailyStoicisme 1 year ago ⁺¹
Hey, why do I always get the message "maximum stack size exceeded"?
@chesper_miguel 9 months ago ⁺¹
There was a Brazilian who translated this video with the tool, LOL
@engineerprompt 9 months ago
😀
@mrGapMan1 1 year ago
The clone is spot on, but with a slightly cleaner English accent.
@mahmoud_ali_963 1 month ago
Thank you very much... Does it work with languages other than English???
@spectrecular9721 1 year ago ⁺⁴³
It seems the AI struggles with non-American accents
@fkxfkx 1 year ago ⁺²⁰
So do Americans.
@crypticutopia7228 1 year ago ⁺²
@fkxfkx As an Australian who went to the US for 5 weeks, I can confirm this is very true 🤣
@KeytoChannel 1 year ago
It's a racist AI
@geese5170 1 year ago ⁺¹
It's mostly American companies and American voices being used as examples, mostly because it's US English 90% of the time.
@hoovy1163 1 year ago ⁺¹
Here are a few issues that I have:
It doesn't have the exact same voice; it's lower pitched and sounds older? It also has a British accent for some odd reason. And when I try to form long sentences it starts babbling and making weird robot and inhuman noises lol
@Han_Ngoc_Quang 2 months ago
Your last commands aren't working; everything is "not recognized as an internal or external command, operable program or batch file."
@senate_shakya_ 1 year ago
You, sir, have my respect!
@engineerprompt 1 year ago
Thank you!
@Mox53 9 months ago ⁺¹
Is it possible to make this text-to-speech work in another language?
@engineerprompt 9 months ago
Not with Tortoise. I think Coqui supports that.
@amejrarmohamed8524 1 month ago
I'm asking before cloning my voice: is it possible to upload a large file and have the software record it for me (with my cloned voice, of course)? Thanks
@TylerThomas 8 months ago
Def gonna try this out
@csomi35 1 year ago ⁺²
Is it possible to add emotions to the generated audio? I mean, after successfully cloning a voice, I would like to fill it with some emotion (exclamation, fear, sadness, fading, etc., like a voice actor).
@mattlegge8538 1 year ago ⁺¹
I don't know if that's possible with any software yet. Maybe Bark?
@BigDaz 1 year ago
The documentation says: you can evoke emotion by including things like "I am really sad," before your text. I've built an automated redaction system that you can use to take advantage of this. It works by attempting to redact any text in the prompt surrounded by brackets. For example, the prompt "[I am really sad,] Please feed me." will only speak the words "Please feed me" (with a sad tonality).
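To make the bracket convention quoted above concrete, here is a minimal sketch of which words would end up being spoken. It only illustrates the idea; `spoken_text` is a hypothetical helper written for this example and is not Tortoise's own redaction code, which may behave differently.

```python
import re


def spoken_text(prompt: str) -> str:
    """Drop every [bracketed] span and tidy the leftover whitespace."""
    without_brackets = re.sub(r"\[[^\]]*\]", "", prompt)
    return " ".join(without_brackets.split())


prompt = "[I am really sad,] Please feed me."
print(spoken_text(prompt))  # prints: Please feed me.
```

The bracketed part still goes to the model as part of the prompt, which is what is meant to steer the tone; only the text outside the brackets should be heard.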
@imashark241 1 year ago
Niceeeeeee, now I don't have to make the effort of reading with my own voice; I can just use the voice clone.
@st.magnic8592 1 year ago ⁺²
Can the segments be shorter or longer than 10 seconds, or do they have to be exactly 10?
@engineerprompt 1 year ago ⁺¹
No, they can be much longer; I have tested it on up to 30 seconds. It depends on the hardware you are using.
@Seii-FPV 6 months ago
I don't have the GPU option, even though I have a pretty powerful NVIDIA card installed. I only get GPU T4 and for some reason it won't accept that as an option. Has anything changed in how this works now? Does it need a paid subscription?
@zyrazuric1499 1 year ago ⁺¹
Can it run on a low-end PC?
@funginimp 1 year ago ⁺¹
I cannot actually hear the difference between you and the first sample. Remember that you probably sound slightly different to yourself because you're hearing it through your body.
@ulisesjorge 1 year ago ⁺²
Thanks, this is extremely useful; I suppose that one can feed this algorithm a file so that it can read it and output a recording?
@engineerprompt 1 year ago ⁺¹
Yes, there is a text variable in the notebook. Assign the text to it and it will do the rest.
@akashshesh 1 year ago
@engineerprompt Can you elaborate on this? I am trying to have many sentences read, but it says it's too long.
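One way to get the contents of a file into the notebook's `text` variable is sketched below. It assumes a plain UTF-8 text file; `load_text` and the file name `script.txt` are made up for this example and are not part of the notebook.

```python
from pathlib import Path


def load_text(path: str) -> str:
    """Read a UTF-8 text file and collapse line breaks into single spaces."""
    raw = Path(path).read_text(encoding="utf-8")
    return " ".join(raw.split())


# Hypothetical usage in the notebook, after uploading "script.txt":
# text = load_text("script.txt")
```

This only loads the text. If the result is too long for a single generation, it would still need to be split into shorter pieces first.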
@aka70222 3 days ago
Can I use this for languages other than English?
@Nethrex 1 year ago
I haven't experimented with this yet, but great video!!
Do you think it's possible to run locally and use it for a personal/local assistant on a PC?
Also, is there a way to get it running and working even without internet (so completely local)?
@kevinehsani3358 1 year ago
Thanks for the video. Have you tried Bark? I'm looking for a voice cloning model that I can train longer locally for better results. Thanks again.
@RamonValdez2014 1 year ago ⁺¹
I get this error after running the "generate speech" cell: "NameError: name 'text' is not defined". Anyone???
@ecstasycheese7390 1 year ago
Run the "# This is the text that will be spoken." cell first before going down to the "# Generate speech with the custotm voice." cell.
@RamonValdez2014 1 year ago ⁺¹
@ecstasycheese7390 Spot on, thanks a lot! I tried to install Bark on my PC last weekend but I got stuck on some dependency that just won't work. Gotta stick to Colab for the time being!
@AMDSTT 1 year ago ⁺¹
Thank you, I will try it and then tell you.
@saifshaikh8828 1 year ago
Hey brother, is it safe to install on my PC? My Windows is showing a harmful and dangerous risk warning.
@AMDSTT 1 year ago
@saifshaikh8828 I didn't test it
@didiervandendaele4036 1 year ago ⁺¹
Great feature find! But is it possible to use this cloned voice to speak in another language with the same accent? 😮
@engineerprompt 1 year ago
Check out the latest video on the topic.
@naze8793 1 year ago
Hello. I tried mine, but it doesn't speak my text, only the default text that comes in the Colab. Any fix, please?
@tiagolourenco7158 1 year ago
Hi, when I run the second code block I don't have the option to upload my files, and it shows "fileexistserror". I believe it is something basic, but I don't know what to do. Thank you
@r_pydatascience 1 year ago
I wanted to try this. Alas, it is taking months to upload the audio files. Then I upgraded my Colab to the Pro version. No success.
@ss-np9gx 1 year ago
How do I fix this "unterminated string literal (detected at line 4)" error when I write my text?
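One common cause of this error is pasting text that contains line breaks between ordinary quotes. A minimal sketch of a workaround using a triple-quoted string follows; the sentence is the notebook's default text quoted elsewhere in this thread, and the rest is illustrative.

```python
# A string opened with one pair of quotes must close on the same line,
# so multi-line text pasted between "..." is reported as unterminated.
# Triple quotes allow the text to span several lines.
text = """Thanks for reading this article.
I hope you learned something."""

# Optionally collapse the line breaks so the model receives one paragraph.
text = " ".join(text.split())
print(text)
```

If the pasted text itself contains three double quotes in a row, this approach would not work as written.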
@jorgemarz 1 year ago ⁺¹
Thank you! Do you think this could work if I use another language? Or can you upload other language models or something?
@engineerprompt 1 year ago ⁺³
This specific one only supports English, but check out github.com/coqui-ai/TTS for multi-language support. Hope this helps.
@jorgemarz 1 year ago ⁺¹
@engineerprompt Thanks!!!
@redlinrangerstudio5331 1 year ago
I keep getting "name 'load_voice' is not defined".
@pacovazquez665 1 year ago
I'm having trouble installing Tortoise. Can anyone here point me to a place where I can find help?
@tendaimurevanhema1166 1 year ago
I'm getting "Maximum call stack size exceeded." on the second-to-last cell.
@shanesteven4578 1 year ago ⁺²
Worked great until I used up the day's GPU allocation! Nice work, thanks for the effort and video.
@engineerprompt 1 year ago
Glad it was useful :)
@vyqh 1 year ago ⁺¹
Great product showcase. Can I use this to generate 1,000 words of text-to-speech in one go?
@engineerprompt 1 year ago ⁺³
With local installation, probably yes.
@grim789 1 year ago ⁺¹
@Prompt Engineering Do you have a video on installing this locally? I'm struggling to get it set up.
@lkbanztheman 1 year ago
Bro, it doesn't work because my RAM usage is too high, and it doesn't let me do it again after my first try on any of my Google accounts :(
@gaetanomegna4436 1 year ago
NameError: name 'load_voice' is not defined
How can I fix it?
@wnrandom98 1 year ago
Great tutorial, thank you
@engineerprompt 1 year ago
Thank you.
@1000trilliondollars 1 year ago ⁺¹
Does this model work well with languages other than English, such as Japanese, Chinese, or Vietnamese?
@engineerprompt 1 year ago ⁺³
This model can only generate English but you can retrain the whole model for any other language. Here is the link with steps: github.com/neonbjb/tortoise-tts/issues/5#issuecomment-1112705908
@gisonnisylvio818 1 year ago
Does it work only in English?
If I want to make it better in a certain language, do I only need to add more and more samples?
@engineerprompt 1 year ago
This one is limited to English.
@UnderratedKitchen 1 year ago ⁺¹
Does this work with other languages?
@NewImperivm 1 year ago
No
@N380N 1 year ago
Thanks for the video, I found it really helpful. I played with it and created a synthetic voice using a Spanish-speaking person as the voice model, but my results were not as good as yours... is it me, or does the model mainly work best with English?
@engineerprompt 1 year ago
This specific model is tailored towards the English language.
@N380N 1 year ago
@engineerprompt Small question: can you fine-tune the model with the files you give it across different runs?
@karonwhitehead2383 1 year ago
Can you do this on a phone?
@edwardecl 1 year ago
7:33 - Best bit
@cybergigafactory 1 year ago
Is there a way to use it on an iPad Pro?
@thegodsbrand 1 year ago
Do I have to re-upload the audio every time I use it?
@InquisitorGeneral 1 year ago
Is it still necessary to chop audio up into 10-second segments at a 22 kHz sample rate? I have many audio samples from 10 minutes to 45 minutes, all at 48 kHz. Would these not work at all, or would they cause some problem?
@engineerprompt 1 year ago
They will work, but you will need good hardware to run it though!
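For anyone who would still rather cut long recordings into short clips, here is a minimal sketch using only Python's standard `wave` module. It assumes uncompressed PCM WAV input and keeps the source sample rate, so it does not convert 48 kHz to 22 kHz; `split_wav` and the output naming scheme are made up for this example.

```python
import wave
from pathlib import Path


def split_wav(src: str, out_dir: str, seconds: float = 10.0) -> list:
    """Cut a PCM WAV file into consecutive clips of at most `seconds` each."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    with wave.open(src, "rb") as reader:
        params = reader.getparams()
        frames_per_clip = int(params.framerate * seconds)
        index = 0
        while True:
            frames = reader.readframes(frames_per_clip)
            if not frames:
                break
            clip_path = out / f"{Path(src).stem}_{index:03d}.wav"
            with wave.open(str(clip_path), "wb") as writer:
                # Same channels, sample width, and sample rate as the source.
                writer.setparams(params)
                writer.writeframes(frames)
            written.append(clip_path)
            index += 1
    return written
```

The last clip is simply whatever remains, so it can be shorter than the others. Changing the sample rate would need a separate audio tool or library, which is outside this sketch.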
@elizalapteva 1 year ago ⁺²
Hey guys. I'm just wondering: if you were able to download your voice from there and sing any note like Whitney Houston, would you use it?
I mean, isn't it cool to record a love song for somebody, but with your own song?
@planetgamecommunity817 1 year ago
easy
@Araujo_gabbriel 1 year ago ⁺¹
Hi. Is it possible to do the cloning in Portuguese in the area you showed in the tutorial? Or would I have to use a Brazilian's area to be able to do this?
@engineerprompt 1 year ago ⁺²
This specific model works only with English. There are some other models that I can explore.
@SokratesStudios 1 year ago
@engineerprompt That would be fantastic.
I just wonder, is there any voice model generator that works with tones, regardless of the language that's spoken??
@manoelvitor-dev 9 months ago
How do I translate from Portuguese with this?
@amanisebele4073 1 year ago
I keep getting an error every time I try to run it.
@NowOrNeverAI 1 year ago
Is there a limit on how many words can be spoken?
@shailendrarathore445 1 year ago
Make a video on cloning a specific person's voice for the Hindi language using Google Colab.
@黄毅宁-o2w 1 year ago
Are other languages supported?
@oussamael7304 1 year ago
Can I clone another language and make someone speak it?
@jayr7741 1 year ago
How long a passage can the cloned voice handle? I want to create a long passage in my voice; is that possible with it? A passage of at least 5,000 words.
@engineerprompt 1 year ago
You will probably have to divide the passage into different parts and then feed that into the model. That's the best way to do it.
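Dividing a passage into parts could look something like the sketch below. It groups whole sentences into chunks under a character budget; `split_into_chunks` and the 300-character default are assumptions for illustration, not values taken from Tortoise or the notebook.

```python
import re


def split_into_chunks(passage: str, max_chars: int = 300) -> list:
    """Group whole sentences into chunks of at most `max_chars` characters.

    A single sentence longer than `max_chars` is kept as its own chunk.
    """
    sentences = re.split(r"(?<=[.!?])\s+", passage.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # The next sentence would overflow the budget: close this chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each chunk would then be assigned to the notebook's `text` variable and generated one at a time, with the resulting audio clips joined afterwards. The generation call is left out here because it depends on the notebook version.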
@shledoncooper 1 year ago
Can we use different languages?
@azab14 1 year ago
Does it work with other languages such as Arabic?
@Paulinhox88 1 year ago
I get this when I try to upload my audio files: "MessageError: RangeError: Maximum call stack size exceeded."
Any ideas how to solve this?
@engineerprompt 1 year ago ⁺¹
I haven't faced this, but make sure you have enough space on your Google Drive and a stable internet connection.
@ryusuikagaku 1 year ago
Can it clone voices in languages other than English?
@engineerprompt 1 year ago
Not with this version. There are other packages that can do it.
@nanigh2913 1 year ago
Does it support any language, like Kannada, or only English?
@engineerprompt 1 year ago
This specific one only supports English.
@asterinycht5438 1 year ago
How do I train it on another language?
@gerriebullington 1 year ago
The only problem is that the software doesn't pick up your accent.
@SyntheticVoices 1 year ago
Tortoise-TTS can also be fine-tuned via a fork.
@engineerprompt 1 year ago
Would love to have a look at it, any resources you recommend?
@SyntheticVoices 1 year ago
@engineerprompt I have put a link to MRQ's repo in the description of my latest vids.
@engineerprompt 1 year ago ⁺¹
@SyntheticVoices Thanks, I will check it out!
@UnderratedKitchen 1 year ago
Bro, I am getting an error.
@jamrhxh 1 year ago
How can I improve Spanish?
@GfLast 1 year ago
Bro, I found a problem at "# Imports used through the rest of the notebook."
@sunburnfm 1 year ago
Sounds exactly like you.
@engineerprompt 1 year ago
Thanks, others have pointed out the same. Seems like I don't recognize my own voice 😉
@privatesoft5006 1 year ago
Does it support Arabic voices?
@vancreo 1 year ago
Please, what is the Python version? I keep getting a scipy installation error.
@engineerprompt 1 year ago
Python 3.9.16 (in Google Colab)
@vancreo 1 year ago
@engineerprompt Thank you very much
@AntoniusTertius 1 year ago
@engineerprompt How do I install that version if there's no installer for it???? I can't install the latest Python because Tortoise doesn't work with it, right?
@boulimermoz9111 1 year ago
Great, great AI, thank you very much. Do you know if it works for other languages? French?
@engineerprompt 1 year ago
I think currently it works for English only but if you can collect data, you can retrain the model on other languages.
@mustafak.farouk1071 1 year ago
Why does it have to be 10-second segments?
@engineerprompt 1 year ago
You can provide longer segments as well; it's just about the compute resources.
@harsh2624 1 year ago
This is so COOOOL
@engineerprompt 1 year ago
Thanks :)
@RichardBonn 1 year ago ⁺¹
Cool!
@desstuctorr5263 1 year ago
Hi guys, does anyone know if an AI like 11labs or anything else exists, but in French? I'm French and I'm really looking for that, but it seems impossible to find.
@engineerprompt 1 year ago
Check the next video :)
@Aaliyashi 1 year ago
@desstuctorr5263 11Labs does have a model now that supports a few languages other than English. French is one of them, so it should be pretty straightforward :) Just switch the model from "Eleven Monolingual v1" to "Eleven Multilingual v1" when generating your voice lines.
@desstuctorr5263 1 year ago
@Aaliyashi Yeah, I already tried it.
The voice cloning is OK, but it produces a Canadian accent that is pretty annoying.
@Aaliyashi 1 year ago
@desstuctorr5263 Oh I see, that's a shame. At least it seems like it's something they're working on.
@desstuctorr5263 1 year ago
@Aaliyashi Yep.
And it's only an experimental version after all!
@oxanaivanova8007 1 year ago
Google Colab now sucks because you have to pay; it will only let you generate 1-4 voices and then you're done. This is so frustrating, I will just do story reading without it.
@tharakamalli4366 1 year ago
Can videos that use text-to-speech be monetized?
@engineerprompt 1 year ago
Why not? Watch this video to learn about Google's policy: ua-cam.com/video/VjphDyQhlW8/v-deo.html
@giooooo3522 1 year ago
NameError: name 'text' is not defined. I followed you in every step. :(
@engineerprompt 1 year ago
Make sure you run the block containing this code:
# This is the text that will be spoken.
text = "Thanks for reading this article. I hope you learned something."
Seems like it didn't run that part.
@GreenHatAnimation 1 year ago
@engineerprompt I was having the same problem and this is the solution.
@ValicsLehel 1 year ago
Can it be trained on a language other than English?
@engineerprompt 1 year ago ⁺³
This model can only generate English but you can retrain the whole model for any other language. Here is the link with steps: github.com/neonbjb/tortoise-tts/issues/5#issuecomment-1112705908
@ValicsLehel 1 year ago
@engineerprompt That is not easy at all :-)
@engineerprompt 1 year ago ⁺¹
@ValicsLehel That's true. Check out this repo; it seems to have multilingual support. I haven't really looked into it closely, but it's probably worth checking out: github.com/coqui-ai/TTS
@ValicsLehel 1 year ago
@engineerprompt I will take a look. I want to find a solution to generate my voice (or actors' voices), not the default one. ElevenLabs is OK, but it cannot learn Romanian, for example. Just English.
