Python 3.13's new REPL is AMAZING

Free Speech: Reviewing Coqui-ai, Mycroft Mimic3 and Tortoise TTS Libraries

Local Low Latency Speech to Speech - Mistral 7B + OpenVoice / Whisper | Open Source AI

Running With Bigger And Bigger Lunchlys

ТУК ТУК ТУК репетиція 😍 Хочете чути цю пісню на концертах?

ЗАГС. 1 СЕРИЯ. Мелодрама

Python Local Text To Speech Coqui TTS | Generate Audio From Text Using Python

Hussain Mustafa

Переглядів 8 042

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 28 вер 2024
💼 Book a meeting: cutt.ly/Pegxp5rA
In this video we will build a python script that will allow us to generate speech from text locally on our system using the coqui TTS package for python. We will take a look at working with the Coqui TTS package coupled with gradio to create a web interface through which the user can upload there text and generate speech from. The concepts covered will help you understand the fundamentals of working with text to speech systems such as Coqui locally on your system, setting up and configuring a python environment, and using gradio to build a web interface to interact with your Python scripts. This is an excellent guide for beginner Python/ML developers, or anyone looking to learn about text to speech (TTS) systems and build them using Python.
Resources:
Source Code: cutt.ly/Ner6ffaE
Gradio: www.gradio.app...
Coqui TTS: github.com/coq...
Socials:
Website: hussainmustafa...
Github: github.com/hus...
LinkedIn: / hussain-mustafa-960920184
Twitter: / hussain34274892
Buy Me A Coffee: www.buymeacoff...
#python #learnpython #tts #machinelearning #artificialintelligence
Наука та технологія

КОМЕНТАРІ • 39

@davidtindell950 10 днів тому ⁺²
NEW Subscriber: Thank You. Just what I was searching for ... It would be "NICE" if Coqui TTS would install under Python 3.12.5. We hope that 'they' will maintain and update it !?!?!
@MHM-jy4uj 4 місяці тому ⁺³
How does Coqui TTS compare to other TTS libraries you've used?
@corpsSaint 9 днів тому
Is there not a more realistic voice?
@MadhavaraoPanidepu Місяць тому ⁺¹
Awesome tutorial. I wish I could create multiple audio files from a longer text (from a text file), with each audio file corresponding to a separate paragraph.
@m_hussain_mustafa Місяць тому
That would be cool!
@mmajr Місяць тому ⁺¹
Good job! How do you tune the speech speed?
@sandeeps3108 3 місяці тому ⁺¹
Bro can you make a project for voice cloning
@m_hussain_mustafa 3 місяці тому ⁺³
Hi, will try to make a tutorial on that.
@MrIMacro 4 місяці тому ⁺¹
Amazing
@m_hussain_mustafa 4 місяці тому
Thank you! Cheers!
@moneyman-ne9lw 4 місяці тому ⁺⁴
Coqui TTS setup was a breeze thanks to your step-by-step guide. 😊
@m_hussain_mustafa 4 місяці тому ⁺²
Glad it helped!
@preneure 4 місяці тому ⁺²
Can you show how to integrate this with a web application? That would be super helpful!
@ridabrahim7604 4 місяці тому
That shouldn't be a problem, you will do the same thing by sending the text from the front end and process it in the backend and deliver it again(as an audio) to the user, use flask for python to do this
@rlt_app 4 місяці тому ⁺³
You always manage to make complex topics easy to understand.
@m_hussain_mustafa 4 місяці тому ⁺¹
Thats the goal haha :)
@edgarl.mardal8256 4 місяці тому ⁺²
Very bad voice output, could you show how to train the modell so it actually sounds like a human?
@m_hussain_mustafa 4 місяці тому ⁺²
Hi, soon I'll be releasing a tutorial featuring another model that will allow to create much more human like audio, in the mean time you can play around with using other models than the one I have shown in the video, training a model will be quite resource intensive.
@edgarl.mardal8256 4 місяці тому
@@m_hussain_mustafa cool, i suggest using appolio,
@Insidestoryland Місяць тому
yes thanks for sharing. i need also taring video of modell.
@RonyHassan47 4 місяці тому ⁺²
Great one. I will forget about eleven labs
@m_hussain_mustafa 4 місяці тому
Thank you :)
@mohsenghafari7652 3 місяці тому ⁺¹
hi
coquiAI library support Persian language ?
thanks
@m_hussain_mustafa 3 місяці тому
Hi, I'd recommend checking the documentation.
@StormixDZN 4 місяці тому ⁺¹
Does it work on cpu only if I don’t use model training but just tts?
@m_hussain_mustafa 4 місяці тому
Yes it does.
@StormixDZN 4 місяці тому
@@m_hussain_mustafa thx bc I have an amd gpu and I can’t use training sadly
@RaezekenOG Місяць тому ⁺¹
Nice tutorial man! Great job!
@m_hussain_mustafa Місяць тому
Thank you. :)
@DigitalGus75 2 місяці тому ⁺¹
Except is sound like last decades speech synthesis.
@m_hussain_mustafa 2 місяці тому
Yes this is definitely a draw back. However, I'm planning on releasing another video where thr speech synthesis sounds much better.
@DigitalGus75 2 місяці тому
@@m_hussain_mustafa bark is pretty good sounding offline transcription. Not sure it is still supported, but it is still available
@shubhampadekar2590 3 місяці тому
Hi loved the content
May I know how to pass speaker index while using multilingual model while using TTS method
@ridabrahim7604 4 місяці тому ⁺¹
Great one as usual
@m_hussain_mustafa 4 місяці тому
Thank you 😊
@JoeMamaJunk Місяць тому
Great video!
@m_hussain_mustafa Місяць тому ⁺¹
Glad you enjoyed it

Наступне

Автоматичне відтворення

Python 3.13's new REPL is AMAZING

Python 3.13's new REPL is AMAZING

Free Speech: Reviewing Coqui-ai, Mycroft Mimic3 and Tortoise TTS Libraries

Free Speech: Reviewing Coqui-ai, Mycroft Mimic3 and Tortoise TTS Libraries

Local Low Latency Speech to Speech - Mistral 7B + OpenVoice / Whisper | Open Source AI

Local Low Latency Speech to Speech - Mistral 7B + OpenVoice / Whisper | Open Source AI

Running With Bigger And Bigger Lunchlys

Running With Bigger And Bigger Lunchlys

ТУК ТУК ТУК репетиція 😍 Хочете чути цю пісню на концертах?

ТУК ТУК ТУК репетиція 😍 Хочете чути цю пісню на концертах?

ЗАГС. 1 СЕРИЯ. Мелодрама

ЗАГС. 1 СЕРИЯ. Мелодрама

Продажный бой? Боксёр испугался? Нет! Всё гораздо сложней... #shorts

Продажный бой? Боксёр испугался? Нет! Всё гораздо сложней... #shorts

The Best React Code I Wrote (Code Review)

The Best React Code I Wrote (Code Review)

Running a local Piper TTS server with Python on Linux

Running a local Piper TTS server with Python on Linux

Local voice cloning with 6 seconds audio | Coqui XTTS on Windows

Local voice cloning with 6 seconds audio | Coqui XTTS on Windows

Why Inkscape writes everything twice

Why Inkscape writes everything twice

Best AI Voice Generator | 2024.08

Best AI Voice Generator | 2024.08

Do this to Land a Flutter developer Job Today!

Do this to Land a Flutter developer Job Today!

"The Life & Death of htmx" by Alexander Petros at Big Sky Dev Con 2024

"The Life & Death of htmx" by Alexander Petros at Big Sky Dev Con 2024

HTML course for beginners - Learn HTML in 1 hour

HTML course for beginners - Learn HTML in 1 hour

Make an Offline GPT Voice Assistant in Python

Make an Offline GPT Voice Assistant in Python

impressora digital Quer o link desse produto?Pegue na bio ou Comente “eu quero”

impressora digital Quer o link desse produto?Pegue na bio ou Comente “eu quero”

Этот чехол НЕ ЗАЩИТИТ твой телефон #shorts #шортс #смартфон #факты #чехол

Этот чехол НЕ ЗАЩИТИТ твой телефон #shorts #шортс #смартфон #факты #чехол

REDMI NOTE 14 УЖЕ ЗДЕСЬ. Xiaomi сделали невозможное…

REDMI NOTE 14 УЖЕ ЗДЕСЬ. Xiaomi сделали невозможное…

Evolution of PhoneVision

Evolution of PhoneVision

Look at my 3D stereo projector, with a built-in sound response card, to experience big-screen viewin

Look at my 3D stereo projector, with a built-in sound response card, to experience big-screen viewin

Apple Event - September 9

Apple Event - September 9

#major #airdrop #telegram #web3 #listing #crypto

#major #airdrop #telegram #web3 #listing #crypto