Python Local Text To Speech Coqui TTS | Generate Audio From Text Using Python

  • Published 28 Sep 2024
  • 💼 Book a meeting: cutt.ly/Pegxp5rA
    In this video we will build a Python script that generates speech from text locally using the Coqui TTS package for Python. We will pair Coqui TTS with Gradio to create a web interface through which the user can submit their text and generate speech from it. The concepts covered will help you understand the fundamentals of running text-to-speech systems such as Coqui locally on your system, setting up and configuring a Python environment, and using Gradio to build a web interface for your Python scripts. This is an excellent guide for beginner Python/ML developers, or anyone looking to learn about text-to-speech (TTS) systems and build them using Python.
    Resources:
    Source Code: cutt.ly/Ner6ffaE
    Gradio: www.gradio.app...
    Coqui TTS: github.com/coq...
    Socials:
    Website: hussainmustafa...
    Github: github.com/hus...
    LinkedIn: / hussain-mustafa-960920184
    Twitter: / hussain34274892
    Buy Me A Coffee: www.buymeacoff...
    #python #learnpython #tts #machinelearning #artificialintelligence
  • Science & Technology
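
The workflow described above (Coqui TTS for synthesis, Gradio for the web interface) can be sketched roughly as follows. This is a minimal sketch, not the video's actual source code: the model name, the helper names `make_synthesizer` and `build_demo`, and the output file name are assumptions made for illustration. The heavy imports are deferred so the file can be loaded before the packages are installed.

```python
import tempfile
from pathlib import Path

# Assumed model name; any entry from the Coqui model list could be used instead.
MODEL_NAME = "tts_models/en/ljspeech/tacotron2-DDC"


def make_synthesizer(tts):
    """Wrap a Coqui-style TTS object in a function that Gradio can call."""

    def synthesize(text: str) -> str:
        text = text.strip()
        if not text:
            raise ValueError("Please enter some text.")
        # Write each request to its own temporary directory.
        out_path = Path(tempfile.mkdtemp()) / "speech.wav"
        tts.tts_to_file(text=text, file_path=str(out_path))
        return str(out_path)

    return synthesize


def build_demo():
    """Load the model and build the web UI (hypothetical helper)."""
    # Imported here because both packages are slow to import.
    import gradio as gr
    from TTS.api import TTS

    tts = TTS(model_name=MODEL_NAME)
    return gr.Interface(
        fn=make_synthesizer(tts),
        inputs=gr.Textbox(lines=5, label="Text"),
        outputs=gr.Audio(type="filepath", label="Speech"),
        title="Local text to speech",
    )


# To start the app locally: build_demo().launch()
```

Passing the TTS object into `make_synthesizer` keeps the model loaded once for all requests, which matters because loading is usually much slower than synthesis.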

COMMENTS • 39

  • @davidtindell950 10 days ago +2

    NEW Subscriber: Thank You. Just what I was searching for ... It would be "NICE" if Coqui TTS would install under Python 3.12.5. We hope that 'they' will maintain and update it !?!?!

  • @MHM-jy4uj 4 months ago +3

    How does Coqui TTS compare to other TTS libraries you've used?

  • @corpsSaint 9 days ago

    Is there not a more realistic voice?

  • @MadhavaraoPanidepu 1 month ago +1

    Awesome tutorial. I wish I could create multiple audio files from a longer text (from a text file), with each audio file corresponding to a separate paragraph.
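
One way to get one audio file per paragraph, as a minimal sketch: split the text on blank lines and call `tts_to_file` once per paragraph. The function names and the `paragraph_001.wav` naming scheme are assumptions made for illustration; `tts` is expected to be an already loaded Coqui `TTS` object.

```python
import re
from pathlib import Path


def split_paragraphs(text: str) -> list[str]:
    """Split text on blank lines and collapse line breaks inside each paragraph."""
    chunks = re.split(r"\n\s*\n", text)
    return [" ".join(chunk.split()) for chunk in chunks if chunk.strip()]


def synthesize_paragraphs(text: str, out_dir: str, tts) -> list[Path]:
    """Write one WAV file per paragraph and return the paths in order."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for i, paragraph in enumerate(split_paragraphs(text), start=1):
        path = out / f"paragraph_{i:03d}.wav"
        tts.tts_to_file(text=paragraph, file_path=str(path))
        paths.append(path)
    return paths


# Example, assuming the TTS package is installed and book.txt exists:
#   from TTS.api import TTS
#   tts = TTS(model_name="tts_models/en/ljspeech/tacotron2-DDC")
#   synthesize_paragraphs(Path("book.txt").read_text(encoding="utf-8"), "audio", tts)
```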

  • @mmajr 1 month ago +1

    Good job! How do you tune the speech speed?

  • @sandeeps3108 3 months ago +1

    Bro can you make a project for voice cloning

  • @MrIMacro 4 months ago +1

    Amazing

  • @moneyman-ne9lw 4 months ago +4

    Coqui TTS setup was a breeze thanks to your step-by-step guide. 😊

  • @preneure 4 months ago +2

    Can you show how to integrate this with a web application? That would be super helpful!

    • @ridabrahim7604 4 months ago

      That shouldn't be a problem. You do the same thing: send the text from the front end, process it in the backend, and deliver it back to the user as audio. Use Flask for Python to do this.
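
The flow in this reply (front end sends text, backend synthesizes, audio goes back to the user) could look roughly like this in Flask. It is a sketch under stated assumptions: the `/speak` route, the `text` JSON field, the `MAX_CHARS` limit, and the helper names are all hypothetical, and `tts` is an already loaded Coqui `TTS` object.

```python
import tempfile
from pathlib import Path

MAX_CHARS = 2000  # assumed upper bound on request size


def extract_text(payload) -> str:
    """Return the cleaned 'text' field of a JSON payload, or raise ValueError."""
    if not isinstance(payload, dict):
        raise ValueError("expected a JSON object")
    text = payload.get("text")
    if not isinstance(text, str) or not text.strip():
        raise ValueError("'text' must be a non-empty string")
    text = text.strip()
    if len(text) > MAX_CHARS:
        raise ValueError(f"'text' is longer than {MAX_CHARS} characters")
    return text


def create_app(tts):
    """Build a Flask app that turns posted text into a WAV response."""
    # Imported here so the validation helper works without Flask installed.
    from flask import Flask, jsonify, request, send_file

    app = Flask(__name__)

    @app.post("/speak")
    def speak():
        try:
            text = extract_text(request.get_json(silent=True))
        except ValueError as err:
            return jsonify(error=str(err)), 400
        out_path = Path(tempfile.mkdtemp()) / "speech.wav"
        tts.tts_to_file(text=text, file_path=str(out_path))
        return send_file(out_path, mimetype="audio/wav")

    return app


# Example: create_app(tts).run(port=5000), then POST {"text": "Hello"} to /speak
```

Synthesis runs inside the request here, which is fine for a demo; a production app would likely move it to a background worker.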

  • @rlt_app 4 months ago +3

    You always manage to make complex topics easy to understand.

  • @edgarl.mardal8256 4 months ago +2

    Very bad voice output. Could you show how to train the model so it actually sounds like a human?

    • @m_hussain_mustafa 4 months ago +2

      Hi, soon I'll be releasing a tutorial featuring another model that produces much more human-like audio. In the meantime, you can experiment with models other than the one shown in the video. Training a model will be quite resource intensive.

    • @edgarl.mardal8256 4 months ago

      @m_hussain_mustafa Cool, I suggest using appolio.

    • @Insidestoryland 1 month ago

      Yes, thanks for sharing. I also need a video on training the model.

  • @RonyHassan47 4 months ago +2

    Great one. I will forget about eleven labs

  • @mohsenghafari7652 3 months ago +1

    Hi,
    does the Coqui AI library support the Persian language?
    Thanks

  • @StormixDZN 4 months ago +1

    Does it work on CPU only if I don't train a model and just use TTS?

    • @m_hussain_mustafa 4 months ago

      Yes it does.

    • @StormixDZN 4 months ago

      @m_hussain_mustafa Thanks, because I have an AMD GPU and sadly can't use training.

  • @RaezekenOG 1 month ago +1

    Nice tutorial man! Great job!

  • @DigitalGus75 2 months ago +1

    Except it sounds like last decade's speech synthesis.

    • @m_hussain_mustafa 2 months ago

      Yes, this is definitely a drawback. However, I'm planning on releasing another video where the speech synthesis sounds much better.

    • @DigitalGus75 2 months ago

      @m_hussain_mustafa Bark produces pretty good-sounding speech offline. Not sure it is still supported, but it is still available.

  • @shubhampadekar2590 3 months ago

    Hi, loved the content.
    May I know how to pass a speaker index when using a multilingual model with the TTS method?
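
A possible answer, as a hedged sketch: the Coqui Python API takes a speaker name rather than an index, and multi-speaker models expose their names through `tts.speakers`, so an index can be mapped to a name before calling `tts_to_file`. The helper `speak_with_speaker` is hypothetical, and whether `language` is needed depends on the model that is loaded.

```python
def speak_with_speaker(tts, text, file_path, speaker_idx=0, language=None):
    """Synthesize text with the speaker at speaker_idx and return that speaker's name."""
    speakers = getattr(tts, "speakers", None) or []
    if not speakers:
        raise ValueError("the loaded model does not expose any speakers")
    if not 0 <= speaker_idx < len(speakers):
        raise IndexError(f"speaker_idx must be between 0 and {len(speakers) - 1}")
    kwargs = {"text": text, "speaker": speakers[speaker_idx], "file_path": file_path}
    # Only multilingual models accept a language; pass it when one is given.
    if language is not None:
        kwargs["language"] = language
    tts.tts_to_file(**kwargs)
    return speakers[speaker_idx]


# Example, assuming a multi-speaker, multilingual model is loaded as `tts`:
#   speak_with_speaker(tts, "Hello world", "out.wav", speaker_idx=2, language=tts.languages[0])
```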

  • @ridabrahim7604 4 months ago +1

    Great one as usual

  • @JoeMamaJunk 1 month ago

    Great video!