Fine-tune Text-to-Speech Models for any Language: Introduction to TTS

  • Published 2 Nov 2024

COMMENTS • 59

  • @abdelrahmanmohsen6393 1 month ago +2

    I tried this on an Arabic dataset and it didn't work. I increased the steps to 5000 and it still didn't work. Any advice?

    • @emirhanbilgicai 1 month ago +2

      Hello Abdelrahman. Can you share the code and the dataset with me? I suspect the alphabet is the problem here. We need to define a function that converts the text into the English alphabet.

    • @emirhanbilgicai 1 month ago +2

      We solved the problem with Abdelrahman. Indeed, if you're working with a language whose alphabet differs from the English one, you should convert the text to English (Latin) letters.
      Example:
      convert
      السلام عليكم
      to
      alsalam alekum.

    • @abdelrahmanmohsen6393 1 month ago +1

      Yes, the problem is that the model's tokenizer can't understand anything other than English letters. So the training data should be "audio + text converted to English letters" (transliterated, not translated). It should work fine after that.
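
A minimal sketch of the kind of conversion function discussed in this thread, assuming a simple character-by-character mapping. The table `ARABIC_TO_LATIN` and the function `romanize` are hypothetical names made up for illustration; the mapping is simplified, is not a standard romanization scheme, and is not the one the commenters used.

```python
# Hypothetical, simplified Arabic-to-Latin table for illustration only.
# It is NOT a standard romanization scheme.
ARABIC_TO_LATIN = {
    "ا": "a", "أ": "a", "إ": "i", "آ": "aa", "ب": "b", "ت": "t",
    "ث": "th", "ج": "j", "ح": "h", "خ": "kh", "د": "d", "ذ": "dh",
    "ر": "r", "ز": "z", "س": "s", "ش": "sh", "ص": "s", "ض": "d",
    "ط": "t", "ظ": "z", "ع": "a", "غ": "gh", "ف": "f", "ق": "q",
    "ك": "k", "ل": "l", "م": "m", "ن": "n", "ه": "h", "و": "w",
    "ي": "y", "ى": "a", "ة": "a", "ء": "'", "ؤ": "w", "ئ": "y",
    # Short-vowel diacritics, written as escapes because they are combining marks.
    "\u064e": "a",  # fatha
    "\u064f": "u",  # damma
    "\u0650": "i",  # kasra
    "\u0652": "",   # sukun (no vowel)
}


def romanize(text: str) -> str:
    """Replace each mapped Arabic character; leave all other characters unchanged."""
    return "".join(ARABIC_TO_LATIN.get(ch, ch) for ch in text)


# Transliterate a transcript before pairing it with its audio clip.
print(romanize("السلام عليكم"))
```

Arabic text is usually written without short vowels, so with this table unvowelled input comes out as a consonant skeleton (for example, "slam" for سلام), unlike the hand-written "alsalam alekum" in the example above. Transcripts with diacritics, or a dedicated romanization tool, would likely give the model more to work with.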

    • @funny_animals_world28 18 days ago

      @abdelrahmanmohsen6393 did you find a solution, bro?

    • @shahad54-g3p 12 days ago

      I am working on the same problem. I already converted the text from the Arabic alphabet to English letters, but the output is not clear at all. Are there any suggestions for solving this issue?

  • @ysancaktutan 6 days ago +1

    Solid explanations, learned a lot! Thanks!

  • @Hatice-g4k 1 month ago +4

    Congratulations, your work will be a light for those who aim to make progress in this direction. I wish you continued success. May your path be clear, Emirhan.

  • @kralx-t3g 1 month ago +5

    The great Turkish robot from Mardin teaches us how to fine-tune itself. AI is really something else.

  • @huseyinboragurer6479 2 months ago +3

    Thanks to YouTube I've seen this video. I hope Mr. Bilgiç will bless us with new videos.

  • @carlossunga2271 2 months ago +2

    This video is helpful for people who want to understand text-to-speech (TTS) and how to improve it. Emirhan, who made the video, knows a lot about it, and the part where he shows how to write the code is useful even though I don't know much about coding.

  • @ahmeterdonmez9195 9 days ago +2

    Very successful. I subscribed to your channel. You deserve more followers, but for that I think you need to produce a little more content. 💪 Congratulations....

  • @ufukekingen8395 1 month ago +1

    Congratulations, Emirhan. I wish you continued success 😊

  • @zaursamedov8906 1 month ago +1

    Looking solid! Congrats Emirhan.

  • @yunuskaan0 2 months ago +1

    Thank you for the great explanation!❤️💯

  • @okanaslan766 2 months ago +2

    Congratulations, my friend. It's a very clean and clear video 💯

    • @emirhanbilgic2475 2 months ago

      Thanks for your support :) I can also record something more detailed if there is interest.

    • @emirhanbilgicai 2 months ago

      Thank you for your support :) I can record something more detailed if there is interest.

  • @naszoom 1 month ago +2

    Congrats brother 👏

  • @__________________________6910 1 month ago +1

    Great, make more videos on TTS, voice cloning, multilingual TTS

  • @thecloudrazor 1 month ago +1

    Great work, it turned out super. I wish you success 🤝

  • @nezirbilgic9652 1 month ago +2

    Congratulations, my son, it turned out very nice ❤

  • @figen1152 1 month ago +1

    Best wishes. I wish you continued success.

  • @AynurGerekan 1 month ago +1

    I wish you continued and endless success.

  • @AT-ww2hi 2 months ago +3

    best indian youtuber so far ✋🏻 no cap 🧢

    • @emirhanbilgicai 1 month ago +3

      Thank you but I am not Indian 😄

    • @rumeysa538 1 month ago

      This isn't my field; I saw it on Twitter and thought I'd take a look, and this comment cracked me up 😂 @emirhanbilgicai

  • @AkhandPratapSingh722 25 days ago +1

    Hi Emirhan, I am one of your new viewers. I have recently started learning machine learning, and now I have to fine-tune a TTS model for interview-related technical words like OAuth, API, etc. Can you help me with it, or can we connect personally? This project is really important to me.

    • @emirhanbilgicai 20 days ago

      Hey! I can give you some tips if you share the details

  • @nezmete1760 2 months ago +2

    Best wishes. Congratulations.

  • @letsdigin9647 19 days ago +1

    Hi, is it possible to train the model in English on only certain words that it's currently pronouncing incorrectly?

    • @emirhanbilgicai 17 days ago +1

      Hello, if you mean abbreviations or similar cases, you can define a custom function to handle them, like this:

      import re
      from string import punctuation

      # number_normalizer is not defined in this snippet; it comes from the linked app.py.
      def preprocess(text):
          text = number_normalizer(text).strip()
          text = text.replace("-", " ")
          # Make sure the text ends with a punctuation mark.
          if text[-1] not in punctuation:
              text = f"{text}."
          # All-caps abbreviations, optionally containing dots.
          abbreviations_pattern = r'\b[A-Z][A-Z\.]+\b'

          def separate_abb(chunk):
              # Drop the dots and put a space between the letters.
              chunk = chunk.replace(".", "")
              print(chunk)
              return " ".join(chunk)

          abbreviations = re.findall(abbreviations_pattern, text)
          for abv in abbreviations:
              if abv in text:
                  text = text.replace(abv, separate_abb(abv))
          return text

      I took it from: huggingface.co/spaces/parler-tts/parler_tts/blob/main/app.py
      Even if you don't use an additional function, you can get the same result by giving the model enough samples (more than a thousand).
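
To see what the abbreviation pattern in the reply above actually matches, here is a small standalone sketch. It reuses the same regular expression but applies it with `re.sub` and a callback instead of `findall` plus `str.replace`; the function name `spell_out_abbreviations` is a hypothetical one chosen for this example, and number normalization is left out.

```python
import re

# Same pattern as in the reply above: an uppercase letter followed by one or
# more uppercase letters or dots, bounded by word boundaries.
ABBREVIATION_PATTERN = re.compile(r"\b[A-Z][A-Z\.]+\b")


def spell_out_abbreviations(text: str) -> str:
    """Insert spaces between the letters of all-caps abbreviations."""

    def separate(match: re.Match) -> str:
        # Drop any dots, then join the remaining letters with spaces.
        letters = match.group(0).replace(".", "")
        return " ".join(letters)

    return ABBREVIATION_PATTERN.sub(separate, text)


print(spell_out_abbreviations("Use OAuth with the API."))
# prints: Use OAuth with the A P I.
```

Note that the pattern only catches all-caps tokens: "API" is spelled out, but a mixed-case term such as "OAuth" passes through unchanged, so terms like that would need their own rule or enough training samples.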

  • @emirhanbilgicai 2 months ago +5

    For contact and everything: emirhanbilgic.github.io

  • @og_23yg54 17 days ago +1

    Hello, my model is generating speech, but it's only producing about two words and cutting off after approximately 0.1 seconds. Do you have any advice or help? Is there a Discord where I can reach you?

    • @emirhanbilgicai 17 days ago +2

      Hello, this could be due to three reasons:
      1. Your individual data samples are short, for example only two words per sample, which makes it difficult for the model to learn longer sequences.
      2. Your dataset is small, for example only 300 sentences. I recommend increasing the size of your dataset.
      3. The model hasn't been trained enough, or you may need to experiment with different hyperparameters.
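
The first two causes listed above (short samples, small dataset) can be checked before retraining. The following is a minimal sketch that assumes the clips are uncompressed PCM WAV files sitting directly in one folder; the function names and the folder layout are hypothetical, and other audio formats would need a different reader.

```python
import statistics
import wave
from pathlib import Path


def clip_duration_seconds(path) -> float:
    """Duration of one PCM WAV file: frame count divided by sample rate."""
    with wave.open(str(path), "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def summarize_durations(durations):
    """Return clip count plus total, min, median and max duration in seconds."""
    if not durations:
        return {"clips": 0, "total_s": 0.0}
    return {
        "clips": len(durations),
        "total_s": sum(durations),
        "min_s": min(durations),
        "median_s": statistics.median(durations),
        "max_s": max(durations),
    }


def summarize_folder(folder):
    """Summarize every *.wav file directly inside `folder` (assumed layout)."""
    paths = sorted(Path(folder).glob("*.wav"))
    return summarize_durations([clip_duration_seconds(p) for p in paths])
```

Calling `summarize_folder` on the training folder gives a quick view of how many clips there are and how long they run, which makes it easier to tell whether very short clips or a small dataset is a plausible cause.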

    • @og_23yg54 17 days ago

      @emirhanbilgicai my audio clips are about 2-10 s long

    • @og_23yg54 15 days ago +1

      @emirhanbilgicai That was true. If I fine-tune with audio clips of 20 minutes each, will it produce 10-20 minute long audio?

    • @emirhanbilgicai 14 days ago

      @og_23yg54 Yes, but it would take ages to train a model on 20-minute-long samples (given a sufficient number of samples).

  • @Axecubic 2 months ago +3

    🧑‍💻💯

  • @nicoc6387 1 month ago +3

    That AI version of Harry Potter is pretty convincing.

  • @nysman 1 month ago +1

    At least add Turkish subtitles, haha