BARK: Free Text to Speech & Voice Cloning

  • Published 1 Dec 2024

COMMENTS • 94

  • @geekyprogrammer4831 1 year ago +3

    Abhishek, I have been following your videos and tutorials for the last 2 years. Your content was and is gold!

    • @icanyagmur 1 year ago

      Hi bro, how did you make your YouTube profile photo? Can you guide me?

  • @HarendraSaiNathLella 1 year ago +4

    @abhishekkrthakur, at 12:28 you said to clone the Bark repo, but I could not find the exact repo you showed. Could you please provide the link?

    • @sabeerfaisal2619 9 months ago

      Did you find it?

    • @magictbjc7324 5 months ago

      @sabeerfaisal2619 Go to the Hugging Face model repo for Bark; there is a "clone the repo" command there.

  • @arun279 1 year ago +9

    Does the quality of the generations increase if you have longer or more samples?

  • @rushirajparmar9602 1 year ago +8

    UnpicklingError: invalid load key, '

    • @gasper_101 10 months ago +1

      I got the same issue, did you figure out how to fix it?

    • @tarangsuri8932 8 months ago +2

      I have figured it out, do you want to know...

    • @kunalkumar-rv3pd 7 months ago

      @tarangsuri8932 Yes, please.

    • @annahari610 7 months ago

      @tarangsuri8932 I want to know, bro. Help me solve this issue.

    • @mangeshkashid5389 5 months ago

      @tarangsuri8932 Just tell us already, bro... are you going to keep it a secret? 🤣

  • @abirahmedsohan3554 1 year ago +3

    I am struggling with this. I don't understand where the bark folder comes from.
    I saw that the Bark repo has no speaker embeddings. Can you please share the full code or the steps I can follow?

  • @acasualdatascientist54 1 year ago +1

    Thanks for the video, I was looking for this recently. I am too shy to talk in YouTube videos and was hoping to clone my voice like this for one.

  • @3Dwithdev 9 months ago +2

    Bro, please also put the links in the description.

  • @souvickdas5564 9 months ago +1

    I am having a problem with input context length. For example, given a research paper, I am trying to find relevant papers in a vector DB containing 2000 papers. How do I fit the entire research paper as the input? Is there any way to solve this? Also, the vector DB is huge. Is there any way to manage it efficiently?

  • @longfellowrose1013 7 months ago

    Where's your next video? Your channel always inspires me! Can't wait to watch your new video.

    • @abhishekkrthakur 7 months ago

      Thank you for your kind words. I've taken a break from making videos 🙂

    • @longfellowrose1013 7 months ago

      @abhishekkrthakur Oh, that's a pity! I still wish you all the best in life.

  • @annxiao7721 1 year ago

    Hi Abhishek, I really like your book, thank you so much for sharing your knowledge.

  • @kumarsantosh7376 8 months ago

    Hi Sir, a humble request: can you please share your journey to becoming a Kaggle Grandmaster and guide the juniors here? If you have already posted it somewhere, I would love a link to it. 😁

  • @CapitanMegaa 5 months ago

    I have a TTS script that reads text out loud, and it takes a while before I hear the audio after I start the code. Is there a way to make it faster? You seem to get results very fast. I have no coding experience, and yours is in a separate code file, while mine has to play the sound through a media player. Also, if the text is long, it reads only 14 seconds of it. It just takes so long. Is that normal?

  • @rushirajparmar9602 1 year ago

    Nice tutorial Abhishek!

  • @pranavnatekar4183 1 year ago

    Great video Abhishek. Can you possibly do a video on training a multitasking model in a computer vision setting? Would love to see that.

  • @azer0013 1 year ago +1

    Hello, thank you bro.
    Where is the bark folder?

  • @alexdelaiglesia1926 1 year ago

    Awesome. Video generation for the next one!

  • @PhilosophyResurrected 5 months ago

    OK, so I'm a bit new to all this, but can you tell me which repositories you used in your bark folder? The script is missing something and I'm not sure what. Thank you.

  • @Coursdecoutureorg 8 months ago

    sad you don't provide the full code c/C...

  • @gitc13 2 months ago +1

    Requesting new videos!!!

  • @rexsan2747 1 year ago

    A personal question: can you share your method for learning something new? I really don't have a method for learning the data industry.

  • @PhucHoang-ng4vh 7 months ago

    Hi, I just found out about your AAAML book but can't find its code repo. Could you please share it?

  • @JOHNSMITH-sj3lg 1 year ago

    I want to clone my voice in German, but it always comes out with an English pronunciation. How can I set the language to German?

  • @rachitgandhi7958 1 year ago +5

    magic_number = pickle_module.load(f, **pickle_load_args)
    _pickle.UnpicklingError: invalid load key, '

  • @ashuu9257 8 months ago

    Please mention the computing power required.

  • @rohmathur 8 months ago

    Hi Abhishek. Thanks for posting some interesting videos. I tried doing text-to-speech with Bark on a V100 GPU, and it is taking too long. I need a latency of less than a second. Can you recommend how I could achieve that?

  • @MotivationNation-f8b 1 year ago

    Great video, Abhishek. How can we develop our own text-to-speech model that would produce a 3-minute WAV file?

  • @allandclive 1 year ago +1

    How do you fine tune MMS-TTS models?

  • @nirsarkar 1 year ago

    Great stuff, as always! Thanks. Does Bark work on Apple silicon?

    • @nickiesnook 1 year ago +1

      Yes, you just have to change the device to cpu or mps.
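A minimal sketch of the device switch mentioned in the reply above, assuming PyTorch is the backend in use. The helper names `pick_device` and `detect_device` are hypothetical, made up for illustration, and not part of Bark or PyTorch; the video's own script may choose the device differently.

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Prefer CUDA, then Apple's MPS backend, then fall back to the CPU."""
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"


def detect_device() -> str:
    """Ask PyTorch which backends are usable; report "cpu" if torch is missing."""
    try:
        import torch
    except ImportError:
        return "cpu"
    # torch.backends.mps does not exist on older PyTorch releases.
    mps_backend = getattr(torch.backends, "mps", None)
    mps_available = mps_backend is not None and mps_backend.is_available()
    return pick_device(torch.cuda.is_available(), mps_available)


if __name__ == "__main__":
    # On Apple silicon this would typically print "mps" or "cpu".
    print(detect_device())
```

The returned string can be passed wherever a script currently hard-codes "cuda", for example `model.to(detect_device())`.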

  • @sarathkumar-gq8be 1 year ago +3

    At 12:25 you said to clone the repo, but I don't know which exact repo it is. Can you share the link? Downloading each file one by one is hard, especially since speaker_embedding contains multiple files.

  • @HarendraSaiNathLella 1 year ago

    Can someone tell me where the Bark repository is that was used and shown at 12:28?

  • @muhammadizhar82 1 year ago +1

    Can we generate long videos, like 5 to 10 min?

  • @mathieuduverne9261 1 year ago

    Is it possible to get the WAV sample you used for the voice cloning?

  • @csowm5je 1 year ago +2

    12:20 Clone which repository?

  • @xavAk 1 year ago

    You're amazing 🤩

  • @B.hummer 2 months ago

    If you could just find a way to make this whole coding process a copy-and-paste experience, it would just boom!

  • @danielalejandronavarroluna8374

    The echo in Hindi is really cool

  • @zaursamedov8906 1 year ago

    Is there anyone else who has a TTS problem? I did everything, but it doesn't seem to have the TTS module.

  • @suhaaskatikaneni1925 1 year ago

    nice video!

  • @lukasfili668 1 year ago

    AssertionError: Torch not compiled with CUDA enabled. Does someone know what this is?

    • @ashwinmlk4908 1 year ago

      Same error here as well. Did you get it fixed?

    • @monilsompura 9 months ago

      @ashwinmlk4908 Uninstall torch and reinstall it following the PyTorch documentation.

    • @CapitanMegaa 5 months ago

      @monilsompura H.O.W
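For the "Torch not compiled with CUDA enabled" error in the thread above, a rough diagnostic sketch may help decide whether a reinstall is needed. It assumes the error comes from a CPU-only PyTorch build being asked to use "cuda". The helper names `describe_torch_build` and `check_installed_torch` are hypothetical; `torch.version.cuda` and `torch.cuda.is_available()` are the real PyTorch attributes being read.

```python
from typing import Optional


def describe_torch_build(cuda_build: Optional[str], cuda_usable: bool) -> str:
    """Classify a PyTorch install from its CUDA build version and runtime check."""
    if cuda_build is None:
        # Typical for the CPU-only wheels: moving a model to "cuda" fails
        # with "Torch not compiled with CUDA enabled".
        return "cpu-only build: reinstall a CUDA build or use device='cpu'"
    if not cuda_usable:
        return "cuda build, but no usable GPU or driver was found"
    return "cuda ready"


def check_installed_torch() -> str:
    """Inspect the locally installed torch, if there is one."""
    try:
        import torch
    except ImportError:
        return "torch not installed"
    return describe_torch_build(torch.version.cuda, torch.cuda.is_available())


if __name__ == "__main__":
    print(check_installed_torch())
```

If this reports a CPU-only build, the install selector in the PyTorch documentation gives the command for a CUDA-enabled build on a given OS and CUDA version.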

  • @mind6861 9 months ago

    Great vid

  • @m.rr.c.1570 11 months ago

    Can I change the pitch and speed of the voice in Bark?

  • @talavalkov4008 1 year ago

    Came here through Varun Mayya.

  • @shouldibuythisgame 15 days ago

    Nice video

  • @AjayiJoseph-ph8xx 6 months ago

    Can we try doing this with a phone?

  • @Asli_ 1 year ago

    How are you able to play audio in VS Code?

    • @hutpfff1366 6 months ago

      You can open audio files in VS Code by opening the folder in VS Code; then you will see them.

  • @abhishekkrthakur 1 year ago +17

    Please subscribe to help me keep motivated to make awesome videos like this one. :)

    • @prabhavkaula9697 1 year ago

      Cool tutorial bhaiya 😌🙌
      Would you take up small duration text-to-video in the next tutorial?

    • @ShotterManable 1 year ago

      You're the one, sir. I just love your videos, and you're a big motivation for all of us wannabe pros. I follow you on Twitter and YouTube!

    • @michaeledison1974 1 year ago

      Hello! Could I contact you, please? I urgently need your help with my diploma thesis work. Please.

    • @AjaySingh-ey7gt 11 months ago

      Nice Abhishek

    • @vikasrai4915 9 months ago

      Hey Abhishek, can we clone our own voice using this? If so, can you please make a video to educate us? Great content.

  • @ravitanwar9537 1 year ago +1

    Not working. Also, please attach the code; it makes the process easier.

  • @Сливыприватныхкурсов

    Good, but you're so small in the video.

  • @TheBlackClockOfTime 1 year ago +1

    ngl it's like a light year away from ElevenLabs