I failed in the last video....but this time 😁

  • Published 21 Nov 2024

COMMENTS • 244

  • @NetworkChuck
    @NetworkChuck  2 days ago +23

    Stop storing your secrets and API keys in your code!! Try Keeper, a password manager you can use in the terminal: (built for devs/admins): www.keeper.io/networkchuck
    I did it…..after days of frustration, blood, sweat and coffee..I finally figured out a way to clone a voice to use with my fully local, AI voice assistant!!!! This isn’t using cloud-based products like ElevenLabs…no…we are using a fully-local, open-source project called Piper TTS. This works wonderfully with the Assist voice pipeline in Home Assistant.
    📝GUIDE and WALKTHROUGH: blog.networkchuck.com/posts/how-to-clone-a-voice/
    🔥🔥Join the NetworkChuck Academy!: ntck.co/NCAcademy
    **Sponsored by Keeper

    • @LukeCottrell-b1h
      @LukeCottrell-b1h 2 days ago

      I like your content

    • @brennanmahto5305
      @brennanmahto5305 2 days ago

      Why not just slow down your videos, have your AI hear them, then slowly train the AI to speed up?
      That way it can hear you enunciate.

    • @brennanmahto5305
      @brennanmahto5305 2 days ago

      I often have to slow down your videos to see and take notes on what you're doing, so why not have it do the same? (Not even fully done with the video yet. Very happy with this; my dad has wanted a Morgan Freeman AI assistant.)

    • @Yuriel1981
      @Yuriel1981 2 days ago

      That laptop 3080 is more like a 3070 or a 3070ti at best..... but still better than my 3050 6gb running my ollama lol.

    • @MacGuffin1
      @MacGuffin1 2 days ago +1

      @@Yuriel1981 You're being very generous

  • @nickolde5341
    @nickolde5341 2 days ago +57

    With all the dependency issues and fiddling around, someone should totally make this toolkit into a docker image!

    • @coffeegonewrong
      @coffeegonewrong 1 day ago +2

      The problem is you can’t access GPU from Docker…. Well, you can but you’ll end up doing all the same fiddling but with extra headache of the Docker layer

  • @drenpirraku3024
    @drenpirraku3024 2 days ago +28

    It probably has been years since i watched a 37 min video without skipping once let alone a tech video. I feel like my attention span has been permanently increased.

    • @TheFlow2006
      @TheFlow2006 1 day ago +2

      thanks for bringing that to my attention, i hadn´t realized it was that long , crazy

  • @RangerDK21
    @RangerDK21 2 days ago +81

    Be careful with showing yt-dlp...

    • @lucasdealmeidacarotta3174
      @lucasdealmeidacarotta3174 2 days ago +20

      Linus had a strike for similar reasons. I think this video might receive the same "attention" from YouTube, unfortunately

  • @FuchsDanin
    @FuchsDanin 1 day ago +6

    Quick note -- instead of removing silence, you would have been better served splitting at silence. The output would have been more intelligible for transcription, would not have required as many mid-word cuts which cause issues, etc etc
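
The "split at silence" idea above can be sketched roughly as follows. This is only an illustration, not the workflow from the video: the function name `split_at_silence`, the amplitude `threshold`, and the `min_silence_s` pause length are all assumed values, and the samples are taken to be floats already loaded from a WAV file. Cuts are made only where a pause lasts long enough, so short gaps inside a word or phrase stay intact.

```python
def split_at_silence(samples, sample_rate, threshold=0.02, min_silence_s=0.3):
    """Return (start, end) index pairs for the non-silent segments.

    A segment is closed only after at least `min_silence_s` seconds of
    consecutive quiet samples, so brief pauses do not cause a cut.
    All parameter defaults are illustrative assumptions.
    """
    min_silence = round(min_silence_s * sample_rate)
    segments = []
    start = None    # index where the current segment began, if any
    quiet_run = 0   # number of consecutive quiet samples seen so far
    for i, sample in enumerate(samples):
        if abs(sample) >= threshold:
            if start is None:
                start = i
            quiet_run = 0
        elif start is not None:
            quiet_run += 1
            if quiet_run >= min_silence:
                # End the segment at the first sample of the quiet run.
                segments.append((start, i - quiet_run + 1))
                start = None
                quiet_run = 0
    if start is not None:
        # Close the final segment, dropping any trailing quiet samples.
        segments.append((start, len(samples) - quiet_run))
    return segments
```

Each returned pair can then be written out as its own clip and transcribed separately, which keeps the sentence boundaries that removing silence would blur.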

  • @decepti0n
    @decepti0n 2 days ago +2

    Bravo, this is the peak educational youtube content. Learning with a twisted bit of fun

  • @yangenmanuel2659
    @yangenmanuel2659 1 day ago +1

    The end results of all the methods were so cool. Worth watching the entire video.

  • @shane7070
    @shane7070 2 days ago +2

    Hey thanks for all your videos on home automation. I've started my own home automation journey watching your channel and learning what's possible. Now looking forward to commanding my home like the USS Enterprise.... "Computer; make coffee" :D

  • @kingston396
    @kingston396 2 days ago +1

    Just bought a new house and am currently working on setting up automation and localizing everything offline. Challenge I'm hitting right now is getting mics in every space that go back to the assistant instead of having pi's everywhere. Also trying to limit the response to the room from which the request came from.
    Thanks for all the content! You have definitely made the process way more understandable and fun.

  • @exploittutorial8689
    @exploittutorial8689 2 days ago +41

    Him: a CPU will work
    Me: looking at my HP 540 g3

    • @ZIonDaWolfo
      @ZIonDaWolfo 2 days ago

      🥲

    • @Yuriel1981
      @Yuriel1981 2 days ago +1

      Yeah, naw dawg......I feel for you.

    • @WWSchoof
      @WWSchoof 2 days ago

      It will work - sooner or later

    • @daverahn1711
      @daverahn1711 1 day ago +2

      @@WWSchoof later, much much later

  • @emad2615
    @emad2615 1 day ago +1

    Hey Chuck, awesome video! I’m working on image detection, and it gave me an idea for your next project. How about a video on training custom image detection models? Like recognizing specific objects (e.g., PET bottles, toys) to expand what a home assistant can do. It could add some cool features to your Raspberry Pi assistant. Would love to see your take on it!

  • @freshseeds323
    @freshseeds323 2 days ago +19

    I had like flash backs for the 1st 10seconds, from being a kid yelling at those recorded talk back hamster toys with that same audio playing back XP

  • @marquisjohnson3849
    @marquisjohnson3849 2 days ago

    I'm so excited to try this out, each video I've tried to keep up and implement the home assistant and local ai. The voice is a wild addition

  • @DiaburoDev
    @DiaburoDev 2 days ago

    That was intense. I can't imagine how much time, work, coffee and nerves you put into this project, but it really was worth it. Terry sounds great! I hope the next project is less nerve-wracking. xD

  • @WWSchoof
    @WWSchoof 2 days ago

    The topic is so crazy and fascinating, I think I‘ll do a home project like this. The only thing that bothers me that I don‘t want to run my desktop pc 24/7.

  • @DIYenthusiastfreak
    @DIYenthusiastfreak 1 day ago

    Thanks Chuck, that vid had me wanting more, what a project! I hope some other shenanigans come about from this😊

  • @jcbenge08
    @jcbenge08 2 days ago +5

    OMG the Terry voice is AMAZING!!!!!

  • @kalebfenley1199
    @kalebfenley1199 4 hours ago

    Nice, I never recognized Mike as the voice of Mandark on Dexter's Laboratory before now. That's awesome.

  • @bertaboy
    @bertaboy 2 days ago +8

    Lookin forward to building a local digital assistant with Multiple Personality Disorder, where Dr. Jekyl sounds like Morgan Freeman and Mr. Hyde sounds like Samuel L Jackson....

  • @4bytesuserpage
    @4bytesuserpage 2 days ago

    Nice to see you're still uploading. I used to watch you after school every day ages ago through my window (we were neighbors)

  • @vevojckproin3046
    @vevojckproin3046 2 days ago +5

    Please need to talk about cash for servers, how it is done, and from what background should I learn this technique, and do you have courses about it?

  • @dr.hinneredv932
    @dr.hinneredv932 1 day ago

    This is awesome. Thank you for all your work. And special Thanks for sparing us the crying. :-)

  • @BurkenProductions
    @BurkenProductions 1 day ago +3

    The instructions on your blog are incomplete... stuff is missing and lots of libraries fail with torch and stuff. Can you please try on a fresh Ubuntu WSL install, follow your own guide, and correct the errors that come up?

  • @JoseMR1992
    @JoseMR1992 7 hours ago

    When chuck asked. Dont you want this in your home? I was like. F YEAH I DO!

  • @rogerhuston8287
    @rogerhuston8287 2 days ago +4

    Awesome! Now I can put your voice to the life-size doll I have of you....

  • @FuchsDanin
    @FuchsDanin 1 day ago +2

    Re: training issues: garbage in, garbage out. AI transcriptions are not suitable for AI training.

  • @NFTwizardz
    @NFTwizardz 1 day ago

    Lmfao you're 1000% becoming my voice assistant when I have the time!

  • @Aineasg
    @Aineasg 2 days ago

    Thanks, Chuck! I was really looking forward to this video. I absolutely love your content!

  • @iamdihan
    @iamdihan 2 days ago +2

    I ended up trying a bunch of API LLMs, and OpenAI's conversation agent and TTS are awesome and fast if you don't want to use your own hardware

  • @philrendell1767
    @philrendell1767 1 day ago

    I nearly wet myself when you played your voice after the training! Technology can't live without it😂

  • @timoknols3303
    @timoknols3303 2 days ago

    This is amazing, great you figured everything out. And ofcourse i want this in my home assistant 😮

  • @dave_kimura
    @dave_kimura 2 days ago +1

    Had a lot of issues getting it running on macOS, but was able to successfully get it up and running on my Ubuntu machine with python 3.10.12. After a few minutes of training, I tested it out and was surprised with the results. Pretty cool! If I have hours of quality recordings, what would the amount be to get a quality voice? Did you ever figure out why yours was a bit quirky?

  • @maximilianschmidt1872
    @maximilianschmidt1872 2 days ago +6

    I also wanted to train my local ai voice assistant with my voice and started using the piper studio in the German language.
    It wanted me to say a lot of sentences that sound like they're from a software call center and could be used for software scam calls, e.g. "The activation key you've entered is invalid", and in combination with other sentences like "Then call the police and see how far you get there" it sounds pretty strange to me.
    Then I saw a disclaimer on the page that says "By clicking Submit, you agree to dedicate your recorded audio to the public domain (CC0)".
    Is there anything known that the voice recorded by the software is distributed to the www and used for malicious phone calls?

  • @pjf
    @pjf 2 days ago

    I will try it, hope it works for me, is the project i have been waiting for! Thanks for sharing

  • @ArifBillahOnGoogle
    @ArifBillahOnGoogle 2 days ago

    When this guy opens up his camera gear, bugs and errors completely stop existing... I wish reality was like that.

  • @starlord2606
    @starlord2606 2 days ago

    Hey there Chuck, Great video, One more request or suggestion, whatever seems right, Make it talk with emotions, like the LLM is giving the responses and it is just reading it as it is, Maybe it should emphasize on those words, add some filler words and talk actually like it is a human talking. For example *talks intensely* shouldn't be read, instead adapted as emotion.
    Thank you, this is one of a gem Channel I have found which actually teaches cool stuffs.

  • @jackelo911
    @jackelo911 2 days ago

    I now know what I'm doing when I get home, Thanks Chuck!

  • @Smoth48
    @Smoth48 2 days ago

    Lmao, the mike monologues were the best thing I've ever heard. I really need to buy a new Pi so I can set up home assistant... I have an old RPi2, but it doesn't have the specs needed to run home assistant :(

  • @TomDavenport
    @TomDavenport 1 hour ago

    Honestly the chuck voice had me laughing so hard after 30 min of development 😂

  • @Felicia-bi5wu
    @Felicia-bi5wu 1 day ago

    I copied your last video and was like damn, i wish i could make my own, literally you a bit later, thank you :)
    The Ai thing is running in a virtualmachine in proxmox with a gtx970 so it's a little sheit but it works XD

  • @jokelot5221
    @jokelot5221 2 days ago

    I made a Pi LED agent a couple of days ago. I can turn an LED on and off using Whisper (small model) to transcribe my voice for llama3.2:3b; llama then generates a response, and a condition based on the string it provides toggles the LED. The model can also respond by voice using Piper (small model) with another prompt that llama handles, besides the one that controls the LED. I use pre-prompts to guide it: explain to the LLM what it is and which commands it should generate, and give it a few examples of how it's done, as this can improve its responses.
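
The pre-prompt approach described above might look something like this minimal sketch. Everything in it is hypothetical: the command words `LED_ON`, `LED_OFF` and `NONE`, the prompt wording, and the helper names are made up for illustration, and the calls to Whisper, the LLM and the GPIO pin are deliberately left out. It only shows the two pieces the comment describes: a prompt that tells the model what it is and gives a few examples, and a condition that maps the model's reply string to an LED action.

```python
# Hypothetical pre-prompt: says what the model is, which commands it may
# emit, and gives a few examples of the expected output.
SYSTEM_PROMPT = """You control a single LED.
Reply with exactly one command: LED_ON, LED_OFF, or NONE.
Examples:
User: turn the light on -> LED_ON
User: lights off please -> LED_OFF
User: what time is it -> NONE
"""


def build_prompt(user_text):
    """Prepend the pre-prompt to the transcribed user request."""
    return f"{SYSTEM_PROMPT}User: {user_text} ->"


def parse_command(model_reply):
    """Map the model reply to True (LED on), False (LED off) or None."""
    reply = model_reply.strip().upper()
    if "LED_ON" in reply:
        return True
    if "LED_OFF" in reply:
        return False
    return None  # no recognised command, so leave the LED untouched
```

Keeping the parsing tolerant of case and stray whitespace helps because small models do not always reproduce the command word exactly as written in the examples.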

  • @meenstreek
    @meenstreek 1 day ago

    Was it _really_ free, though? haha. Awesome job! Thanks for this!

  • @Danielddiniz
    @Danielddiniz 2 days ago

    Wow the Terry crews voice was amazing! Proper voice for your beefy Terry AI server! Congratulations

  • @marcomoraschi3972
    @marcomoraschi3972 3 hours ago

    You were my hero just with the other video, and now @just 1:23 you are more hero than hero .... LOL

  • @PovertyHelping
    @PovertyHelping 1 day ago +1

    Thanks so much for this favorite online classes. You are the best teacher. Please teach me how to make a raspberry pi that contains local chatgpt for generating texts to 3d gaming characters ❓

  • @arnorenirving
    @arnorenirving 2 days ago +2

    Fun fact: Demirkapı means iron door in Turkish (bill probably are)

  • @oldekline
    @oldekline 2 days ago +2

    Bro has Brad Boimler vibes! I'm here for it.

    • @ethanberg1
      @ethanberg1 1 day ago

      He just needs the Boimler scream!

    • @oldekline
      @oldekline 1 day ago

      @@ethanberg1 That could be the beard that Boimler has been growing all season.

  • @wayne8113
    @wayne8113 2 days ago

    Thanks Chuck, I think I'm too dumb to do that, but it looks so cool and out of the cloud 👍

  • @danielstellmon5330
    @danielstellmon5330 2 days ago

    Chuck says "So many little things to remember" all I hear is "take notes and write a script as you will never remember them all"

  • @Danielddiniz
    @Danielddiniz 2 days ago +2

    Next video must be putting your voice in a Chuck the assassin doll with creepy phrases pleaaasse 😂

  • @alfadat
    @alfadat 2 days ago +1

    Hey Chuck fantastic and clear video! thank you! However you bring mixed messages, when you mention Keeper you said that is good that is "Cloud Based", but in your video, it seems like you prefer local installations (1:11 mark)

    • @NetworkChuck
      @NetworkChuck  2 days ago +1

      What is good for an individual (local hardware) may not be good for a company. As an individual, I’m willing to accept the cost and pain of maintaining a local infrastructure because it’s fun. For a business, the highest value becomes reliability.

    • @semondemon3787
      @semondemon3787 2 days ago

      ​@NetworkChuck Hello my name is suck 😁🤣

  • @VorpalForceField
    @VorpalForceField 7 hours ago

    Absolute Beast Mode..!!! You Rock ...!! Cheers :)

  • @mrnee
    @mrnee 2 days ago

    I actually was so inspired by your last video that i developed a python script that runs natively on windows and does everything satellite does... well more.. stt and tts is on windows (no docker necessary). transcript is sent to conversation API through websocket and then response is turned into speech in 1/10th of second on your master race PC (no hate - console gamer here). So you do not need an extra satellite (about $70,- in extra HW) when you have a faster machine right in front of you. It was hard for two reasons. 1) home assistants docs and API are crap. API refuse to work if you pass wrong params, but docs don't tell you, that you actually CAN talk to Ollama on Home Assistant... if you spend 3 days in trial and error. (i am not python pro... I have not touched python before. I'm php enthusiast at best. That i was able to do this - makes me proud). But, for the last day I've been trying to replicate Donald's voice. Got Onnx, without tflite file. Demo works, but not in HA. Your vid comes just in time!

  • @issaissa6257
    @issaissa6257 37 minutes ago

    NetworkChuck's voice with a Chinese accent sounds so funny 😆😆

  • @notedown1010
    @notedown1010 2 days ago

    @NetworkChuck what kind of keyboard do you use? I'm dying to know because it just sounds SO good

  • @jldevezas
    @jldevezas 2 days ago

    Oh man, love it! Freakin' cool! Totally worth the effort! 😁

  • @satirical_snake
    @satirical_snake 2 days ago +1

    Now we're talking. Been waiting for this one!

  • @dancannyonge
    @dancannyonge 2 days ago

    That was amazing..I am currently building mine❤❤

  • @DxBang3D
    @DxBang3D 18 hours ago

    I can recommend using stable whisper instead of whisper to get better timestamps.

  • @LordDartonStaker
    @LordDartonStaker 2 days ago

    I've literally been following this series that you have been updating, From the Start to now - I have Ollama with AlwaysReddy setup on my Ubuntu 24.04 OS - Running this - I will be trying to implement this on a New Raspberry Pi 5 (Quick question, will it be beneficial to add the AI HAT that you get for the Pi?) But really interested in this project and thank you so much for the inspiration to follow along the journey.
    Much respect,
    Great Channel.

  • @marcsmith5880
    @marcsmith5880 2 days ago

    Thanks! This is going to make my life so much easier. Going to use a pi zero 2 W and a keyestudio 2 mic hat.

  • @mal-avcisi9783
    @mal-avcisi9783 2 days ago

    bro there are much easier ways to clone voice locally. but still fun to watch this video 👌👌

  • @SU3D3
    @SU3D3 2 days ago +1

    I remember thinking ".wav" files were huge!

  • @deejayx256
    @deejayx256 2 days ago

    I can't sleep without watching your videos 🎉🎉

  • @mrnee
    @mrnee 2 days ago

    onnx is not a universal format for tts. There are more pth files for tts readily available. Also -- ONNX (Open Neural Network Exchange) is an open format built to represent machine learning models... any models be it stable diffusion, GPT's... I'm not guru, learned it all today

  • @Robban31013
    @Robban31013 1 day ago

    Hi Chuck! I want to integrate this to all the bedrooms in my soon to be home. I already plan to build in Sonos speakers into the ceiling(Sonos in-ceiling speakers). Is it possible to use this speakers instead of the small speaker that you are currently using? Thanks mate! Really enjoying your content! 🙌 (About to build my dream home and wants to make it smart/AI)

  • @kevinh5983
    @kevinh5983 1 day ago +1

    We all know that Morgan Freeman is what Chuck is going to change it to after the video ends.

  • @isuckatthat
    @isuckatthat 2 days ago +3

    This is overkill. All you need is Alltalk tts and 10ish second sample of your voice.

    • @touma-san91
      @touma-san91 2 days ago

      The accuracy won't be the same though

  • @CreativeCreed-jp7bm
    @CreativeCreed-jp7bm 1 day ago +1

    29:20 oh no not the csauce

  • @Doran_Krotan
    @Doran_Krotan 2 days ago

    i love you chuck. youre the best

  • @philipl8866
    @philipl8866 2 days ago

    Good job chuck

  • @fyzxnerd
    @fyzxnerd 1 day ago

    YOU LET OUT MAGIC SMOKE!

  • @anatolykosychenko8038
    @anatolykosychenko8038 2 days ago

    Hi Ya & best wishes. Thanks for work. Be Happy. Sevastopol/Crimea.)

  • @pilotedge
    @pilotedge 2 days ago

    Not sure if anyone has thought of this... But I just downloaded an Audible with a celebrity reading and now have 3 hours of perfect training material 😂

  • @TristanCampbell-Reynolds
    @TristanCampbell-Reynolds 1 day ago

    This was so cool!

  • @crazykkid2000
    @crazykkid2000 20 hours ago

    You should do some Unraid videos. You can do all this in Unraid with Docker apps so much more easily

  • @jimhark
    @jimhark 2 days ago

    @NetworkChuck, the Terry Crews voice clone does sound great, but I feel like you must have left something out. You attempted to use an automated process to generate an onnx file from your recorded voice, but the results were poor. You went back to Piper Recording Studio to get a decent voice clone. You said Mike spent some quality time with Piper Recording Studio for good results. I don't imagine Terry used Piper Recording Studio. So what did you do differently to achieve such a good result from prerecorded audio?

  • @derivitiv
    @derivitiv 2 days ago +1

    I just wanna say.. I am fully onboard with making my own AI assistant based on your video guides. However, the only thing holding me back is that I get Amazon Music via my Alexa. Would it be possible to include this service with this setup?

    • @derivitiv
      @derivitiv 2 days ago +1

      Nevermind. I just found an article on how to do it.

    • @ramppage
      @ramppage 2 days ago

      ​@@derivitivcan you post a link?

  • @ozerune
    @ozerune 2 days ago +2

    A very good YouTuber for voice-based AI is Jarods Journey. He's fantastic and I learned everything I need to know about voice AI from him.

  • @gamereditor59ner22
    @gamereditor59ner22 2 days ago +1

    Holy clone!

  • @JamieStuff
    @JamieStuff 2 days ago +2

    I may end up spending a month clipping "The A-Team" to get mine to talk like Mr. T, fool!

  • @jameskirk5914
    @jameskirk5914 3 hours ago

    great video

  • @ShutUpKabza
    @ShutUpKabza 2 days ago

    Mikes voice sounds like Mandark from Dexter's lab lmao 🤣🤣🤣

  • @mr.beats.3756
    @mr.beats.3756 2 days ago +2

    Sir big fan ❤

  • @MTEX-tr6vd
    @MTEX-tr6vd 2 days ago

    Hey Chuck, you still didn't fix that longer conversation... any way to fix it? Or summarise the context?

  • @JasonMaggini
    @JasonMaggini 2 days ago

    "Hi, my name is Chuck. My voice is my password. Verify me."

  • @justsomeguy8982
    @justsomeguy8982 13 hours ago +1

    it doesn't work for 40xx series of gpu, so everyone on 40xx series should do the github issue fix.

  • @ernieprevost6555
    @ernieprevost6555 19 hours ago

    Hi Chuck, I really enjoyed your tutorial. Sorry if I am doing something wrong but, I have tried several times to add a lengthy comment which keep disappearing, do you know why this might be? Ernie

  • @DezFutak
    @DezFutak 2 days ago

    "Hey Chuck! Show me how to groom my beard like yours!" "Sure! All ya gotta do is drink a LOT of my coffee!" ;)

  • @PcSuchtie
    @PcSuchtie 2 days ago

    Hey, love your vids, but can you also include AMD GPUs in your tutorials? I know it's only a small percentage on AMD, but it would still be nice.
    (Or link another video as a tutorial, just something so I don't feel left alone lmao)

  • @mrnee
    @mrnee 2 days ago

    "I can't handle batch size" might not be a GPU RAM issue, but a wrong software configuration. A 4070 with 12GB couldn't handle a batch size of 2, but on CPU a batch size of 16 was not a problem. I didn't try 32

  • @joshhardin666
    @joshhardin666 2 days ago

    Time to find me some Majel Roddenberry clips I guess (for respectful personal, non-commercial, non-distribution use of course)

  • @Tejas-yx5vf
    @Tejas-yx5vf 6 hours ago

    hell yeah...he made it work....

  • @francisyohannan6019
    @francisyohannan6019 2 days ago

    Awesome 🥰🥰🥰🥰

  • @Weeem
    @Weeem 1 day ago

    I Wanted to use this so much, but couldn't 'cos of the disclaimer @06:10 that dedicates all audio clips public domain.

    • @dr.hinneredv932
      @dr.hinneredv932 1 day ago

      ?

    • @blockcrafter2393
      @blockcrafter2393 19 hours ago

      To quote the maintainer of Piper Recording Studio: "This is only there because I'm hosting Piper Recording Studio where it is the case that submitted/uploaded audio is donated to the public domain.
      But when you run locally, it's up to you what you do with the data. I should add a flag to only show that text for my website"

  • @pearlstar159
    @pearlstar159 2 days ago

    It might be helpful if you separate the voice audio from the video into a splitter file, clean it up, remaster the cleaned-up audio, and listen for errors. Then break down the words, run them through a phonics filter, and create an alphanumeric dictionary in a lossless audio file. Trim the top frequency and bottom, and note the speed ratio and frame rate. Then train on the audio dictionary you created; make sure you speak the alphabet and numerical values clearly. Then use the training data to compare and compete against one another, and do a live training where you compete against the audio in a playback recording session. It should be at least 1,000 words and 1,000 numbers lol😂

  • @Badg0r
    @Badg0r 2 days ago

    Why do you use windows more and more instead of a native linux installation? I mean, Linux is a bit more secure (depending on the OS you're using).

  • @ChristophBerg-vi5yr
    @ChristophBerg-vi5yr 2 days ago

    Terry should use a Borg voice

  • @LawrenceOrsini
    @LawrenceOrsini 2 days ago

    You did it!!!

  • @rajackar
    @rajackar 2 days ago +1

    Super cool video. Tried the steps and I get an error trying to install numpy 1.24.4. :
    "module 'pkgutil' has no attribute 'ImpImporter'."
    Did you run into this as well? Can't find a solution just yet.
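
One likely cause, offered as an assumption rather than a confirmed diagnosis of this setup: `pkgutil.ImpImporter` was removed in Python 3.12, and older setuptools/pkg_resources code that runs when building old packages from source still references it. Another commenter above reports success on Python 3.10.12. The small check below (the function name is made up for illustration) reports whether the running interpreter is affected.

```python
import pkgutil
import sys


def imp_importer_missing():
    """Return True when pkgutil.ImpImporter is absent (Python 3.12+)."""
    return not hasattr(pkgutil, "ImpImporter")


if __name__ == "__main__":
    version = ".".join(str(part) for part in sys.version_info[:3])
    if imp_importer_missing():
        print(f"Python {version}: pkgutil.ImpImporter is missing; "
              "older build tooling may fail with this error.")
    else:
        print(f"Python {version}: pkgutil.ImpImporter is present.")
```

If the check reports the attribute as missing, creating the virtual environment with an older interpreter such as Python 3.10 is one option to try.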