Descript AI Voice Cloning - DaveClone vs RealDave

  • Published 27 Aug 2024
  • Testing another AI voice cloning tool, Descript. Can this one produce a DaveClone?
    Eleven Labs video: • DaveClone - Testing El...
    Forum: www.eevblog.co...
    If you find my videos useful you may consider supporting the EEVblog on Patreon: / eevblog
    Web Site: www.eevblog.com
    Main Channel: / eevblog
    EEVdiscover: / eevdiscover
    AliExpress Affiliate: s.click.aliexpr...
    Buy anything through that link and Dave gets a commission at no cost to you.
    T-Shirts: teespring.com/s...
    #ElectronicsCreators #descript #ai

COMMENTS • 158

  • @gfx2006
    @gfx2006 1 year ago +98

    To me as a non-native speaker living in Australia the 2nd one sounds a lot more "Dave" than the first. It does sound a little bit too excited at the wrong places in the script, but overall it is still a lot more convincing than the super flat AI voice of the first.

    • @komorka88
      @komorka88 1 year ago +16

      I agree. The voice trained on videos sounds more like normal Dave video. Just a bit too excited.

    • @jannb.6811
      @jannb.6811 1 year ago +4

      A mixture of both would do the business.

    • @EEVblog
      @EEVblog 1 year ago

      Really? I think it's awful. I get what you mean that it's more like my excited voice, but the voice itself is horrible and grating. It's like words have been cut off or something.

    • @gfx2006
      @gfx2006 1 year ago +3

      @@EEVblog I don't disagree with it being a bit grating and words getting a bit cut off etc, but it is still a much more accurate you than the first robotic one :) If they could vary the tone and excitement throughout the script to match the context, it would be a perfect reproduction of your real voice!

    • @brumbymg
      @brumbymg 1 year ago +2

      @@EEVblog Yeah .... Sorry Dave. That second one is pretty convincing. Yes, it's a bit "excited", but the tonal components are rather believable. The monotonic one is rather drone-like. The real Dave is somewhere in between in regards to expressiveness, but both have decent tonal qualities. IMHO

  • @jhonbus
    @jhonbus 1 year ago +59

    😂 The first sounds like you're being held hostage by terrorists and forced to read about transistors
    The second is incredibly good! It's like your normal level of "excitement" has been tripled, but it's way more convincing.

    • @woox2k
      @woox2k 1 year ago +4

      I thought it's pretty good too, but it got one emotion and stuck with it at all times. After a few seconds without a change it sounds robotic and annoying.

    • @shazam6274
      @shazam6274 1 year ago +1

      @@woox2k As sometimes the ranting Dave is also.

  • @theoloutlaw
    @theoloutlaw 1 year ago +66

    Well... to be honest, that second sample does sound very similar to your voice in your early videos.

    • @BenMitro
      @BenMitro 1 year ago +13

      I think Dave liked the first one because it made him sound better than he normally does. Yes, I agree there was more of Dave's twang in the second one.

    • @coxyofnewp
      @coxyofnewp 1 year ago +5

      @@BenMitro Yeah I think that's the case too

    • @mycosys
      @mycosys 1 year ago +4

      SOLAR FREAKING ROADWAYS!

    • @EngineeringVignettes
      @EngineeringVignettes 1 year ago +4

      When Dave gets excited, his voice pitches up more. I think that's what the second run of Descript was catching a bit of in the sample videos.
      The first was read by Dave from script and was read at his normal pitch... maybe because the content was not that exciting ( :) ) or done deliberately by Dave to create a better dataset for testing different cloning tools.

    • @jhonbus
      @jhonbus 1 year ago +10

      Definitely agree with this. The second is better than the first, but it's like Dave is trying to out-Dave himself 😂

  • @hardwareful
    @hardwareful 1 year ago +25

    "Some transistors are packaged gibber."
    The AI knows something about Chinese counterfeit audiophile power transistors. Respect.

    • @Okurka.
      @Okurka. 1 year ago +1

      Real Audiophiles don't use transistors.

    • @5mxg
      @5mxg 1 year ago +1

      @@Okurka. Real gibber don't jabba

  • @Okurka.
    @Okurka. 1 year ago +8

    6:14 Now you know the pain we have to go through.

  • @bbrazen
    @bbrazen 1 year ago +19

    Even though you listen to yourself more than most people, I think it would be challenging for anyone to objectively analyze their own voice. Great video as usual Dave!!

    • @WaffleStaffel
      @WaffleStaffel 1 year ago +1

      I think the more emotive one is the most accurate... Almost spot on. He's not going to like hearing that.

    • @EEVblog
      @EEVblog 1 year ago +1

      Sorry, but nope. I've been trained on thousands of hours of my own voice played back through high quality studio monitor speakers.

    • @WaffleStaffel
      @WaffleStaffel 1 year ago +2

      @@EEVblog But you have no more objectivity on the matter than anyone else. That last sample is the closest, and it's like 90% accurate.

    • @shazam6274
      @shazam6274 1 year ago

      @@EEVblog Immaterial! You inherently have subjective bias. Face it Dave, all your fans also have thousands of hours listening to your voice and the majority rule that the overemotional sample is YOU!

  • @MikrySoft
    @MikrySoft 1 year ago +15

    I didn't watch the first video, but from the sample in this one "British Dave" sounded better, words were flowing one into another. Those two new samples sounded like someone made a soundboard of separate words and stitched them together into sentences.

  • @pjmelect
    @pjmelect 1 year ago +5

    I think that "British Dave" sounded better and closer to your voice. You may be sensitive to the accent but non-Australians will not notice the lack of accent.

  • @unconv
    @unconv 1 year ago +13

    I think the second version was actually a lot more like you, just a bit too "excited" for the text given. You should have it say "sex on a stick" or something.

  • @mycosys
    @mycosys 1 year ago +8

    Sorry Dave, but the one trained on your videos REALLY sounded like you in your videos, just without the context of why they're constantly putting in voice stress. Sounded just like one of the 'solar freaking roadways' vids - and yeah, it really isn't your best side XD.
    Russell Brand doing Dave impersonations was a spot on call for the first one

  • @deanrubine2955
    @deanrubine2955 1 year ago +3

    Dave, I don't know what you think you sound like, but that second one sounds much more like the emoting Aussie we're used to. Add a few "Bob's your uncle," "that's terrible Muriel" and "we're in like Flynn" and call it done.

  • @IanSlothieRolfe
    @IanSlothieRolfe 1 year ago +4

    I think the DaveClone voice is better from the point of view of the emotion in the voice. The problem is that it is putting the emphasis in the wrong place because it's not being done in the context of what is being said, which is probably why it grates so much. But the "android" voice is so close, it's just that we're not used to monotone Dave.

  • @cirdiam1800
    @cirdiam1800 1 year ago +2

    To me, the one trained on your videos sounds much more like you than the one trained on the script.

  • @whatevernamegoeshere3644
    @whatevernamegoeshere3644 1 year ago +2

    The second one sounded exactly like you, but maybe after a couple pints

  • @GiannisKaralis
    @GiannisKaralis 1 year ago

    1(text train): Battery Drained. 2(video based): Over Voltage . 3(Eleven Labs): Batteries in series with leakage .

  • @peterdkay
    @peterdkay 1 year ago +1

    I am an Aussie and the second sounds more like you.
    If it turned down the excitement a bit, it would be you!

  • @Spongman
    @Spongman 1 year ago +1

    Yeah, the 2nd one is definitely more "Dave". Maybe a little over-the-top, but much less robotic than the 1st one, especially the ending of the word "individually".

  • @jsdutky
    @jsdutky 9 months ago

    The second one, trained on your natural voice, actually sounds much better to me than the one trained on the recommended script, mainly because the second voice seems to capture some of your emotive expression. I find the normal, emotionless "AI" voices to be impossible to listen to for any length of time, but I could listen to a fair bit of text rendered in the second voice.

  • @chitlitlah
    @chitlitlah 1 year ago +1

    The first one sounds like Dave on smack. The second one sounds like Dave on crank.

  • @StevenOBrien
    @StevenOBrien 1 year ago +1

    Davebot: "g r e e t i n g s. I r e q u i r e y o u r b a n k d e t a i l s t o p r o c e s s r e f u n d"
    Customer: "Dave? Is that really you? You sound like you have a cold."
    Davebot: "... ... a i n t t h a t a b o b b y d a z z l e r"
    Customer: "Oh, haha, okay, it is you. Hold on, sending"
    Davebot: "B O B I S Y O U R U N C L E"

  • @oskimac
    @oskimac 1 year ago +3

    I don't understand. The second sounds like garbage to me immediately, but the first sounds spot on. It's just my opinion. The second sounds like Terminator describing the rise of Skynet.

    • @EEVblog2
      @EEVblog2 1 year ago +1

      Everyone else here in the comments seems to think the 2nd one is way better.

  • @JohnBurgessMusic
    @JohnBurgessMusic 1 year ago +1

    I like the second one....in fact I want the AI to have an emotive multiplier, like those resident evil videos where the facial expressions are multiplied 500%, except applied to your voice.

  • @djbassaus
    @djbassaus 1 year ago +1

    First one sounds like Dave has lost his will to live, second one is still pretty good bit like Dave is stuck on high energy mode.

  • @tomg0
    @tomg0 1 year ago +2

    2nd sounded much better and more like you in my opinion

  • @johannes_franciscus_kok
    @johannes_franciscus_kok 1 year ago +1

    06:19 is quite how you are speaking in your clips :-)

  • @heathwellsNZ
    @heathwellsNZ 1 year ago

    It's a LOT better than that previous one... but still doesn't sound like there's any emotion or "soul" in the voice. It does a better job at creating the accent.

  • @Blitterbug
    @Blitterbug 1 year ago

    You got your Strine back! 9/10 if they could remove the aliasing.

  • @davebeerman
    @davebeerman 1 year ago

    That is what Video-Dave and Podcast-Dave sound like after they've been partying together all weekend.

  • @erikdenhouter
    @erikdenhouter 1 year ago

    In the second one, I could predict after 30 words where your voice would raise to a high pitch. Always the same sound in a wavy format.
    First one is best, but low pitched and flat; like if it had only few tones with nothing in between, like the fixed energy levels from electrons.

  • @cspower7259
    @cspower7259 1 year ago +1

    It's definitely got the accent down. Maybe a mix between the two would be quite close. It seems the second lacked bass frequencies.

  • @dadsgarage738
    @dadsgarage738 1 year ago

    Haha - the second generated voice is spot on😂

  • @richardhalliday6469
    @richardhalliday6469 1 year ago

    The best one was the one you didn't like ,it was so much like you, over excited, high pitched typical Dave tone and intonation.

  • @Enigma758
    @Enigma758 1 year ago

    I knew this was coming.

  • @TheDefpom
    @TheDefpom 1 year ago +1

    lol, the video trained one sounded more accurate to me

  • @breakalegfpv9532
    @breakalegfpv9532 1 year ago

    every time you try something you are teaching the beast..we have no chance.

    • @Okurka.
      @Okurka. 1 year ago

      You have no chance to survive make your time.

  • @uni-byte
    @uni-byte 1 year ago +1

    The first one missed the word "individually" but it got it on the 2nd one. The 2nd has some of your inflection, but in all the wrong places. Perhaps if it had a better vocabulary it would improve.

  • @TheStevenWhiting
    @TheStevenWhiting 1 year ago

    I'm from the UK and I could tell the last one sounded British.

  • @reedreamer9518
    @reedreamer9518 1 year ago

    This is The Singularity!

  • @MiloszSuchy
    @MiloszSuchy 1 year ago

    The second one sounds like Dave on weed having the greatest time of his life with transistors xD

  • @ITTom
    @ITTom 1 year ago

    1st 8/10 no emotions Dave (tone of voice: actually very similar)
    2nd 6/10 sad Dave (aussie accent with 0 human touch)
    3rd 2/10 squirrel Dave (no comments)

  • @cambridgemart2075
    @cambridgemart2075 1 year ago +1

    To a Brit, that isn't a British accent Dave!

  • @pahom2
    @pahom2 1 year ago

    The previous attempt was much better. This one literally sounds like a robot with metallic notes in its voice.

  • @mycosys
    @mycosys 1 year ago +1

    You know, getting Russel Dave to read bits of your script would be kinda memeworthy XD

    • @EEVblog
      @EEVblog 1 year ago

      The British Dave does actually sound good and rather pleasant, it's just not me.

  • @SuperSerNiko97
    @SuperSerNiko97 1 year ago

    You should reach out to Home Assistant saying you want to train Piper with your voice

  • @corenelius
    @corenelius 1 year ago

    Better accent but like you say Dave, no characteristic inflections. 7/10

  • @6581punk
    @6581punk 1 year ago +1

    It's a lot better than the last one. If it had some character and personality it would be much harder to tell.

  • @duprod5482
    @duprod5482 1 year ago +3

    Is the clone called Daiv?

  • @WereCatf
    @WereCatf 1 year ago +2

    8/10?! Pssh, you're kidding! Sure, yes, it gets the Aussie accent quite well, but some words have this odd raspy glitch at the end -- I dunno, the word that comes to mind would be "sawtooth", but that's just me -- and even ignoring the monotonousness, it sounds too robotic still. I give it a 6/10. Once they figure out a way of avoiding the monotonousness, that's when I'll be impressed.

    • @chitlitlah
      @chitlitlah 1 year ago

      The last one did sound a little crackly to me. I would've believed it was a microphone problem. I didn't notice it on the first one though.

  • @Martin-oo4kn
    @Martin-oo4kn 1 year ago

    9/10, they both sound good to me

  • @jcthe2nd
    @jcthe2nd 1 year ago +1

    2nd one is you dave for sure 8 out of 10

  • @foxabilo
    @foxabilo 1 year ago +1

    I still can't tell the difference, apart from the one trained on your videos was more animated... I must have a low resolution audio decoder in my brain.

  • @TheStevenWhiting
    @TheStevenWhiting 1 year ago

    First one sounded good, but you can still tell it's AI and weird, but def sounds like you.

  • @Youtronics
    @Youtronics 1 year ago

    I rate this a clear Jabber/10 !

  • @airmann90
    @airmann90 1 year ago

    Pretty decent actually. Extremely depressed version but very good lol

  • @laurentallenguerard
    @laurentallenguerard 1 year ago

    Cool! It clearly lacks the emotions, but not much is missing before building a DaveBot to fix electronics for you.

  • @wolfiexii
    @wolfiexii 1 year ago

    It's hollow, not terrible but not right.

  • @Pidroe
    @Pidroe 1 year ago +5

    Are you trolling, Dave? Screw the accent if the voice sounds like a robot. Eleven Labs is waaay better than the others in the video.

    • @EEVblog2
      @EEVblog2 1 year ago

      I agree, but the point is trying to match MY voice and accent.

  • @Philipp20052
    @Philipp20052 1 year ago +6

    Oh wow, I actually have to fully disagree here.. The Eleven Labs voice maybe not nailing the accent 100% right, but sounds like a real, smooth human voice. The Descript AI ones both sound like some Text-To-Speech technology out of the 90s/00s with real audible noise / unnatural sharpness to the voices. But I still much prefer the second Descript AI voice over the first. So for me it's:
    Eleven Labs >>> 2nd Descript AI > 1st Descript AI

    • @EEVblog
      @EEVblog 1 year ago

      I agree that the Eleven Labs one sounds better and smooth, it's way more pleasant sounding. But it's not me.

  • @pauldenisowski
    @pauldenisowski 1 year ago +6

    The real test would be to produce one of your videos with a cloned voice and see how many people notice a difference.

    • @woox2k
      @woox2k 1 year ago

      Almost all people would notice! Dave has a difficult voice pattern to follow artificially. Like he said, he talks with emotion. It's clear if he's reading a script or talking about something that excites him. He might just be good at acting and not talk like this in real life but we do not care, we are used to his speech in his videos and that is difficult to replicate.

  • @patrickjean5811
    @patrickjean5811 1 year ago

    The AI forgot to say Bob's your uncle...

  • @gshingles
    @gshingles 1 year ago +1

    Ah yeah, sorry Dave, the second one is deffo more "you" on an average video, but like others say, it doesn't match the subject matter. If you put a debunking script into it, I think it would sound on point 🙂

  • @whitebakecase
    @whitebakecase 1 year ago

    Have to say, that Eleven Labs one sounds just like Hugh Jeffreys

  • @Xairos84
    @Xairos84 1 year ago

    Hahah 2nd comment for the algo, your energy is cracking me up. Love it!

  • @mensor
    @mensor 1 year ago

    Second video is more convincing. First sounds more robotic - it's interesting how people don't recognise their own voice.

  • @MetallicBlade
    @MetallicBlade 1 year ago

    I don't know Dave... it needs more 'jibber and jabber'.

  • @Phantom-mk4kp
    @Phantom-mk4kp 1 year ago

    Second one was spot on, first one not so good. First 5 second one 9

  • @a178design
    @a178design 1 year ago

    Try Play.ht. Although I haven't tried their voice cloning, their pre-existing voices are quite good.

  • @willoland
    @willoland 1 year ago

    If you were calling me and asking for gift cards, I would expect the excitement of the second one.
    It would be a total sell.

  • @notsurt
    @notsurt 1 year ago

    Depressed Dave vs overexcited Dave.

  • @TTTTTANK
    @TTTTTANK 1 year ago +1

    That sounds a LOT more like you to me than the British one (I’m British). If only you could mix the two you had here, I think that would work well. The first one here might fool me if I thought you were deliberately reading it as if some calamity had occurred. Think it’s cloned the voice well but needs natural emotion/inflection.
    7/10

  • @MrTripcore
    @MrTripcore 1 year ago

    8/10 it's dry monotone and you can hear the breaks

  • @summerforever6736
    @summerforever6736 1 year ago

    The 2nd one was the best!!! That is you, like it or not.

  • @--Nath--
    @--Nath-- 1 year ago +1

    I reckon it sounds a bit of a younger Dave.. (the one you gave 8 out of 10). About half way between the two would be probably about it.

  • @GrumsPlace
    @GrumsPlace 1 year ago

    The missus prefers the first AI generated version of your voice @ 4:30 mark 🤣

  • @5mxg
    @5mxg 1 year ago

    What comes from learning with mix of script reading and audio from videos? I think those have only your character and script have no life. Something around 20% between the two could give nice results?

  • @nesnioreh
    @nesnioreh 1 year ago

    Second one is better. Sounds more like your voice in your videos.

  • @Yotanido
    @Yotanido 1 year ago +2

    DaveClone is a bit too fast and a bit too high pitched, but overall I'd say it is much better than The Transistor. It's not as monotone, but more importantly: It actually sounds a lot more like you.
    It's definitely not fooling anyone, but small snippets might actually be believable.

  • @TheStevenWhiting
    @TheStevenWhiting 1 year ago

    2nd one also sounded like you but like you're trying not to laugh.

  • @nigelgunn322
    @nigelgunn322 1 year ago

    I have to disagree. The "chalkboard" sample sounds more like you. The first sample is too low in frequency.

  • @bobbydazzler6990
    @bobbydazzler6990 1 year ago

    Can we give "AI Dave Voice" the name 'Dave from the Old Dart'? 🤣🤣🤣
    Aussie slang kills me. 😁

  • @Xairos84
    @Xairos84 1 year ago

    Hahah I couldn't hear it. US native here and I tried to pick out certain words and I still couldn't tell

  • @sklepa
    @sklepa 1 year ago

    Call him Dawid

  • @MrBCRC
    @MrBCRC 1 year ago

    The second one is closer to you than the first. Yeah. We all see our own voices as cringe. LOL

  • @BMSWEB
    @BMSWEB 1 year ago

    Not perfect but wow, so much better and at least it's not a British accent lol

  • @frankbose544
    @frankbose544 1 year ago

    to my yankie ear the old ai one was close to me

  • @dazednconfused31337
    @dazednconfused31337 1 year ago

    The first had a depressed deadpan delivery, like you were being sarcastically uninterested when reading about a new product.
    The second sounded a bit like horse racing commentary 🏇

  • @summerforever6736
    @summerforever6736 1 year ago

    Sorry to say, but you sound like the 2nd one. You don't like it, but it is true!! Impressive.

  • @Land-of-reason
    @Land-of-reason 1 year ago +1

    Nothing like.

  • @TzOk
    @TzOk 1 year ago

    My ranking would be:
    1. Eleven Labs - most fluent.
    2. Descript AI trained on videos - most natural, yet slightly too excited.
    3. Descript AI trained on a script - very artificial sounding, with lots of artifacts, didn't like it at all.

  • @TradieTrev
    @TradieTrev 1 year ago

    For shits and giggles could you start one of your video with a Russell Brand "Hello Beautiful People!" :P

  • @QsTechService1
    @QsTechService1 1 year ago

    Pretty soon we won't know what's real or fake, especially after Google rolls out their AI

  • @organiccold
    @organiccold 1 year ago

    Portuguese here and I said straight away: British accent 😂😂

  • @navalenigma
    @navalenigma 1 year ago

    I'm British but could tell that last one had a twang of aussie but basically British. In second you could tell wasn't right. On second, much better but lacks good flow, you could tell wasn't human. Pace seemed wrong too. Third was not good. Overall second was probably best for me.

  • @MattyEngland
    @MattyEngland 1 year ago +1

    I thought the original one sounded more like you than this one! Maybe it's because I'm English? 🤔 Weird and interesting in equal measure lol.

    • @EEVblog2
      @EEVblog2 1 year ago +2

      The Eleven Labs one is "smoother" I think, and more pleasant to listen to, but the accent is completely wrong so it's a fail on cloning my actual voice.

    • @jhonbus
      @jhonbus 1 year ago

      I'm English and I thought the ElevenLabs one was terrible. It has a very slight Aussie twang, like someone from Australia who's lived in England for 30 years.

  • @emielkosse
    @emielkosse 1 year ago

    DaveClone sounds like you 10 years ago😂

  • @morantaylor
    @morantaylor 1 year ago

    You have been assimilated.

  • @jdlives8992
    @jdlives8992 1 year ago +2

    i’m sure a lot of us viewers are in the states. we couldn’t tell cause we don’t even question our own election. 😂. kidding but kinda not

  • @framegrace1
    @framegrace1 1 year ago

    This one sounds more like you, but is more artificial. Sounds robotic, with no inflections, no modulation. Just-as-if-you-read-a-word-at-a-time.

  • @jamesmauer7398
    @jamesmauer7398 1 year ago

    Yeah this one is closer but totally lacking emotion/ inflection. Sorry can't replace you yet!

  • @bastianfromkwhbsn8498
    @bastianfromkwhbsn8498 1 year ago

    Very strange, to me the training script one sounds the least like you. And even though I'm very used to the Aussie accent (lived there for a while), I even prefer the Eleven Labs one over the training script one from Descript. Yes, the accent is a bit off, but your voice is pretty good.