Open AI creates PERFECT Voice Clones - Incredibly Emotive!

  • Published Mar 29, 2024
  • Use code MATTVIDPROAI at the link below to get an exclusive 60% off an annual Incogni plan: incogni.com/mattvidproai
    Thank you Incogni for sponsoring this video.
    ▼ Link(s) From Today’s Video:
    Open AI Voices: openai.com/blog/navigating-th...
    Grok 1.5: x.ai/blog/grok-1.5
    Elon's boast about Grok 2: / 1773655245769330757
    Universal Claude 3 Jailbreak: / 1773455789056745782
    Amazon invests in Anthropic: / 1773030824927015369
    ► MattVidPro Discord: / discord
    ► Follow Me on Twitter: / mattvidpro
    -------------------------------------------------
    ▼ Extra Links of Interest:
    ✩ AI LINKS MASTER LIST: www.futurepedia.io/
    ✩ General AI Playlist: • General MattVidPro AI ...
    ✩ AI I use to edit videos: www.descript.com/?lmref=nA4fDg
    ✩ Instagram: mattvidpro
    ✩ Tiktok: tiktok.com/@mattvidpro
    ✩ Second Channel: / @matt_pie
    -------------------------------------------------
    Thanks for watching Matt Video Productions! I make all sorts of videos here on YouTube! Technology, Tutorials, and Reviews! Enjoy your stay here, and subscribe!
    All Suggestions, Thoughts And Comments Are Greatly Appreciated… Because I Actually Read Them.
    -------------------------------------------------
    ► Business Contact: MattVidProSecond@gmail.com
  • Science & Technology

COMMENTS • 299

  • @MattVidPro
    @MattVidPro  2 months ago +5

    What do you guys think of Open AI's Voice tech? Use code MATTVIDPROAI at the link below to get an exclusive 60% off an annual Incogni plan: incogni.com/mattvidproai Thank you Incogni for sponsoring this video.

    • @HiddenPalm
      @HiddenPalm 2 months ago

      I believe Elon Musk being a staunch supporter of the most horrid genocide of your lifetime, should be a warning your audience should get before marketing or reviewing his properties and assets to the public.
      Over 32,000 Gazans have been mass murdered in a US-sponsored genocide, 2/3rds of which are women and children. This genocide is still ongoing.

    • @seakyle8320
      @seakyle8320 2 months ago +3

      german has a really really strong US accent.

    • @shinniehildebrand
      @shinniehildebrand 2 months ago +2

      Can confirm about the German... very strong US accent, especially on how it pronounces the Ls

    • @GaryJr530
      @GaryJr530 2 months ago +2

      bro so close to 250k!

    • @NotThatVinny
      @NotThatVinny 2 months ago +1

      Looking forward to it.
      I can say that it's better at translation than I can muddle through it.
      I would like to see a feature where you can adjust the tone and gradient on certain parts of the audio though.

  • @huni_19
    @huni_19 2 months ago +36

    I'm from Kenya. Swahili is our native language while "Sheng" is popular slang. They did a good job. I'm impressed!

  • @FunIsGoingOn
    @FunIsGoingOn 2 months ago +64

    Well as a german I was impressed by the spanish one and disappointed by the german one, that sounded like a dutch trying german.

    • @cyroc6705
      @cyroc6705 2 months ago +16

      The German one has a good natural rhythm to it, but the voice has a distinct accent which makes it sound non-native

    • @highdefinist9697
      @highdefinist9697 2 months ago +10

      Yeah the German was pretty bad... it sounded like some weird, inconsistent English accent: Some words basically fine, some were only slightly off, and a few, for example "Kulturen" and "alle" had a really strong American accent, and the accent was the same every time she said the same word. (Also, I believe I have seen enough Anime to judge that the Japanese voice likely suffers from the same problem...)

    • @googleSux
      @googleSux 2 months ago +3

      I agree with you.

    • @seakyle8320
      @seakyle8320 2 months ago +1

      Meddl loide

    • @fodiographer
      @fodiographer 2 months ago +1

      Ich spreche Deutch sehr gut mein freund

  • @tylerchambliss8379
    @tylerchambliss8379 2 months ago +11

    Hey Matt. The Voice Engine audio sounds low quality because of the audio they're feeding it: that source sounds like a teacher recording in a room on a crappy laptop mic. That's actually the impressive part. Not only is it very emotionally and phonetically accurate to how the guy in the source recording sounded, it also mimics the edited sound of the audio and the conditions of the recording. As an audio engineer, I find this insane.

  • @ahmedkagabo
    @ahmedkagabo 2 months ago +19

    The German and French versions were not good. I was a bit surprised by the Swahili version, which was a bit better. Open AI still has a lot of work to do on non-English languages.

  • @amkire65
    @amkire65 2 months ago +3

    In some cases, where the generated audio sounded low-quality, the original didn't sound like a studio recording, either. I guess, it was doing as good as it could with what it had to work with. Amazed by the fact that you can "give someone back their voice" using such a small amount of audio content, and the way people are always recording themselves these days, we probably all have at least 18 seconds of audio... if not, put some aside as an archive for the future, just in case.

  • @WeissM89
    @WeissM89 2 months ago +4

    I'm surprised no one's talking about how cool this is for patients with speech impairments.

  • @DarkandTwisted
    @DarkandTwisted 2 months ago +6

    Too many safety concerns with OpenAI. That's the only reason I am not too excited.

  • @AISpeculator
    @AISpeculator 2 months ago +93

    Clearly NOBODY is reading the blog post... when Voice Engine does translation it RETAINS the accent of the original speaker from their native language. Feature, not bug.

    • @user-vj5fb3ig4z
      @user-vj5fb3ig4z 2 months ago +12

      yeah, but why would anybody want that "feature". not me. I don't see any use for it.

    • @RackerTheRascalMashup
      @RackerTheRascalMashup 2 months ago

      @user-vj5fb3ig4z then Don't use it lol

    • @justinwescott8125
      @justinwescott8125 2 months ago +19

      I like hearing people with accents. I don't want every person I talk to, to have the exact same accent.

    • @thepermman
      @thepermman 2 months ago +6

      @@user-vj5fb3ig4z Imagine Mr. Miyagi sounding like Arnold from Happy Days. Accents are charming.

    • @Cradien
      @Cradien 2 months ago +5

      @@user-vj5fb3ig4z
      I can imagine it could be useful for dubbing. For example, MrBeast dubs his videos into Spanish and Portuguese and posts them on different channels; with something like this he could have those dubbed videos be in HIS voice.

  • @TheSopk
    @TheSopk 2 months ago +4

    The most important aspect is the input; if you have an emotional voice in the input, the output would sound amazing. I'd like an AI that enhances voice input to make it more emotional. The input at 7:15 sounds monotone.

  • @draken5379
    @draken5379 2 months ago +4

    The thing is, ElevenLabs is a product, not research, if that makes sense.
    Those samples from OpenAI are the raw outputs from the model, whereas with something like ElevenLabs, you can be sure they have an insane pipeline to take the raw outputs from their models and clean them up. You could even create custom neural networks for this task, etc.
    Also, you can try Voice Engine. You can use it via OpenAI's APIs, but you don't get to provide a reference; you can only pick from a selection of provided voices. It's what powers ChatGPT voice.

  • @blackestjake
    @blackestjake 2 months ago +6

    🎵 Everyone together, sing it with me! 🎵
    🎵This is the worst it’s ever gonna be! 🎵

  • @gerkim3046
    @gerkim3046 2 months ago +3

    that swahili one is amazing! one can tell it is ai generated but it is still so good.

  • @surreal_dreams
    @surreal_dreams 2 months ago +29

    French here. I confirm that the french voice does have a weird accent, but that's honestly still very good.

    • @dahozabich
      @dahozabich 2 months ago +6

      The french had a hint of an american english person trying to speak canadian french. I am fluent in both.

    • @DeGandalf
      @DeGandalf 2 months ago +1

      Same for german

    • @MarcusBuer
      @MarcusBuer 2 months ago +1

      Same for portuguese. It sounds right, but has stops at the wrong places.

    • @testboga5991
      @testboga5991 2 months ago +1

      German has the same weird American accent

    • @cagnazzo82
      @cagnazzo82 2 months ago +2

      @@testboga5991 I'm leaning towards the native accents being intentional. In a way it sounds more authentic.

  • @AkariTheImmortal
    @AkariTheImmortal 2 months ago +2

    The German one, while I do like the intonation and all, it definitely has a strong accent. Without that accent, this could've been the best AI generated voice translation, I've heard.

  • @vainezaiven6677
    @vainezaiven6677 2 months ago +2

    I mean, if they're going to wait until all of their "conditions" are met before they release this voice engine, then they're never actually going to release it.

  • @esuus
    @esuus 2 months ago +2

    German: light accent, like an American who's lived in Germany and spoken German fluently for 3+ years. This is also what chatGPT sounds like when it speaks German.
    French: light accent, maybe a tiny bit thicker than German.
    I don't speak Spanish but that accent sounded very very heavy to me.
    Is this because they were trained with American voice actors speaking other languages, or does this happen naturally when an english trained model speaks another language? That would be fascinating.

  • @3rdeyesociety
    @3rdeyesociety 2 months ago +1

    @MattVidPro my boy done sauced up in that sponsor message, chain looks good bro 💪

  • @sarahhardwig2765
    @sarahhardwig2765 2 months ago +17

    As a regular ChatGPT voice chat user, I can definitely tell that the quality of the audio generated from the reference audio is very reminiscent of GPT voice chats. It doesn't necessarily have the best quality, but I know it can be better, as proven by companies like ElevenLabs. And another thing: I think ElevenLabs' translation feature can be a little bit iffy when it comes to how natural a person's voice sounds once it's translated to another language. However, for Voice Engine in particular, I was very shocked to hear how natural a voice still sounded after being used to translate something into another language. I also found the Americanized pronunciation of some words in other languages (German, Chinese, Spanish, and others) to be particularly funny, but I think AI can definitely progress past that point.

    • @justinwescott8125
      @justinwescott8125 2 months ago +4

      Everyone is missing this, but the accent is on purpose. The blog post says that the languages will retain the accent of the original speaker.

    • @michaelsimonsen2017
      @michaelsimonsen2017 2 months ago +2

      @@justinwescott8125 I'm really happy about that. Retain accents so people can express their backgrounds.

    • @kuromiLayfe
      @kuromiLayfe 2 months ago +1

      I've heard many multilingual people speak, and their accents and tone of voice tend to differ between languages. If the voice has the same accent and tone, it most likely is AI generated and not a recording.

    • @brexitgreens
      @brexitgreens 2 months ago

      ​@@kuromiLayfe Only in Japanese. Sounding like an idiot is mandatory in that language. Joking aside, you are right. However I strive to maintain the same voice and prosody across all languages without hurting pronunciation. I just steer clear of Japanese.

  • @renan777
    @renan777 2 months ago +5

    I speak English and Portuguese, and man, English with a Portuguese accent is amazingly good!

    • @32rq
      @32rq 2 months ago

      I thought the one I understand might seem worst, but the Portuguese was great!

  • @MadsterV
    @MadsterV 2 months ago +1

    Spanish reference sounded stilted and unnatural, while the generated audio sounded VERY natural. Weird.
    Spanish from English reference had a slight English accent, which is very interesting and I hope it keeps doing that.

    • @claudioestevez61
      @claudioestevez61 2 months ago

      I confirm this. The first Spanish already sounds generated and the AI sounds more natural in comparison. In the second sample, the voice has an English accent.

  • @ananthakrishnank3208
    @ananthakrishnank3208 2 months ago

    A small dedicated segment covering the overall flow of the model architecture would be great.
    If you have the domain knowledge, it would be even greater to discuss the "why"s regarding the working of the model.
    The demos were amazing!

  • @matters-and-facts
    @matters-and-facts 2 months ago

    I've been using the voice generator in the Simplified app. and it sounds like me but it does have a bit of difficulty with emoting, but it's not a huge problem, so it works for me.

  • @marwangs686
    @marwangs686 2 months ago +14

    OpenAI is writing its name into the history of the beginning of artificial intelligence.

    • @treudden
      @treudden 2 months ago +2

      OpenAI has been in the lead for 2 years

    • @ryzikx
      @ryzikx 2 months ago +3

      her 😂

    • @helix8847
      @helix8847 2 months ago +3

      @@treudden Not anymore... ElevenLabs shits on OpenAI Voice and Claude 3 shits on GPT 4. While Gemini has 1 million token count and is also very good. ClosedAI better hurry up otherwise they will be left behind.

    • @helix8847
      @helix8847 2 months ago

      How dare you assume its gender..!!

    • @bigglyguy8429
      @bigglyguy8429 2 months ago

      @@helix8847 Gemini is terrible.

  • @Jay33721
    @Jay33721 2 months ago

    Man, you should really get the DarkReader extension. This video was very very bright lol.

  • @JhonataCosmo
    @JhonataCosmo 2 months ago +3

    I'm Brazilian and the Portuguese part wasn't as good as ElevenLabs.

  • @SebSenseGreen
    @SebSenseGreen 2 months ago +2

    French one has an accent but it's really good, like a non-native with a high level French.

  • @Black-Re4per
    @Black-Re4per 2 months ago +4

    The German sounds like someone with a very heavy American accent, but otherwise it was correct.

  • @Dron008
    @Dron008 2 months ago

    That's great that we have a competition here. We'll see soon what Meta and Apple show.

  • @CozyChalet
    @CozyChalet 2 months ago

    I am waiting for a time that I could listen to the text part of my ebooks with ease. I use Apple’s screen reader but it’s painful.

  • @tehPlacebow
    @tehPlacebow 2 months ago

    Yo matt! Im curious what your opinion is on the best local TTS software? :D

  • @blindstreet
    @blindstreet 2 months ago

    The audio quality depends on the source quality being fed into it.

  • @XetXetable
    @XetXetable 2 months ago

    When they say "preset voices", I'm pretty sure they're referring to the built-in TTS voices that all OSs come with by default. You know, Microsoft Sam and friends; the light-weight handcrafted roboty voice that screen readers default to.

  • @IceMetalPunk
    @IceMetalPunk 2 months ago

    That is the most realistic TTS I've heard so far! How much do you want to bet *this* is the model being used in Figure 01?

  • @NirvanaFan5000
    @NirvanaFan5000 2 months ago +1

    the voices have a good cadence but low overall clarity quality... still very impressive

    • @NirvanaFan5000
      @NirvanaFan5000 2 months ago +1

      p.s. once we get good translation and audio for all the world languages, it's going to have a huge impact. e.g. I work with immigrants from east africa. many barely speak english and may have never used a computer in their life. it is very difficult for them to learn to use. having a computer they can just talk to in their native language can mean the difference between computer usage or none at all.
      Right now we have good auto-translate for around 100 languages (which do represent the majority of the planet), but researchers are now working on the next 1,000. (Then there are still a lot of tiny, local languages.)

  • @MoDs_3
    @MoDs_3 2 months ago

    Looks like we've all been successfully SHOCKED! 😅
    Amazing!

  • @Glowbox3D
    @Glowbox3D 2 months ago +1

    I love how Anthropic pulls ahead, great competition all around, we all win. I've had GPT-4 for some time now, I've loved its abilities, but DALL·E being added to the package is the clincher. If Opus added an image gen to their product, I would definitely move over to them. That is...until SORA comes out...see? What do we do?

    • @AkikoAika
      @AkikoAika 2 months ago

      Worth adding also: Not that I'm super into benchmarks (I feel a bit guilty nitpicking on this): When mentioning the domination of Claude 3 Opus even in comparison to GPT-4, this is in comparison to GPT-4's original paper back in early 2023. From what I understand, GPT-4 Turbo is much better, e.g. on HumanEval & others (can search up "EvalPlus Benchmark", which also has the original HumanEval benchmark).

    • @brexitgreens
      @brexitgreens 2 months ago

      Do we all win by Stability AI collapsing under the weight of competition? I'm not sure about that.

    • @Glowbox3D
      @Glowbox3D 2 months ago +1

      @@brexitgreens bit bummed, my buddy works over there, and I'm rooting for 'em!

    • @brexitgreens
      @brexitgreens 2 months ago +1

      @@Glowbox3D Only bad guys are _not_ rooting for them.

    • @brexitgreens
      @brexitgreens 2 months ago +1

      Speaking of Anthropic getting their own image generator - they are allied with Amazon and Amazon already has their own named Titan. Not many people know. In terms of quality, it's between DALL·E 2 and 3. Comparable to SD XL.

  • @tanahirygallardohuizar3981
    @tanahirygallardohuizar3981 1 month ago

    Wild, is Alexa's new gig gonna be a voiceover actor?

  • @leandro3710
    @leandro3710 2 months ago +2

    Brazilian here, brazilian portuguese is sounding very good!

    • @Kiiush
      @Kiiush 2 months ago +1

      It seemed robotic to me, I mean, without emotion, IDK, it was a bit weird the way he was finishing the sentences

    • @goldenhok
      @goldenhok 2 months ago +3

      ​@@Kiiush Brazilian here, normally people talk more robotic in a studio setting even the reference audio is not that normal sounding, in a studio you normally try to be very formal and say every syllable in this monotone way, which is not how people talk irl

  • @gizmomismo7071
    @gizmomismo7071 2 months ago

    In Spanish, it clearly has an English accent, but it sounds very natural... love it!

  • @alvaroluffy1
    @alvaroluffy1 2 months ago

    It's good, but especially in the translation there's some Englishness that filters into the translated languages; you can notice it in all languages, actually.

  • @sunnywest28
    @sunnywest28 2 months ago +14

    As someone fluent in Japanese, I thought the Japanese audio you showed sounded very foreign and not Japanese. Not good quality 😭

    • @justinwescott8125
      @justinwescott8125 2 months ago +8

      That's on purpose. The blog post specifically says that the accent of the original speaker is maintained. It's supposed to sound like an American speaking Japanese.

    • @southcoastinventors6583
      @southcoastinventors6583 2 months ago

      Japanese is 100% pure 外人 or put another way 日本語上手. Was waiting for it to say さようなら at the end.

    • @brexitgreens
      @brexitgreens 2 months ago

      "The Japanese audio didn't sound Japanese as someone fluent in Japanese"? Maybe try to learn English first.

  • @angelgarcia3410
    @angelgarcia3410 1 month ago

    *whispers* Did my phone start imitating people?

  • @Flizyx
    @Flizyx 2 months ago

    the english to spanish one is tricky, it sounds like spanish but with english accent, so not full spanish

  • @budekins542
    @budekins542 2 months ago

    This will be the "SORA" of A.I voice generation😂

  • @Joe-SoftwareEngineer
    @Joe-SoftwareEngineer 2 months ago

    7:24 my native language is spanish, and I understand what she's saying but at times it sounds like an american who is learning spanish and hasn't fully mastered the "r" sounds. When she says "aporta" and "importar", all 3 letter "r" sound like an english "r" rather than spanish.

  • @alexanderalcantara3932
    @alexanderalcantara3932 1 month ago

    Creepy, or the future of virtual assistants?

  •  2 months ago

    🇫🇷🇪🇸 For French and Spanish, there's a strong American accent while speaking these languages, hope trained data gets broader to improve audio generation !

  • @hitmusicworldwide
    @hitmusicworldwide 2 months ago +1

    The Chinese is a hair better than the French, Japanese and German accent wise. The original English model occasionally overcomes the actual Chinese weights. A- for the Chinese. B+ for the others to me. Portuguese is an A match to the initial voice. Forget emotive I'm happy about diversity. Pi P8 is the standard for diversity in English.

  • @damondragon324
    @damondragon324 2 months ago +1

    The german one has a strong accent. But it's understandable.

  • @ThomasJDavis
    @ThomasJDavis 2 months ago

    Killer App: voice cloning for texting.

  •  2 months ago

    The Portuguese had inflections in the wrong places, it's pretty good, tho.

  • @cesarsantos854
    @cesarsantos854 2 months ago

    I can confirm Portuguese sounds natural.

  • @JREinaNutshell331
    @JREinaNutshell331 2 months ago

    I still miss the option to give a prompt besides the information i want it to voice. Something like "sound angry, sound drunk, make long pauses, etc"
    Btw: The German generated text was horrible, it sounded like an american trying to speak german.

  • @kenrock2
    @kenrock2 2 months ago +1

    I wish Stephen hawking was alive to use this voice box

  • @aaronanimations9527
    @aaronanimations9527 2 months ago +1

    Who do you think is still ahead of the competition matt?

  • @user780-98
    @user780-98 2 months ago

    I can only comment on the English audio. It was surprising that the text didn't have punctuation other than periods, and it still knew where to take a short or long pause.

  • @nachod9772
    @nachod9772 2 months ago +3

    as a native speaker i can tell spanish version is so fcking good

    • @brexitgreens
      @brexitgreens 2 months ago

      Finally someone using the "as" construction correctly: with the subject ("I") agreeing in both clauses. Very rare in 2024.

  • @wenhanzhou5826
    @wenhanzhou5826 2 months ago +2

    The Mandarin version sounds like an English speaker who got into university and studied Chinese for a couple of years.

    • @bastienpetit5161
      @bastienpetit5161 2 months ago +1

      Did the AI nail the tones at least?

    • @ruizhao5057
      @ruizhao5057 2 months ago +1

      @@bastienpetit5161 It did nail the tones, that's a low standard for ai though.

    • @ChristianIce
      @ChristianIce 2 months ago

      I guess that was the point.

  • @bgill7475
    @bgill7475 2 months ago +2

    The Mandarin one was good but it sounds kinda American...

  • @EQORIA
    @EQORIA 2 months ago

    What do you think about Singularity Intelligence for New Earth (QORA*) for EQORIA, United Earth? It is the first introduction of the vision and more to come... check it on youtube channel. EQORIA will begin promotion to mass media beginning December 12, 2024 on 12 year anniversary.

  • @noeaguilar5945
    @noeaguilar5945 2 months ago

    The Spanish cloned voice sounds amazing 😍, the best one I've ever heard. Edit: the translation is pretty bad.

  • @Youtuber-lh3ky
    @Youtuber-lh3ky 2 months ago

    The Spanish translation has a very strong accent.

  • @rishabhsingh1406
    @rishabhsingh1406 2 months ago

    What is your opinion on Emad leaving Stability AI? Do you think open source will become less and less competitive over time?

  • @devlisandro
    @devlisandro 2 months ago

    GCP has something similar but not that emotional

  • @martianingreen
    @martianingreen 2 months ago

    8:30 The German has a really thick accent (it basically sounds like an American one). Doesn't sound fantastic tbh, at least not if the goal is very good dubbing / translation. But it sounds good as AI voices go.

  • @nonetrix3066
    @nonetrix3066 2 months ago +7

    I am learning Japanese, and to me, as someone who has listened to it a lot, it sounds really strange accent-wise.

    • @brexitgreens
      @brexitgreens 2 months ago

      Your user image leaves no doubt about it.

  • @robertsousasantos6766
    @robertsousasantos6766 2 months ago

    The Portuguese came out perfect, identical to a real person speaking in a real recording; it was truly perfect.

  • @dot_zithmu
    @dot_zithmu 2 months ago

    In Chinese style, this situation is called "Million Model Warfare".

  • @IM2awsme
    @IM2awsme 2 months ago

    I just wish I could set the playback speed 😅 I use text to speak because I read slow, I shouldn't be outpacing the ai.

  • @youtube_moderator
    @youtube_moderator 2 months ago

    Parity-wise, Elevenlabs is better at most of the multilingual voice cloning, although I was especially impressed by the quality of intonation and pauses in the first English example.
    On a side note, voice recovery is not new - it's just voice cloning from old footage but it unfortunately retains the bad audio qualities from the same footage. It would have been more impressive to have just cloned the woman from her post brain-damaged voice in this particular case. Or even better blended them both together but maybe using EQ matching.

  • @microcontrolledbot
    @microcontrolledbot 2 months ago

    Bro, have you not been talking to OpenAI in the app? Their voices have been around for like 6 months.

  • @RobinRehmann
    @RobinRehmann 2 months ago +18

    The German wasn't really good

    • @AiVaultGuy
      @AiVaultGuy 2 months ago

      I'm a native Spanish and Portuguese speaker, and the voice pronunciation sounds horrible, not natural at all

    • @Onoma314
      @Onoma314 2 months ago

      I'm kinda wondering how this would handle speech when it comes to text like a list of ingredients off a cereal box. Would sound odd being emotive

    • @smartduck904
      @smartduck904 2 months ago

      I could tell too they sound very robotic

    • @alexkaa
      @alexkaa 2 months ago +1

      True. German was bad...

    • @resumindo857
      @resumindo857 2 months ago +2

      Spanish neither

  • @Wasaia
    @Wasaia 2 months ago

    Just wanted to compliment you on your audio quality using the RE20. Really good clarity and not boomy.

  • @xponentialdesign
    @xponentialdesign 2 months ago

    The French voice sounds like it's read by an English speaker

  • @fredthomson2384
    @fredthomson2384 2 months ago

    Audible is in big trouble.

  • @life_is_a_ride
    @life_is_a_ride 2 months ago

    In Spanish it still sounds robotic... I speak Spanish and you speak English, which is why it sounds more real to us in the language we don't know well. I understand English very well.

  • @mralbertteacheralbert8619
    @mralbertteacheralbert8619 2 months ago

    @7:25 It's spanish but sounds very much like a white person speaking spanish (it has an accent). The same way a foreigner sounds when they try speaking English. I have been teaching ESL 3 years, believe me I can hear the accent.

  • @StoreHouseApp
    @StoreHouseApp 2 months ago

    You didn't even talk about one of the most impressive features, the translated language has an accent!

  • @manzell
    @manzell 2 months ago

    I want to see more foreign language stuff translated into English so I can evaluate it.

  • @ykles24
    @ykles24 2 months ago +1

    French one is an american talking french.

  • @markmuller7962
    @markmuller7962 2 months ago

    "On mars by next week" Elon in a nutshell

  • @silencedandshadowbanned7277
    @silencedandshadowbanned7277 2 months ago

    Ham sandwich here I can confirm the Swahili is a weird accent

  • @dot_zithmu
    @dot_zithmu 2 months ago

    Now I know, Musk's Grok-1 is fairly early.

  • @STONJAUS_FILMS
    @STONJAUS_FILMS 2 months ago

    I speak Spanish and Chinese, and they sound like an American accent speaking those languages … but in a way that's very understandable, as if they learned the language very well but did not manage to get rid of the accent … It did not sound robotic to me, if that's the concern … the accent might bother some natives

  • @cagnazzo82
    @cagnazzo82 2 months ago +2

    As a french and english speaker the french that they spoke was with the woman's american accent... therefore making it impressive.
    It intentionally tries to mimic the original speaker's intonations, making it sound more like them but pretty much transmitting their native tongue accent to different languages.

    • @panomaniac5399
      @panomaniac5399 2 months ago

      Agree, it sounds like an American woman who speaks very good French, but with an accent. Not an overly strong accent, but definitely identifiable as a North American English speaker.

    • @cobb8613
      @cobb8613 1 month ago

      Where can we try ChatGPT voice? I'm French, and I want to check whether the English accent disappears…

  • @abdulbyrd7902
    @abdulbyrd7902 2 months ago

    The spanish one is full gringo, eleven labs is way further along in this aspect

  • @Scott-Zakarin
    @Scott-Zakarin 2 months ago

    To me, the voice generator sounds like a voice generator, for all its emotive capabilities. Far from human.

  • @guerric
    @guerric 2 months ago +1

    The French one is weird. It sounds robotic and as a French I don't see what accent it is, doesn't sound like France French, doesn't sound like Canadian French, just weird
    Japanese seems to have a very heavy accent too that I can't categorize
    Spanish sounds very bad imo
    The translation keeps a weird accent that makes it sound very weird. I don't think it's the woman's American accent that is kept but a weird mix that is quite uncanny

  • @rikorobinson
    @rikorobinson 2 months ago

    The Mandarin sounded a little monotonous to me, but take anything I say with a grain of salt. I'm somewhat new to learning the language.

  • @PuppiesAreNice.
    @PuppiesAreNice. 2 months ago

    It is so strange to hear ai generated voices with an accent. like im german and the german one sounded like an american tried to read out german text

  • @EndonTruth
    @EndonTruth 2 months ago

    The second Spanish chick (AI) speaks better than the human lol sounds like a Mexican children's author. The translation from English to Spanish, has brutal pronunciation.

  • @jumbleblue
    @jumbleblue 2 months ago

    German has English accent. Slight. French too.

  • @thiagoduarte610
    @thiagoduarte610 2 months ago +1

    Portuguese was great

  • @ryglitheegg
    @ryglitheegg 2 months ago +1

    Nice

  • @justinwhite2725
    @justinwhite2725 2 months ago +4

    I speak French. Sounds like an Anglophone (English speaker) who has learned French (which the boilerplate on that video says is the intent)
    ... Though it still rolls the rs better than I do.

  • @pierre-samuelgreau-hamard6379
    @pierre-samuelgreau-hamard6379 2 months ago

    French is still a bit robotic, and the voice seems to speak with a slight english accent.

  • @muzy8768
    @muzy8768 2 months ago

    I think the translated audio keeps the original accent and sounds a bit weird and not perfect

  • @StefanSchmidtRegensburg
    @StefanSchmidtRegensburg 2 months ago

    German has a strong american accent. Just like the voice out from ChatGPT

  • @AlexanderWeixelbaumer
    @AlexanderWeixelbaumer 2 months ago

    In the german example the "r" was pronounced like it was an english word. Germans pronounce the r much harder.