Yes!! replace the traditional TTS! Please bring this in google play books! I would love to have my books being read to me like an audio book! Game changer!
When is not quite seamless you are prompting it to speak that way in the Open I want you don’t have to prompt it. It automatically adapts to you and it’s voice to voice speech to speech. This one seems like text to speech.
Actually with advanced voice I was having it speak english, french, elvish, and simlish in one sentence. The actual game-changer is being able to prompt the AI to do this. You can do this through voice commands with OpenAI, but for some reason ignored the ability to prompt for voices. Plus I think the whole 'her' situation got them rattled from voices almost altogether.
I've just figured it out as I have to change from "Create Prompt" to "Stream Realtime" in the left pane. However I can't seem to change the audio effect. Whispering doesn't work with me although it is demonstrated in the Google post. How can we add these audio effects?
Would be better and more impressive if it kept the same voice or character when switching languages rather than using a totally different voice for each language. But all fun and looking forward to using this model.
Yes!! replace the traditional TTS! Please bring this in google play books! I would love to have my books being read to me like an audio book! Game changer!
Up!
the speed of this is astonishing
This is how the world ends... one perfectly-pitched whisper at a time 😊
Yeah remember when world ended when we had text to speech for years
Ill take it.
The whisper… all those ASMR youtube videos were useful
Its the early Christmas gift from Google
SOO exited for an alternative for OpenAI advanced voice!!!!
When is not quite seamless you are prompting it to speak that way in the Open I want you don’t have to prompt it. It automatically adapts to you and it’s voice to voice speech to speech. This one seems like text to speech.
@@TECHNOSTARTERSS this one is not text to speech. that's why the title is "Native audio output".
And you even change the order of the speakers, making the female voice lead. Kudos to the CX team. Wittily played!
A man of Native American ancestry has been feeding all AI developers for the last decade, behind the scenes.
Yes! Please use this for the different UA-cam soundtracks, because right now the generated ones are really bad
when is available ?
As soon as Google kill their older products
I think it already is
@@aquilesdg4305 And _where_ exactly is it available?
It's available right now in Google ai studio @@aquilesdg4305
Right now on AI Studio
Her is truly here
it's him & her .... hhhhhhhhh
This isnt new but maybe its better than what was out there before! Cant wait to try it
Exceptional work 👏 love the example of the model intelligently adapting to fit the speed of reply it thinks you need.
I will build my IA girlfriend now 😂
Haha yes
Ah yes, Irtafacial Antelligence
@@games528 This is funnier than it should be
@@games528 In many languages, the adjective comes after the noun.
@@games528 It's called IA in many languages
Whoever made this video is cool
2:32: damnnn ok am I'm impressed 👌 👏
This will put so many call-center agents out of work worldwide
Great! Now we can hear non-sensical facts from history with native audio output!
Google doing what OpenAI said they would months ago but we still didnt get 👏
Actually with advanced voice I was having it speak english, french, elvish, and simlish in one sentence. The actual game-changer is being able to prompt the AI to do this. You can do this through voice commands with OpenAI, but for some reason ignored the ability to prompt for voices.
Plus I think the whole 'her' situation got them rattled from voices almost altogether.
? OpenAI Advanced Voice mode is already out
0:01 Has anyone find "Video understanding" demo? All other topics have a video on this channel
That thumbnail goes hard
GOOGLE ships! Pixel phone stocks jumping!
level 4 AI is the next big thing, thats when Science gonna get intresting 😎
lets start building with gemini 2.0
Now that's impressive
Mind blown. Playing with it now :)
woah this is so cool
I don't have the "Output Format" and "Voice" options under "Model" in the AI Studio. I just have the "Token Count" immediately after Model.
I've just figured it out as I have to change from "Create Prompt" to "Stream Realtime" in the left pane. However I can't seem to change the audio effect. Whispering doesn't work with me although it is demonstrated in the Google post. How can we add these audio effects?
Bro this thing is crazyyy 😱
Pretty epic
This is so lovely
is the model in the video the flash version? i am unable to get it to whisper or laugh or change how it speaks
Same here
I think that's the part that is available in January. It is a bit confusing since they ended with saying to go to ai studio...
@@ShawnFumo Oh ok I see! Thanks for clarifying
ok that Hindi was nice, damn. Is there a paper on this?
haha yoo that's dope ..well done
This is AVM but with less restrictions
nicely done
I cant help myself but feel like the next decade is going to get very weird
How do we get it?
Hopefully it's not another lie.
We all remember the Gemini "hands on demo".
Look amazing, but did anyone get it to work in the actual AI studio? I ran into a ton of bugs, especially with non-English languages.
incredible❤❤
I tried but it does not whisper yet.
who's going to fill the hollow the emptiness? idk something is super weird bout ai generated audio trying to be friendly and humanly..
한국어 스피치는 조금 부자연스럽군요
❤
I wonder why I am able to use it on my pc, but my phone doesnt have 2.0 unlocked 😮💨
Gimme my sandevistan , time to get chromed up
So you're saying it can roleplay as a furry femboy fox? Asking for a friend of course
This is how tts ends😂😂😂
I need an agent that can use any program on my computer.
Just give us AIOS.
Exactly
Exactly, but yes its already out , but for browser so far , and not released yet .... I hope google releases one :0
I love it
Oh Calculon
Nothing that OpenAI's model can't do so far, but hey more competition is better for everyone!
Hopefully it has cheaper API access. I blew through so much money just testing a few use cases of the OpenAI audio model through the API.
AGI is coming soon 2025
Dude, I swear. Like, that clock slaps 12 on, idk, january 5? I swear AGI will be here. Or anytime afterward.
it does not support Sinhala language!
Oh well.
hell yeah
Would be better and more impressive if it kept the same voice or character when switching languages rather than using a totally different voice for each language. But all fun and looking forward to using this model.
Surely this won't be used to cause suffering at the expense of talented voice actors just so rich creeps get even richer. Surely.
😮
I haven’t found Chinese 😂
I don’t like the tone, cadence structure. They should call it “podcast voice”
Wow! That is completely pointless!
What.. are you even talking about. How is this pointless?
Copium
the most exciting thing 🎉