Building with Gemini 2.0: Native audio output

Поділитися
Вставка
  • Опубліковано 11 гру 2024

КОМЕНТАРІ • 97

  • @raghavamrev5245
    @raghavamrev5245 14 годин тому +70

    Yes!! replace the traditional TTS! Please bring this in google play books! I would love to have my books being read to me like an audio book! Game changer!

  • @maxcomperatore
    @maxcomperatore 15 годин тому +55

    the speed of this is astonishing

  • @Tinman462
    @Tinman462 17 годин тому +116

    This is how the world ends... one perfectly-pitched whisper at a time 😊

    • @sj00100
      @sj00100 16 годин тому +11

      Yeah remember when world ended when we had text to speech for years

    • @vectoralphaSec
      @vectoralphaSec 16 годин тому +1

      Ill take it.

    • @DaveK-q9y
      @DaveK-q9y 8 годин тому +2

      The whisper… all those ASMR youtube videos were useful

  • @hahoang9542
    @hahoang9542 16 годин тому +21

    Its the early Christmas gift from Google

  • @BeepBeepBeepbop
    @BeepBeepBeepbop 13 годин тому +12

    SOO exited for an alternative for OpenAI advanced voice!!!!

    • @TECHNOSTARTERSS
      @TECHNOSTARTERSS 6 годин тому

      When is not quite seamless you are prompting it to speak that way in the Open I want you don’t have to prompt it. It automatically adapts to you and it’s voice to voice speech to speech. This one seems like text to speech.

    • @raydosson2025
      @raydosson2025 Годину тому

      @@TECHNOSTARTERSS this one is not text to speech. that's why the title is "Native audio output".

  • @AIrtesan
    @AIrtesan 16 годин тому +15

    And you even change the order of the speakers, making the female voice lead. Kudos to the CX team. Wittily played!

  • @michaelcharlesthearchangel
    @michaelcharlesthearchangel 16 годин тому +16

    A man of Native American ancestry has been feeding all AI developers for the last decade, behind the scenes.

  • @Momixer
    @Momixer 9 годин тому +2

    Yes! Please use this for the different UA-cam soundtracks, because right now the generated ones are really bad

  • @LaPetiteCuillère
    @LaPetiteCuillère 16 годин тому +26

    when is available ?

    • @HUEHUEUHEPony
      @HUEHUEUHEPony 16 годин тому

      As soon as Google kill their older products

    • @aquilesdg4305
      @aquilesdg4305 15 годин тому +7

      I think it already is

    • @JaBigKneeGap
      @JaBigKneeGap 14 годин тому

      ​​@@aquilesdg4305 And _where_ exactly is it available?

    • @Ethereal_Enigma
      @Ethereal_Enigma 12 годин тому

      It's available right now in Google ai studio ​@@aquilesdg4305

    • @MainInternetUser
      @MainInternetUser 11 годин тому +1

      Right now on AI Studio

  • @aron2922
    @aron2922 11 годин тому +8

    Her is truly here

    • @__J____ff
      @__J____ff 48 хвилин тому

      it's him & her .... hhhhhhhhh

  • @CODE7X
    @CODE7X 14 годин тому +3

    This isnt new but maybe its better than what was out there before! Cant wait to try it

  • @aiforculture
    @aiforculture 11 годин тому +1

    Exceptional work 👏 love the example of the model intelligently adapting to fit the speed of reply it thinks you need.

  • @OumarDicko-c5i
    @OumarDicko-c5i 15 годин тому +31

    I will build my IA girlfriend now 😂

    • @CODE7X
      @CODE7X 14 годин тому

      Haha yes

    • @games528
      @games528 13 годин тому +17

      Ah yes, Irtafacial Antelligence

    • @aron2922
      @aron2922 11 годин тому +1

      @@games528 This is funnier than it should be

    • @IceMetalPunk
      @IceMetalPunk 5 годин тому +2

      ​​@@games528 In many languages, the adjective comes after the noun.

    • @flyingstapler1241
      @flyingstapler1241 5 годин тому +2

      ​@@games528 It's called IA in many languages

  • @friedpizza262
    @friedpizza262 15 годин тому +6

    Whoever made this video is cool

  • @ShubharthakSangharsha
    @ShubharthakSangharsha 4 години тому +1

    2:32: damnnn ok am I'm impressed 👌 👏

  • @ThomasOberhoff
    @ThomasOberhoff 4 години тому +1

    This will put so many call-center agents out of work worldwide

  • @trutenantedboderampt
    @trutenantedboderampt 2 години тому

    Great! Now we can hear non-sensical facts from history with native audio output!

  • @IN-pr3lw
    @IN-pr3lw 15 годин тому +11

    Google doing what OpenAI said they would months ago but we still didnt get 👏

    • @cagnazzo82
      @cagnazzo82 6 годин тому

      Actually with advanced voice I was having it speak english, french, elvish, and simlish in one sentence. The actual game-changer is being able to prompt the AI to do this. You can do this through voice commands with OpenAI, but for some reason ignored the ability to prompt for voices.
      Plus I think the whole 'her' situation got them rattled from voices almost altogether.

    • @IceMetalPunk
      @IceMetalPunk 5 годин тому +1

      ? OpenAI Advanced Voice mode is already out

  • @flamyf
    @flamyf Годину тому

    0:01 Has anyone find "Video understanding" demo? All other topics have a video on this channel

  • @GeneralKenobi69420
    @GeneralKenobi69420 6 годин тому

    That thumbnail goes hard

  • @DistortedV12
    @DistortedV12 2 години тому +1

    GOOGLE ships! Pixel phone stocks jumping!

  • @RichardPinewood
    @RichardPinewood 12 годин тому +1

    level 4 AI is the next big thing, thats when Science gonna get intresting 😎

  • @rakeshkumarrout2629
    @rakeshkumarrout2629 5 годин тому +1

    lets start building with gemini 2.0

  • @DanielMK
    @DanielMK 17 годин тому +4

    Now that's impressive

  • @demonsynth
    @demonsynth 15 годин тому

    Mind blown. Playing with it now :)

  • @devagarwal3250
    @devagarwal3250 4 години тому +1

    woah this is so cool

  • @MichealAngeloArts
    @MichealAngeloArts 10 годин тому

    I don't have the "Output Format" and "Voice" options under "Model" in the AI Studio. I just have the "Token Count" immediately after Model.

    • @MichealAngeloArts
      @MichealAngeloArts 4 години тому

      I've just figured it out as I have to change from "Create Prompt" to "Stream Realtime" in the left pane. However I can't seem to change the audio effect. Whispering doesn't work with me although it is demonstrated in the Google post. How can we add these audio effects?

  • @Chaotic-n5n
    @Chaotic-n5n 15 годин тому

    Bro this thing is crazyyy 😱

  • @Blooper1980
    @Blooper1980 4 години тому +1

    Pretty epic

  • @Kenykore
    @Kenykore 17 годин тому +2

    This is so lovely

  • @Kiririn
    @Kiririn 13 годин тому

    is the model in the video the flash version? i am unable to get it to whisper or laugh or change how it speaks

    • @shadydragon22
      @shadydragon22 9 годин тому

      Same here

    • @ShawnFumo
      @ShawnFumo 7 годин тому +1

      I think that's the part that is available in January. It is a bit confusing since they ended with saying to go to ai studio...

    • @shadydragon22
      @shadydragon22 7 годин тому

      @@ShawnFumo Oh ok I see! Thanks for clarifying

  • @999satyam
    @999satyam 14 годин тому +1

    ok that Hindi was nice, damn. Is there a paper on this?

  • @janjahrademusic
    @janjahrademusic 16 годин тому +5

    haha yoo that's dope ..well done

  • @BroskiPlays
    @BroskiPlays 5 хвилин тому

    This is AVM but with less restrictions

  • @phiarchitect
    @phiarchitect 17 годин тому +2

    nicely done

  • @Terrantulla
    @Terrantulla 14 годин тому +2

    I cant help myself but feel like the next decade is going to get very weird

  • @Fwuzeem
    @Fwuzeem 16 годин тому

    How do we get it?

  • @Happ1ness
    @Happ1ness 7 годин тому

    Hopefully it's not another lie.
    We all remember the Gemini "hands on demo".

  • @niceplace123
    @niceplace123 50 хвилин тому

    Look amazing, but did anyone get it to work in the actual AI studio? I ran into a ton of bugs, especially with non-English languages.

  • @snowhan7006
    @snowhan7006 15 годин тому

    incredible❤❤

  • @AyyazZafar
    @AyyazZafar 2 години тому

    I tried but it does not whisper yet.

  • @MemeConnoisseur
    @MemeConnoisseur 15 годин тому +2

    who's going to fill the hollow the emptiness? idk something is super weird bout ai generated audio trying to be friendly and humanly..

  • @pathring
    @pathring 5 годин тому +1

    한국어 스피치는 조금 부자연스럽군요

  • @DominickZollinger-e3r
    @DominickZollinger-e3r 6 годин тому +1

  • @1brokkolibaum
    @1brokkolibaum 27 хвилин тому

    I wonder why I am able to use it on my pc, but my phone doesnt have 2.0 unlocked 😮‍💨

  • @braineaterzombie3981
    @braineaterzombie3981 13 годин тому

    Gimme my sandevistan , time to get chromed up

  • @GeneralKenobi69420
    @GeneralKenobi69420 6 годин тому +1

    So you're saying it can roleplay as a furry femboy fox? Asking for a friend of course

  • @ROHIT-wx4nu
    @ROHIT-wx4nu 15 годин тому +1

    This is how tts ends😂😂😂

  • @pandoraeeris7860
    @pandoraeeris7860 16 годин тому +5

    I need an agent that can use any program on my computer.
    Just give us AIOS.

    • @J3R3MI6
      @J3R3MI6 16 годин тому +1

      Exactly

    • @CODE7X
      @CODE7X 14 годин тому

      Exactly, but yes its already out , but for browser so far , and not released yet .... I hope google releases one :0

  • @AstroZoe1804
    @AstroZoe1804 16 годин тому

    I love it

  • @InternetKilledTV21
    @InternetKilledTV21 16 годин тому +2

    Oh Calculon

  • @ShpanMan
    @ShpanMan 15 годин тому +3

    Nothing that OpenAI's model can't do so far, but hey more competition is better for everyone!

    • @IceMetalPunk
      @IceMetalPunk 5 годин тому

      Hopefully it has cheaper API access. I blew through so much money just testing a few use cases of the OpenAI audio model through the API.

  • @vectoralphaSec
    @vectoralphaSec 16 годин тому +5

    AGI is coming soon 2025

    • @JaBigKneeGap
      @JaBigKneeGap 14 годин тому +2

      Dude, I swear. Like, that clock slaps 12 on, idk, january 5? I swear AGI will be here. Or anytime afterward.

  • @ruchirahasaranga8076
    @ruchirahasaranga8076 5 годин тому +1

    it does not support Sinhala language!

  • @lakshiBro
    @lakshiBro 16 годин тому +2

    Oh well.

  • @4letterdc
    @4letterdc 16 годин тому +1

    hell yeah

  • @mightynathaniel5355
    @mightynathaniel5355 15 годин тому +1

    Would be better and more impressive if it kept the same voice or character when switching languages rather than using a totally different voice for each language. But all fun and looking forward to using this model.

  • @MidgarMerc
    @MidgarMerc 41 хвилина тому

    Surely this won't be used to cause suffering at the expense of talented voice actors just so rich creeps get even richer. Surely.

  • @notvedxp
    @notvedxp 15 годин тому

    😮

  • @BoydLIN-c3w
    @BoydLIN-c3w 9 годин тому

    I haven’t found Chinese 😂

  • @ashleigh3021
    @ashleigh3021 8 годин тому

    I don’t like the tone, cadence structure. They should call it “podcast voice”

  • @joelcarter9137
    @joelcarter9137 13 годин тому

    Wow! That is completely pointless!

    • @bluepandaman
      @bluepandaman 12 годин тому +1

      What.. are you even talking about. How is this pointless?

    • @SabiUddin
      @SabiUddin 56 хвилин тому

      Copium

  • @naughtycat9894
    @naughtycat9894 9 годин тому +1

    the most exciting thing 🎉