OpenAI Realtime API: The future of Voice AI?

Поділитися
Вставка
  • Опубліковано 7 лют 2025

КОМЕНТАРІ • 54

  • @LucasMarquesAI
    @LucasMarquesAI 4 місяці тому +1

    Great video as always Jannis, let's go 🔥

  • @patrickzupanc1795
    @patrickzupanc1795 4 місяці тому +1

    Great video, thank you, Jannis!

  • @mikearmstrong-ai
    @mikearmstrong-ai 3 місяці тому

    Very informative, will start jumping in, thanks for the free resources.

  • @BrockMesarich
    @BrockMesarich 4 місяці тому

    Was waiting for you to release this!

  • @HenrykAutomation
    @HenrykAutomation 4 місяці тому

    Love its speed, unmatched by anything else out there right now!

  • @clairedubiel1
    @clairedubiel1 4 місяці тому

    Thanks for the helpful video Jannis!

  • @mohammedzihan7382
    @mohammedzihan7382 4 місяці тому +3

    For Developers, feel voice providers like VAPI wouldn't be required in near future. Directly integrate the OpenAI API, and have components like WebRTC, real time streaming, client server connection mapping, DB connections & data mapping implemented. For handling workflow management, state management, could integrate certain frameworks on top like Langraph.

    • @jannismoore
      @jannismoore  4 місяці тому +2

      Those platforms are already not required anymore, but I believe the realtime API will be the reason they become even more popular. Will share more on that soon.

  • @_arav_patel_
    @_arav_patel_ 4 місяці тому +1

    Great video. I wonder what the future will be like with Voice AI becoming this realistic. How long do you think it will take for Vapi to implement this? (few days, weeks, months?)

  • @7_Tom
    @7_Tom 4 місяці тому +4

    Great video as always! Since you are probably in contact with the Vapi team... Can you estimate how long it will take until this is implemented? Thanks.

    • @jannismoore
      @jannismoore  4 місяці тому +2

      I’m not quite sure, but I assume we should see something being released soon.

  • @moatazelkersh6129
    @moatazelkersh6129 4 місяці тому

    What a great video! Thanks so much for doing the work and providing us with the template for free. If you don’t mind me asking, how can I reduce my costs with Twilio and set up an open-source phone system to act as the call gateway? Another thing I was planning to implement WebRTC as it has the functionality to reduce Eco and noise reduction in case someone will call in a loud environment!

    • @jannismoore
      @jannismoore  3 місяці тому

      I think OpenAI handles the noise reduction part by themselves. If you're referring to SIP trunking, you most likely need to see how you can do the connection. Not every platform allows you to add a SIP URL to it, sometimes it's the other way around.
      If you want to try it, use something like Zoiper

  • @naryanzaninja7367
    @naryanzaninja7367 4 місяці тому

    What are your plans Jannis? Run the agency long term, or switch completely to saas, or voice ai education, or something else?

    • @jannismoore
      @jannismoore  4 місяці тому

      I haven’t even started with voice AI education.
      Honestly, for now I’m happy helping others build out extremely powerful systems, but the educational route might certainly be interesting one I see the need for it

  • @thereviewer5562
    @thereviewer5562 4 місяці тому

    You are as always authentic in your opinion. It is exciting thing for someone who is beng introduced to this voice stuff with ai for the first time. What do you thinkis the basic thing a beginner can learn in low code development ? What is the skill that moves the needle?

    • @jannismoore
      @jannismoore  4 місяці тому +1

      Understanding the concept and foundations.
      I think that’s the most important thing.
      Try some of my examples so you have a working solution, and then try to understand how it’s done.
      That’s a great point to start. 👍🏻

    • @thereviewer5562
      @thereviewer5562 4 місяці тому

      @@jannismoore that is good to hear.

  • @angeloh-u1q
    @angeloh-u1q 4 місяці тому +2

    I'm surprised that vapi isn't on top of this already.

  • @greendsnow
    @greendsnow 4 місяці тому +9

    İt's just way too expensive. Some people payed $3 for 5 minutes, even though the pricing catalogue says it's around 30 cents a minute... Simply unacceptable

    • @TrueCrimeShorties
      @TrueCrimeShorties 4 місяці тому +4

      Cost will go down soon just like other API costs

    • @jannismoore
      @jannismoore  4 місяці тому +5

      You can achieve the same with Vapi by dropping 50k tokens into your master prompt :)
      Anyways, API costs will definitely come down, so that isn’t a concern in my opinion

    • @dazdazfzf
      @dazdazfzf 4 місяці тому

      ⁠@@jannismooreexactly. Just a way to raise the bar of the value of their product because they cannot already scale.

  • @pjm17
    @pjm17 4 місяці тому

    SO could I build a conversational chat app. Basically give someone a person to talk to as they walk around and chat with? are prices too limiting right now??

    • @jannismoore
      @jannismoore  3 місяці тому +1

      You can do that, but yes, prices are still limiting as of now.
      I do believe that those will come down quite rapidly.

  • @radoslav07
    @radoslav07 3 місяці тому

    Can you share your replit link? Thanks

    • @jannismoore
      @jannismoore  3 місяці тому

      I did! It’s in my resource hub which you’ll find in the description

  • @8888-u6n
    @8888-u6n 4 місяці тому

    How do we get acces to the code you made? 👍

    • @jannismoore
      @jannismoore  4 місяці тому

      Via my resource hub - the links for that are in the description :)

  • @lakergreat1
    @lakergreat1 4 місяці тому

    could it work with Microsoft Teams Phone? I would like to use it in an IVR setup

    • @jannismoore
      @jannismoore  3 місяці тому

      We haven't tried that yet, but if you have a number, you can most likely make calls to it through a provider like Twilio. There are also other approaches that you might be able to leverage long term, such as daily.co

  • @jeelanshahtlyr6076
    @jeelanshahtlyr6076 4 місяці тому

    Jannis is the ONLY way to go when it comes to AI Voice and Automations.

  • @pauledam2174
    @pauledam2174 4 місяці тому

    Can anyone suggest how this could be used for real-time translation? Actually it doesn't need to be voice to voice just voice to text

    • @jannismoore
      @jannismoore  4 місяці тому +1

      In that case you might just want to look at Deepgram

  • @tuaitituaiti1565
    @tuaitituaiti1565 4 місяці тому

    Hey there. Thank you for tge value bombs you are dropping...Heads up the link to the resource seem to be broken...thanks again

    • @jannismoore
      @jannismoore  4 місяці тому

      Appreciate it! Both of the links work when opening them. What do you see once you click on them?

  • @jamesballantyne9214
    @jamesballantyne9214 4 місяці тому

    This seems as slow as vapi. What advantages does this, will this have, if it’s the same speed without and of the features of vapi?

    • @jannismoore
      @jannismoore  4 місяці тому

      Are you sure you watch your videos on normal playback speed? :D
      I've mentioned some of the benefits in the video. If that's not enough, I'll drop a more detailed one soon.

    • @rarf2142
      @rarf2142 4 місяці тому

      Bro this is not slow at all… You do realise it should sound human and not respond in 0.005 milliseconds? The delay makes it sound human smh

  • @jerkmeo
    @jerkmeo 14 днів тому

    nice intro..thank

  • @shanes.6227
    @shanes.6227 3 місяці тому

    can't wait til this kills customer service phone jobs. calling my wireless carrier for something is often a big trouble, taking hours!

  • @Kevinsmithns
    @Kevinsmithns 4 місяці тому +1

    How can we use it for ai call bots?

    • @jannismoore
      @jannismoore  4 місяці тому

      You can use the custom example I showed for Twilio, or you can give it another couple of days and Vapi will most likely have something available too

  • @NeuralDev
    @NeuralDev 4 місяці тому

    The cost is way too high, we need open source models

    • @jannismoore
      @jannismoore  4 місяці тому

      I don't think the price will be that high for long

  • @SzamBacsi
    @SzamBacsi 4 місяці тому

    Laughable. It works in English or German, with simple Indo-European languages. But it dies with Hungarian. instantly.

    • @jannismoore
      @jannismoore  4 місяці тому +2

      I can see what causes your disappointment.
      You'll always see major languages being implemented at a faster pace. Honestly, I'm already impressed it properly handles multilingual conversations as smooth as now, as this was already incredibly hard with the orchestration layers we've seen so far.
      We should be happy about those advancements and help them with enough input to make it even better, which on the other hand will also increase your chances of having better results for other languages.

    • @rarf2142
      @rarf2142 4 місяці тому

      @@jannismooreI hope Dutch works already, I really need a Dutch agent. VAPI starts hallucinating on Dutch and speaking half German after a while lol

    • @SzamBacsi
      @SzamBacsi 4 місяці тому

      @@jannismoore Indeed, I am disappointed, as I have experience applying language models in IVR systems since the 2000s, and I understand that implementing a new model in 2024 should not pose a problem. The underlying issue seems to be a lack of concern for anything outside a specific "cultural" circle. In summary, they simply don't care.
      But I do hope I am mistaken.
      I truly appreciate your videos; they bring a refreshing perspective to this emerging area .

  • @gslvqz8812
    @gslvqz8812 4 місяці тому

    You need to change your thumbnail. It looks evil

    • @jannismoore
      @jannismoore  4 місяці тому

      Seems like you clicked on it nevertheless