"Training" an AI Agent for ONE Specific TASK with OpenAI-o1 API

OpenAI Swarm AI Agents - Is It Time To Be ALL IN on Agentic Workflows?

How to Build an AI Agent Using OpenAI Realtime API (Step-by-step Guide)

Кирилл Набутов. Арестович в Кремле, кто взорвал командующего в Москве, война России с НАТО

СИНИЙ ИНЕЙ УЖЕ ВЫШЕЛ!❄️

Cat mode and a glass of water #family #humor #fun

OpenAI Realtime API - The NEW ERA of Speech to Speech? - TESTED

All About AI

Переглядів 29 401

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 27 гру 2024

КОМЕНТАРІ •

@almirkaza 2 місяці тому ⁺¹²
can you share the url to the repo?
@boxeemusic 2 місяці тому ⁺¹²
where can i find the code? pls help
@KCM25NJL 2 місяці тому ⁺³¹
Yeah, no basement dweller dev's are gonna be messing with that API until the costs drop by at least 100x, which I honestly only see as a near term incentive for Meta to get a Llama Voice model cookin'
@jamesjonnes 2 місяці тому
I'll use it, but can't wait for an uncensored open source version. Text only is too boring. I lack the patience to use text only for too long for the tasks I want, like learning languages.
@karmcy Місяць тому ⁺¹
Well said, 3 tests today ~2mins each conversation. $1.5. Yikes!
@sykexz6793 2 місяці тому ⁺¹³
I don't think this is the same model as advanced voice mode.
@ibrahimaba8966 2 місяці тому ⁺⁴
I just integrated it on Twilio, it changes everything, but it took me a bit of time.
@OliNorwell 2 місяці тому ⁺⁵
Great work! You must have had a busy couple of days getting it working
@meetsummdev 2 місяці тому ⁺¹
you can really implement it in a few hours
@DarrenJohn10X 2 місяці тому ⁺⁹
Looking forward to seeing your alleged "spaghetti" code! (Right now 2 weeks ago is your latest repo)
@Bangs_Theory 2 місяці тому ⁺³
Which function controls the interruption?
@gaijinshacho 2 місяці тому ⁺³
VAD
@viduraerandika8296 16 днів тому
@@gaijinshacho even i use it in turn detection it continue talking until it finishes.
@Distillated 29 днів тому
5:58 - I felt that 😂 Currently having the same conundrum with the Anthropic API! (Claude 3.5 Sonnet is so good...)
@DeepSucess 2 місяці тому
can we have speech/voice as input to this app using websockets and get result as text as output?
@Akander20 2 місяці тому
where can i get the repo?
@hamzakhanswati9087 Місяць тому
when will you upload it on github??
@d3xrd527 Місяць тому
Where to find code?
@jamesyoungerdds7901 2 місяці тому
Great video, thanks Kris! I'm interesting in the function calling and structured output from the voice websocket return. Can you use agents or agentic flows with constrained and structured outputs with the voice mode 🤔
@drewpeer 2 місяці тому
Does everyone have access to this beta? Anything we have to do?
@pjm17 2 місяці тому ⁺¹
Could you achieve these results in an app just using the text to speech and speech to text with native ios features alongside openai NON realtime api's?
@bassemibrahim3798 18 днів тому
yes I can, I have already implemented that
@AtheistAdam 2 дні тому
You are cool :) Thanks for all you share.
@nmana9759 2 місяці тому
Why wouldn't you share the repo?
@alarconfilms1 2 місяці тому ⁺¹
What is the code used?
@khalifarmili1256 2 місяці тому
It's not out yet
@romera9662 2 місяці тому
@@khalifarmili1256 How long will it take?
@DeepSucess 2 місяці тому
can It work for other languages such as urdu, hindi?
@MagagnaJayzxui 2 місяці тому
What is AVA?
@三川富資訊股份有限公 2 місяці тому ⁺³
The Realtime API cost is high. I suggest that there is a cheaper way. 1.Using Google STT to get user's speech texts. 2.Send texts to GPT. 3. Get responses from GPT. 4.Send responses to Google TTS. 5.User gets AI responses in both texts and voices. The response time is longer and it costs lower.
@李征-u3n Місяць тому ⁺¹
In that case, you don't need to use realtime API. OpenAI chat completion API I think works just fine.
I think the key point is that realtime API has the ability to not miss any information from your voice (tone, intonation or accent), which means it can feel you like a real person, as least it is trying to.
@MrAnonymousCitizen 22 дні тому
Yes you said it yourself. The response time is longer and the cost is cheaper… thank you Sherlock…. Case solved
@Cutestreetcats Місяць тому
where is the code?
@JaredVBrown 2 місяці тому
Would love the bankrupt myself with your code, i wont judge spaghetti, tried for 20 prompts with the new claude to get it up and running - no dice. Examples would be much apricated :)
@dievas_ 2 місяці тому
I still don't have access to it :/
@DesignDesigns 2 місяці тому
This is mindblowing...
@李征-u3n Місяць тому
I don't quite understand what realtime means here, especially in text version
In voice version, yes, you can interact with it like really talking to a person, such as you can interrupt the conversation, or maybe openAI can understand extra information from your tone or intonation or accent.
But in text version, I don't see any difference with just use OpenAI chat completion API
@Dea07thox 2 місяці тому
Can't you just better prompt it to have a less talkative output so you don't have to break it's response that often? That would make a big difference and everything more seamless :)
@icydemon9749 Місяць тому
can you provide a code ? please
@saksham3 2 місяці тому
Doesn't it have emotions?
@micbab-vg2mu 2 місяці тому
Thanks :)
@tommoves9935 2 місяці тому
Happy to be the first to comment. Kris you are always up to date. Once again cool stuff from you. Spaghetti code... 🤣. Great that you did talk about the costs as well. I like your creative and often real funny ideas. Please keep up the great work! Regarding your phone call: saw a video from a guy in the US weeks ago (no Realtime API) - he did let his AI order a Pizza and it worked great. Latency even back then was good enough - should work perfectly. Maybe try it with an italian accent 😉. Thx from Tom!
@AI_Escaped 2 місяці тому
No one is going to be even able to develop at these prices other than those with deep pockets. Just testing and figuring things out would be too expensive to even try.
@contentfreeGPT5-py6uv 2 місяці тому
i tested yesterday ,but
Error al conectar: 403
Acceso denegado. Verifica tu clave de API y los permisos para usar el API Realtime.
@elprox1290 2 місяці тому
try checking your api key or just making a new one
@contentfreeGPT5-py6uv 2 місяці тому
@@elprox1290 again, thanks
@thenoblerot 2 місяці тому
By telling it it is playing a game with the user, it might be failing on purpose to let you win!
@TheTrainstation 2 місяці тому
Im waiting to hear the Irish accent to be sure
@benbrahimjamil1976 2 місяці тому
How to get the repo ?
@DhairyaMarwah-l1u 2 місяці тому ⁺⁵
Can you share the repo link ?
@khanhhq2044 2 місяці тому ⁺³
Can you share the repo link ?

Наступне

Автоматичне відтворення

"Training" an AI Agent for ONE Specific TASK with OpenAI-o1 API

"Training" an AI Agent for ONE Specific TASK with OpenAI-o1 API

OpenAI Swarm AI Agents - Is It Time To Be ALL IN on Agentic Workflows?

OpenAI Swarm AI Agents - Is It Time To Be ALL IN on Agentic Workflows?

How to Build an AI Agent Using OpenAI Realtime API (Step-by-step Guide)

How to Build an AI Agent Using OpenAI Realtime API (Step-by-step Guide)

Кирилл Набутов. Арестович в Кремле, кто взорвал командующего в Москве, война России с НАТО

Кирилл Набутов. Арестович в Кремле, кто взорвал командующего в Москве, война России с НАТО

СИНИЙ ИНЕЙ УЖЕ ВЫШЕЛ!❄️

СИНИЙ ИНЕЙ УЖЕ ВЫШЕЛ!❄️

Cat mode and a glass of water #family #humor #fun

Cat mode and a glass of water #family #humor #fun

КТО НЕ ДВИНЕТСЯ, ПОЛУЧИТ МАШИНУ!

КТО НЕ ДВИНЕТСЯ, ПОЛУЧИТ МАШИНУ!

OpenAI DevDay | Realtime Speech to Speech API + Image Fine-tuning TESTED

OpenAI DevDay | Realtime Speech to Speech API + Image Fine-tuning TESTED

Windsurf vs Cursor: In-Depth AI Code Editor Comparison

Windsurf vs Cursor: In-Depth AI Code Editor Comparison

Understanding OpenAI Real Time API With a Python Demo

Understanding OpenAI Real Time API With a Python Demo

The Future of Knowledge Assistants: Jerry Liu

The Future of Knowledge Assistants: Jerry Liu

YouTube is now on EASY Mode (Anyone Can Blow Up in 2025)

YouTube is now on EASY Mode (Anyone Can Blow Up in 2025)

OpenAI Realtime API vs Voice AI Platforms

OpenAI Realtime API vs Voice AI Platforms

100% Local AI Speech to Speech with RAG - Low Latency | Mistral 7B, Faster Whisper ++

100% Local AI Speech to Speech with RAG - Low Latency | Mistral 7B, Faster Whisper ++

OpenAI DevDay 2024 | Multimodal apps with the Realtime API

OpenAI DevDay 2024 | Multimodal apps with the Realtime API

Ditch Expensive Tools and Build Anything with Bolt.new for FREE

Ditch Expensive Tools and Build Anything with Bolt.new for FREE

Этот бой - Самое большое РАЗОЧАРОВАНИЕ за всю КАРЬЕРУ БУАКАВА!

Этот бой - Самое большое РАЗОЧАРОВАНИЕ за всю КАРЬЕРУ БУАКАВА!

СИНИЙ ИНЕЙ УЖЕ ВЫШЕЛ!❄️

СИНИЙ ИНЕЙ УЖЕ ВЫШЕЛ!❄️

СПОРИМ ТЫ НЕ ЗНАЕШЬ ТРИ СЛОВА НА БУКВУ О? #shortsvideo #юмор #катяклон #comedy #прикол #мамадочка

СПОРИМ ТЫ НЕ ЗНАЕШЬ ТРИ СЛОВА НА БУКВУ О? #shortsvideo #юмор #катяклон #comedy #прикол #мамадочка

Мама загинула у блокадному Чернігові, а тато у полоні РФ #війна #люди #україна #shorts #смерть

Мама загинула у блокадному Чернігові, а тато у полоні РФ #війна #люди #україна #shorts #смерть

НА ЦЕ можна дивитись ВІЧНО! Такої ПАЛКОЇ зустрічі НІХТО НЕ ЧЕКАВ

НА ЦЕ можна дивитись ВІЧНО! Такої ПАЛКОЇ зустрічі НІХТО НЕ ЧЕКАВ

The Security Guard Fell Into The Trap Of The Beauty #still #parkour #funny#skate

The Security Guard Fell Into The Trap Of The Beauty #still #parkour #funny#skate

ЧТО ОПАСНЕЕ? ОТВЕТЫ ВАС ШОКИРУЮТ... (1% ОТВЕЧАЮТ ПРАВИЛЬНО) #Shorts #Глент

ЧТО ОПАСНЕЕ? ОТВЕТЫ ВАС ШОКИРУЮТ... (1% ОТВЕЧАЮТ ПРАВИЛЬНО) #Shorts #Глент

Перший наступ КНДРівців

Перший наступ КНДРівців