World’s Fastest Talking AI: Deepgram + Groq

Two GPT-4os interacting and singing

Local Low Latency Speech to Speech - Mistral 7B + OpenVoice / Whisper | Open Source AI

Как просверлить гранит медной трубкой? Тот самый эксперимент #ученые_против_мифов

Мы…почти доехали на Волге 500км до Краснодара🔥

Normal vs Psychopath vs Rich How to heal a cut on your finger ☝️❤️‍🩹

Low latency AI voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming.

Linguflex

Переглядів 5 116

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 16 сер 2023
Short prove of concept code for a real-time ai companion. Note: This demo is conducted on a 10Mbit/s connection, so actual performance might be more impressive on faster connections.
Project link: github.com/KoljaB/AIVoiceChat

КОМЕНТАРІ • 22

@k0hacuu 6 місяців тому ⁺¹
Incredible work!
Found your projects today and I cannot describe in words how impressive this all is. +1!
@ZalexMusic 9 місяців тому ⁺¹
very impressive work!
@RolandoLopezNieto 2 місяці тому ⁺¹
Impressive work, thanks
@aorusaki 10 місяців тому ⁺¹
Great work! If you had a strong enough computer you can run a smaller 13B model with fast tts with much lower latency
@alexandresajus 6 місяців тому
Incredible! I was working on the same project and had the issue of TTS latency: any Cloud TTS service has latency that is too high for real-time purposes. Definitely going to implement you approach. Thanks!
@Linguflex 6 місяців тому
May I also point you to this one which can greatly help with TTS and latency: github.com/KoljaB/RealtimeTTS
@sergitorrabadella 5 місяців тому ⁺¹
It's impressive! Which GPU are you using?
@Linguflex 5 місяців тому
Thank you. I have a RTX 2080 Super.
@sergitorrabadella 5 місяців тому
@@Linguflex Thanks for your answer! I have some questions. I've seen your email in the comments, can I email you?
@akashraut8129 5 місяців тому
hi Buddy!!
Im trying this approach but getting error, I have trained voice assitant using langchain and gpt 3.5 turbo and using elevenlabs api and opean ai api but latency is not reducing
@HowExactly Місяць тому
Very nice. Greatjob❤
Out of curiosity, how would you handle back to back conversation with interruptionhandling without using space?
@Linguflex Місяць тому
Thank you. We talked about how to do solid interruption in my discord channel recently: discord.gg/f556hqRjpv
Highly encourage you to join, it's a great place to ask questions, share progress and get support from tech enthusiasts. Would love to see you there!
@arsenlupin4825 10 місяців тому ⁺¹
Wow this project is insane is it possible to exchange openai with an llm instead to have 100% offline voice assistant ?
@Linguflex 10 місяців тому
A local LLM is not the problem. Local TTS is much harder. There is only Tortoise or Bark afaik and they are not comparable to Elevenlabs quality sadly.
@Anonymos739 10 місяців тому ⁺¹
Please make a tutorial video for installing ai, I tried following the guide but I couldn't do it.
@Linguflex 10 місяців тому ⁺²
You have python installed? Mail me your install probs or send screenshot at lonligrin@gmail.com, I will help as good I can
It's basically: copy the files, enter api keys there, open a cmd shell as admin, enter "pip install openai elevenlabs pyaudio wave keyboard faster_whisper numpy torch" there. After that enter python voice_talk_vad.py or python voice_talk.py
Not sure how to do a good tutorial video...
@aorusaki 10 місяців тому ⁺²
@@Linguflexwow youre really nice mate
@preenanahnaf 6 місяців тому
Hey brother! When i am running your program it is showing rate limit error. btw I am using free tier of openai
@Linguflex 6 місяців тому
Elevenlabs or Openai API ran into rate limit. Check characters used in elevenlabs and settings limits in your openai account
@preenanahnaf 6 місяців тому
@@Linguflex it is saying openai limit crossed.
i am using free tier of openai. is free tier enough for this program to run or i must upgrade to paid tier?
@Linguflex 6 місяців тому
Paid account, it needs openai api key.
@Smashachu 5 місяців тому ⁺¹
Actually you want about 100 MS of delay at the very least. We're human and take time to process information and it would just seem unnatural to have a conversation where you felt like someone was finishing your sentences for you all the time.

Наступне

Автоматичне відтворення

World’s Fastest Talking AI: Deepgram + Groq

World’s Fastest Talking AI: Deepgram + Groq

Two GPT-4os interacting and singing

Two GPT-4os interacting and singing

Local Low Latency Speech to Speech - Mistral 7B + OpenVoice / Whisper | Open Source AI

Local Low Latency Speech to Speech - Mistral 7B + OpenVoice / Whisper | Open Source AI

Как просверлить гранит медной трубкой? Тот самый эксперимент #ученые_против_мифов

Как просверлить гранит медной трубкой? Тот самый эксперимент #ученые_против_мифов

Мы…почти доехали на Волге 500км до Краснодара🔥

Мы…почти доехали на Волге 500км до Краснодара🔥

Normal vs Psychopath vs Rich How to heal a cut on your finger ☝️❤️‍🩹

Normal vs Psychopath vs Rich How to heal a cut on your finger ☝️❤️‍🩹

СТОИТ ЛИ СБЕГАТЬ ОТ РОДИТЕЛЕЙ? 2 ПОПЫТКА! ‍👩‍👧‍👦

СТОИТ ЛИ СБЕГАТЬ ОТ РОДИТЕЛЕЙ? 2 ПОПЫТКА! ‍👩‍👧‍👦

Fastest speech to text transcription, 100% offline - Whisper.cpp | Zero latency

Fastest speech to text transcription, 100% offline - Whisper.cpp | Zero latency

Instant Audio Streaming with ElevenLabs AI Voice API - Here's How

Instant Audio Streaming with ElevenLabs AI Voice API - Here's How

The BEST, Local Text-to-Speech Generator - AI Voice Cloning (Tortoise TTS)

The BEST, Local Text-to-Speech Generator - AI Voice Cloning (Tortoise TTS)

Which AI can generate the most realistic voice? ElevenLabs vs Synthesia vs Murf AI!

Which AI can generate the most realistic voice? ElevenLabs vs Synthesia vs Murf AI!

I Built a Personal Speech Recognition System for my AI Assistant

I Built a Personal Speech Recognition System for my AI Assistant

AI Content Search (RAG) with Docs Agent | Build with Google AI

AI Content Search (RAG) with Docs Agent | Build with Google AI

SUPER Fast AI Real Time Speech to Text Transcribtion - Faster Whisper / Python

SUPER Fast AI Real Time Speech to Text Transcribtion - Faster Whisper / Python

How To Clone ANY Voice In Under 5 MIN w/ Eleven Labs AI

How To Clone ANY Voice In Under 5 MIN w/ Eleven Labs AI

Don't Use MemGPT!! This is way better (and easier)! Use Sparse Priming Representations!

Don't Use MemGPT!! This is way better (and easier)! Use Sparse Priming Representations!

РОЗБОМБЛЯТЬ усе: відповідь Путіну за Охматдит ГОТОВА. НАТО наважиться! Індія КИНЕ Кремль

РОЗБОМБЛЯТЬ усе: відповідь Путіну за Охматдит ГОТОВА. НАТО наважиться! Індія КИНЕ Кремль

🔴 Орбан змінює плани / Деталі ракетного удару по Україні

🔴 Орбан змінює плани / Деталі ракетного удару по Україні

Іспанія - Німеччина. ПРЯМА ТРАНСЛЯЦІЯ. Футбол. Євро-2024. Перший Футбольний канал. Аудіотрансляція

Іспанія - Німеччина. ПРЯМА ТРАНСЛЯЦІЯ. Футбол. Євро-2024. Перший Футбольний канал. Аудіотрансляція

Чи оновлювати дані тим хто в Україні та яка відповідальність | Адвокат Ростислав Кравець

Чи оновлювати дані тим хто в Україні та яка відповідальність | Адвокат Ростислав Кравець

"Майже 2 роки він був у полоні". Сестра військового з Херсонщини чекала повернення брата #shorts

"Майже 2 роки він був у полоні". Сестра військового з Херсонщини чекала повернення брата #shorts

Час на цвинтар ❗️ Кім Чен Ин подарував Путіну надгробок

Час на цвинтар ❗️ Кім Чен Ин подарував Путіну надгробок