81
205 025

Low latency voice to text transcription in real time

1:22

How to run Llama locally using Python

1:03

Transcribe streaming audio within web browsers

2:35

Speaker Diarization for Web-based Applications that run across Chrome, Safari, Firefox and Edge

2:16

Convert voice to text using Javascript

2:34

Speaker Recognition Across Chrome, Safari, Firefox, and Edge

2:06

Voice Assistant in Node.js using on-device LLM, STT, TTS and Wake Word tech

Demo of an on-device LLM-powered voice assistant running a 2022 MacBook Air M2 w/ 8GB memory
Microsoft Phi-2 quantized by picoLLM Compression: picovoice.ai/picollm/compression/
picoLLM Inference: picovoice.ai/picollm/inference/
Porcupine Wake Word: picovoice.ai/platform/porcupine/
Cheetah Streaming Speech-to-Text: picovoice.ai/platform/cheetah/
Orca Streaming Text-to-Speech: picovoice.ai/platform/orca/
Code: github.com/Picovoice/pico-cookbook/blob/main/recipes/llm-voice-assistant/nodejs/
Start Free: console.picovoice.ai/
#largelanguagemodels #llms #voiceai #voiceassistant #aiagents #artificialintelligence #machinelearning

Відео

Low latency voice to text transcription in real time

1:22

Low latency voice to text transcription in real time

Переглядів 206Місяць тому

Discover how to transcribe speech to text in real-time using the Picovoice Cheetah Streaming Speech-to-Text Node.js SDK! Cheetah's ability to process voice data locally in real time makes it ideal for real-time applications whether you're broadcasting, adding transcription to meeting applications, or building AI agents. Resources: Overview: picovoice.ai/platform/cheetah/ Quick Start Guide: pico...

1:03

How to run Llama locally using Python

Переглядів 128Місяць тому

Discover how to run Llama 2 and Llama 3 models locally on your desktop using Picovoice’s picoLLM Inference Engine Python SDK and compressed Llama Models. This tutorial walks you through setting up the environment, running the models, and exploring the capabilities of LLMs without relying on cloud infrastructure. Resources: Overview: picovoice.ai/platform/picollm/ Quick Start Guide: picovoice.ai...

Transcribe streaming audio within web browsers

2:35

Transcribe streaming audio within web browsers

Переглядів 47Місяць тому

Discover how to add real-time transcription to your JavaScript applications using Picovoice's Cheetah Streaming Speech-to-Text engine. This tutorial guides you through setting up the project, initializing Cheetah, and transcribing speech-to-text on-device to ensure privacy and performance. Resources: Overview: picovoice.ai/platform/cheetah/ Quick Start Guide: picovoice.ai/docs/quick-start/cheet...

Speaker Diarization for Web-based Applications that run across Chrome, Safari, Firefox and Edge

2:16

Speaker Diarization for Web-based Applications that run across Chrome, Safari, Firefox and Edge

Переглядів 47Місяць тому

Learn how to integrate speaker diarization into your web applications using Picovoice's Falcon Speaker Diarization Web SDK. This tutorial covers the process of identifying "who spoke when" in audio streams, making it ideal for multi-speaker environments. Resources: Overview: picovoice.ai/platform/falcon/ Quick Start Guide: picovoice.ai/docs/quick-start/falcon-web/ API Reference: picovoice.ai/do...

2:34

Convert voice to text using Javascript

Переглядів 79Місяць тому

Learn how to add cloud API transcription accuracy to your web applications without sacrificing privacy using Picovoice's Leopard Speech-to-Text Web SDK. This tutorial explains how to set up and use on-device speech recognition, ensuring privacy and real-time performance. Resources: Overview: picovoice.ai/platform/leopard/ Quick Start Guide: picovoice.ai/docs/quick-start/leopard-web/ API Referen...

Speaker Recognition Across Chrome, Safari, Firefox, and Edge

2:06

Speaker Recognition Across Chrome, Safari, Firefox, and Edge

Переглядів 41Місяць тому

Implement Speaker Recognition for Web Applications Using JavaScript Learn how to add speaker recognition to your web applications using Picovoice's Eagle Speaker Recognition Web SDK. This tutorial guides you through setting up speaker enrollment and recognition, allowing your application to identify and verify speakers in real time. Resources: Overview: picovoice.ai/platform/eagle/ Quick Start ...

Real-time Deep Learning-powered Noise Suppression for Web Applications

2:33

Real-time Deep Learning-powered Noise Suppression for Web Applications

Переглядів 49Місяць тому

AI-powered Noise Cancellation engine, Koala Noise Suppression removes background noises, and voices from online meetings and voice chats, enhancing speech quality and intelligibility in real time, while maintaining user privacy. In this tutorial, we show how to add noise suppression to your web applications using Picovoice's Koala Noise Suppression Web SDK. Resources: Overview: picovoice.ai/pla...

Detect Speech and Silence in Conversations using Cobra Voice Activity Detection Web SDK

1:40

Detect Speech and Silence in Conversations using Cobra Voice Activity Detection Web SDK

Переглядів 77Місяць тому

Learn how to implement real-time voice activity detection (VAD) in your JavaScript applications using Picovoice's Cobra VAD engine. This tutorial covers setting up the project, initializing the Cobra engine, and detecting voice activity in real-time while keeping all processing on-device for privacy and low latency. Resources: Overview: picovoice.ai/platform/cobra/ Quick Start Guide: picovoice....

Adding Voice to Web Applications: Convert Text to Speech in JavaScript

2:49

Adding Voice to Web Applications: Convert Text to Speech in JavaScript

Переглядів 882 місяці тому

Learn how to convert text into speech within your JavaScript applications across web browsers using Picovoice's Orca Text-to-Speech Web SDK. This guide covers the installation, model setup, and implementation of text-to-speech in modern browsers. Resources: Overview: picovoice.ai/platform/orca/ Quick Start Guide: picovoice.ai/docs/quick-start/orca-web/ API Reference: picovoice.ai/docs/api/orca-...

Building a hands-free voice AI agent for web with ReactJS

3:56

Building a hands-free voice AI agent for web with ReactJS

Переглядів 1172 місяці тому

Learn how to integrate wake word detection into your React.js voice AI agents to create hands-free experiences using Porcupine Wake Word. This tutorial covers the setup process, training custom wake word models, and deploying them in a React environment for efficient, low-latency voice activation. Resources: Overview: picovoice.ai/platform/porcupine/ Quick Start Guide: picovoice.ai/docs/quick-s...

Speaker Diarization: How to detect different speakers in an audio recording

1:09

Speaker Diarization: How to detect different speakers in an audio recording

Переглядів 2982 місяці тому

Learn how to implement speaker diarization in Python, identifying "who spoke when" in audio streams. This tutorial covers setup, speaker enrollment, and accurate identification using Falcon Speaker Diarization Python SDK. Don't forget to check out the tutorial to see other Python-based speaker diarization frameworks - pyannote, NVIDIA Nemo, and Simple Diarizer - as well: picovoice.ai/blog/speak...

Convert Speech to Text locally using Python across Linux, macOS, Windows, and Raspberry Pi

1:20

Convert Speech to Text locally using Python across Linux, macOS, Windows, and Raspberry Pi

Переглядів 822 місяці тому

Learn how to convert speech to text using Picovoice's on-device Speech-to-Text engine Leopard's Python SDK. Leopard Speech-to-Text performs speech recognition locally on the device, keeping your voice data private (i.e. GDPR and HIPAA compliant by design). The SDK runs on Linux, macOS, Windows, and Raspberry Pi. On-device Speech-to-Text: picovoice.ai/platform/leopard/ Tutorial: picovoice.ai/blo...

How to generate srt files and add subtitles to videos using Python

1:55

How to generate srt files and add subtitles to videos using Python

Переглядів 8402 місяці тому

This tutorial shows how to add subtitles to any video using the Picovoice Leopard Speech-to-Text Python SDK. On-device Speech-to-Text: picovoice.ai/platform/leopard/ Tutorial: picovoice.ai/blog/speech-to-text-with-reactjs/ Demo: github.com/Picovoice/leopard/tree/master/demo/python-subtitle #speechtotext #pythonprogramming #python #pythontutorial #voiceai #transcription #deeplearning

Real-time Speaker Recognition and Identification in Python

2:14

Real-time Speaker Recognition and Identification in Python

Переглядів 7472 місяці тому

Learn to add real-time speaker identification to your Python app using Picovoice's Eagle Speaker Recognition Python SDK. This tutorial covers setting up the Eagle SDK, enrolling speakers, and recognizing speakers in real time. Resources: Eagle Speaker Recognition: picovoice.ai/platform/eagle/ Eagle Speaker Recognition Quick Start: picovoice.ai/docs/quick-start/eagle-python/ Eagle Speaker Recogn...

1:38

ReactJS Speech Recognition Tutorial

Переглядів 3203 місяці тому

ReactJS Speech Recognition Tutorial

Record Audio from Web Browser: Web Audio API & MediaStream API

1:35

Record Audio from Web Browser: Web Audio API & MediaStream API

Переглядів 3063 місяці тому

Record Audio from Web Browser: Web Audio API & MediaStream API

Add Speaker Diarization to OpenAI's Whisper Speech to Text

1:10

Add Speaker Diarization to OpenAI's Whisper Speech to Text

Переглядів 1,1 тис.3 місяці тому

Add Speaker Diarization to OpenAI's Whisper Speech to Text

Enhance Speech by Suppressing Background Noise in real time using Python

1:38

Enhance Speech by Suppressing Background Noise in real time using Python

Переглядів 1563 місяці тому

Enhance Speech by Suppressing Background Noise in real time using Python

How to Detect Voice Activity in Real Time using Python

1:22

How to Detect Voice Activity in Real Time using Python

Переглядів 5143 місяці тому

How to Detect Voice Activity in Real Time using Python

On-device Text-to-Speech in Python to build real-time AI agents with low latency

1:06

On-device Text-to-Speech in Python to build real-time AI agents with low latency

Переглядів 1653 місяці тому

On-device Text-to-Speech in Python to build real-time AI agents with low latency

AI powered Audio Search Engine : Voice Search for YouTube

1:09

AI powered Audio Search Engine : Voice Search for YouTube

Переглядів 653 місяці тому

AI powered Audio Search Engine : Voice Search for UA-cam

1:35

Detect & Record Audio using Python

Переглядів 3843 місяці тому

Detect & Record Audio using Python

Build your own voice assistant with end-to-end recognition using Python:

2:11

Build your own voice assistant with end-to-end recognition using Python:

Переглядів 1193 місяці тому

Build your own voice assistant with end-to-end recognition using Python:

Computer! Tea, Earl Grey, Hot: On-device Voice AI Agent on NodeJS

2:28

Computer! Tea, Earl Grey, Hot: On-device Voice AI Agent on NodeJS

Переглядів 683 місяці тому

Computer! Tea, Earl Grey, Hot: On-device Voice AI Agent on NodeJS

Build your own Voice Assistant with End-to-End Voice Recognition using Python

2:14

Build your own Voice Assistant with End-to-End Voice Recognition using Python

Переглядів 2503 місяці тому

Build your own Voice Assistant with End-to-End Voice Recognition using Python

Add custom wake words to your application using Python

1:49

Add custom wake words to your application using Python

Переглядів 2413 місяці тому

Add custom wake words to your application using Python

Fast Real-time Speech Recognition with zero latency using Python

2:11

Fast Real-time Speech Recognition with zero latency using Python

Переглядів 3834 місяці тому

Fast Real-time Speech Recognition with zero latency using Python

How to Run Llama Locally on Mac using Node.js and picoLLM Inference

1:46

How to Run Llama Locally on Mac using Node.js and picoLLM Inference

Переглядів 2034 місяці тому

How to Run Llama Locally on Mac using Node.js and picoLLM Inference

1:46

How to Run a Local LLM using Node.js

Переглядів 4154 місяці тому

How to Run a Local LLM using Node.js