RealtimeSTT: A low-latency speech-to-text library with advanced voice activity detection

Поділитися
Вставка
  • Опубліковано 7 вер 2024
  • github.com/Kol...
    Features wake word activation and instant transcription. Designed for real-time applications like voice assistants.

КОМЕНТАРІ • 12

  • @LostPostProductions
    @LostPostProductions 6 місяців тому

    Thank you for showing us your library in action as well as letting us know how we can support it!

  • @ccapp3389
    @ccapp3389 5 місяців тому

    Nice one! I look forward to trying this out

  • @aboudezoa
    @aboudezoa 9 місяців тому

    This is awesome ! Thanks

  • @modicool
    @modicool 3 місяці тому

    That's incredibly accurate. Nice work! Can you active-transcribe AND wake-word for commands? It'd be great if you could have it always listening and then do something on wake word.

    • @Linguflex
      @Linguflex  3 місяці тому

      No, currently not. The idea is good, I can see some use-cases for this. I'll think about that.

  • @datfeeshboi9526
    @datfeeshboi9526 3 місяці тому

    im gonna try to make a Vrchat STT app that puts the words above my head using their osc system :D

  • @allfather_ogre
    @allfather_ogre 2 місяці тому

    Great work..do you have any ideas to reduce latency in text to speech..im working on it..

  • @kimjongoof5000
    @kimjongoof5000 6 місяців тому

    Is it possible to make the voice dictation instantaneous at the cost of accuracy? I want to try controlling the servos on an animatronic mouth with voice dictation. It doesn't have to be accurate, it just needs to be accurate enough to be convincing and as fast as possible

    • @Linguflex
      @Linguflex  5 місяців тому

      You probably want to use whisper.cpp with a quantized tiny model and grammar sampling, look up Georgi Gerganov's chess example.

    • @Linguflex
      @Linguflex  5 місяців тому

      You could also train a wake word model to do this. They are crazy fast and reliable but specialized on few keywords. Check Openwakeword or PvPorcupine.

  • @Moicestlunehiver
    @Moicestlunehiver 4 місяці тому

    I don't understand how to use it...

    • @Linguflex
      @Linguflex  4 місяці тому

      What do you want to do?
      The "tests" folder contains some examples how you can use it:
      github.com/KoljaB/RealtimeSTT/tree/master/tests
      Maybe also the "tests" of RealtimeTTS can help, they also use RealtimeSTT a lot:
      github.com/KoljaB/RealtimeTTS/tree/master/tests