A.I. Speech to Text Software - Speech Note for Linux

  • Published 3 Oct 2024
  • TechHeart takes a look at Speech Note, a new open-source Linux application that provides speech-to-text and text-to-speech functionality. It runs locally on your machine, so no data goes to the cloud, or anywhere! It's powerful, with many languages and voices available - a very robust new piece of software that we just have to take a look at!!!
    Find us on Discord
    TechHeart.life

COMMENTS • 15

  • @Rbourk252 19 days ago +1

    Enjoyed this demonstration. Installed the app. Using it a lot.

  • @jamesb2877 6 months ago +1

    I really like this program. I'm very much disabled and have difficulty using my hands. I used to use only Google's equivalent and didn't really like it, but it was the only thing we had. Now I have something on Linux.

    • @techheart6090 6 months ago

      That's great, James!! I was blown away w/ SpeechNote and I still use it b/c it's useful to me - I can only imagine how important it is for you! You might like the new Plasma 6 desktop environment as they have some good accessibility tools in it - I just dropped a video covering installing it on Arch Linux. :P Go take a look - thanks for being here!!

  • @erleuchtungenl 4 months ago +2

    Great video! Thanks a lot!

  • @TheGrateful108 14 days ago

    What about web reading, web Kindle reading, and audio file to text?

  • @TangBengYong 18 days ago

    The video title is Speech to Text, not Text to Speech, but the video content is almost all about text to speech.

    • @techheart6090 14 days ago

      You're right - sorry about that, I'll make changes.

  • @michaelwright2986 6 months ago +1

    I agree that Speech Note is not the most polished app, but it's the first I know that lets you use LLMs without rolling your own Python scripts.
    The quality of speech to text depends strongly on what model you use. Vosk and Coqui, as demonstrated, are not much better than you can get on your phone. My first trial (in English) was with a Whisper model (FasterWhisper Large) on a pretty nerdy bit of medieval history text, and I was genuinely astonished at how good it was. A quick try with the small Vosk model, however, produced output that was good for a laugh, but not much better.
    As one quickly sees with transcription of voice to text, the models take so much computation that we're going to have to wait a long time for real-time translation from speech input. For the time being, the approach is speech to text, then translating the text. I'm also not sure how good the translation capacity of the present models is: a quick test showed something a bit better than Google Translate (perhaps) but still with the sort of errors that a human learner would have had drilled out of them in 101.
    For my needs, the way Speech Note makes the best models accessible locally without programming skills is transformational, but I'm sure we will see (soon) front ends with more facilities; but Speech Note is quick enough to get used to if you want to use it. A big plus is that it can read and transcribe a pre-recorded MP3 file, which can be a very good way of working. You have to edit its output, but that's true of stuff you compose at the keyboard, and the output of these systems has at least already been through the spell check. For people with difficulty typing, this is the beginning of something big.

    • @techheart6090 6 months ago

      Thanks for the post; it's more info than my video!! :P

  • @Jose-Beltran 9 months ago

    I used this app on Fedora and it worked well, but now that I have moved to Arch it closes on me and doesn't work, and I don't know what happened. Please, I need help to make it work.

    • @techheart6090 9 months ago

      Flathub / Flatpak isn't an option?? I always put Flatpak on Debian to pad its package capabilities...

  • @LossyLossnitzer 2 months ago

    Does it work from the CLI?

    • @techheart6090 2 months ago

      I believe this is a GUI application - but I did not double-check...

  • @Jose-Beltran 9 months ago

    I think it must be a dependency.