Real-time Speech to Text with DeepSpeech - Getting Started on Windows and Transcribe Microphone Free

  • Published 20 Aug 2024

COMMENTS • 288

  • @FedericoTerzi
    @FedericoTerzi 3 years ago +13

    If you are interested in these topics, you can also follow me on Twitter :) twitter.com/terzi_federico

    • @jeongwonkim247
      @jeongwonkim247 3 years ago

      was there a video on how to transcribe the audio files into text? Please let me know and thank you!

  • @KuboF
    @KuboF 3 years ago +10

    Thanks for this short, straightforward, to-the-point video! From reading the manual, I thought I would need to take a vacation just to learn how to run DeepSpeech; now I am very confident I can do it quite quickly!

    • @FedericoTerzi
      @FedericoTerzi 3 years ago +1

      Thanks! Running it is pretty easy with the prebuilt model. Things start to get real complex when you want to train your own :)

    • @KuboF
      @KuboF 3 years ago

      @FedericoTerzi Yeah, using the pre-built model is my first step to training my own 😅

    • @FedericoTerzi
      @FedericoTerzi 3 years ago +2

      Good luck! If you succeed, please let me know how hard it was :)

    • @KuboF
      @KuboF 3 years ago

      @FedericoTerzi I very much hope I can one day 😅

    • @patataboom2645
      @patataboom2645 3 years ago

      Soo have you finished? :))))

  • @dayworkhard
    @dayworkhard 4 years ago +16

    thank you for sharing. i donated my voice there. this is so cool!

    • @FedericoTerzi
      @FedericoTerzi 4 years ago +2

      That's great! :) We are one little step closer to an open voice model

  • @ALZlper
    @ALZlper 3 years ago +1

    I really like that you mention the platform at the end!

  • @SivaShankarsss
    @SivaShankarsss 3 years ago +2

    I was looking for this kind of video.
    Currently I am working on creating an AI assistant.
    This will help me a lot.

  • @dibu28
    @dibu28 2 years ago +1

    Thank you. Got DeepSpeech started in minutes.

  • @sisfabricio
    @sisfabricio 2 months ago

    Works on Windows after struggling for a while, many thanks

    • @shravanhegde2237
      @shravanhegde2237 2 months ago

      What were the struggles? Could you please tell me? I need to set it up for my project, so it would be really helpful.

  • @marly1017
    @marly1017 3 years ago +5

    Can you please do a video about implementing this code in a project?

  • @silversurfer8057
    @silversurfer8057 3 years ago +3

    Really helpful for me (I think your video is the only one on the subject?). In addition to this, a tutorial on Mozilla's TTS would be great. I would like something more detailed for that. I currently don't understand how to use new datasets to get other voices. I guess you have to train a model with a dataset. A tutorial on this would be really, really cool! Maybe you have also dealt with it? In any case, DeepSpeech and TTS can theoretically be combined well.

  • @samriviera6299
    @samriviera6299 3 years ago

    Thanks for this video! I got everything working. As you said, it's not as good as proprietary solutions, but for simple commands like "start", "stop" or "turn on light" it should work. Looking forward to contributing.

  • @jane_shi
    @jane_shi 2 years ago

    Thanks for your video! I used Python 3.8.6 and DeepSpeech v0.9.3 and it worked well!

    • @hssp1534
      @hssp1534 1 year ago

      But I'm not able to find the deepspeech library in Jupyter. How did you install it?

    • @jane_shi
      @jane_shi 1 year ago

      I just did what he showed in the video

  • @izufarahiyahizzuddin2119
    @izufarahiyahizzuddin2119 1 year ago +1

    I already ran the code, but it cannot recognize my voice. Does anyone have a solution for this?

  • @Karma-vf2qu
    @Karma-vf2qu 4 years ago +2

    Uuu, really good content here! Grandee

  • @LukeHildreth
    @LukeHildreth 3 years ago +1

    Got this working on Windows! Thanks for the tut!

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      Glad to hear that :)

    • @sauravprashar
      @sauravprashar 3 years ago

      Could you please help me? I am getting a DLL error.

    • @LukeHildreth
      @LukeHildreth 3 years ago

      @sauravprashar I'm actually not sure how to answer that. I'm pretty new to programming. Hope you find the answer!

  • @TTTrouble
    @TTTrouble 2 years ago

    Thanks so much for making this video, it was exactly what I was looking for!

  • @aznperswazinable
    @aznperswazinable 2 years ago +1

    (deepspeech) C:\Users\user\Documents\deepspeech>pip3 install deepspeech
    ERROR: Could not find a version that satisfies the requirement deepspeech (from versions: none)
    ERROR: No matching distribution found for deepspeech
    pip and pip3 are not working on Python 3.10. Any ideas?

  • @tommyboy3164
    @tommyboy3164 2 years ago +4

    I was wondering if you could help. I'm getting this error: ERROR: Could not find a version that satisfies the requirement deepspeech (from versions: none)
    Also, where do you put the two model files after you download them?

    • @KPawan108
      @KPawan108 10 months ago

      I am also getting the same error. Did you find an answer?

  • @vasanthmaisa293
    @vasanthmaisa293 10 months ago +1

    How did you get the mic_vad_streaming folder inside the deepspeech folder without doing anything?

    • @abdullamasud4278
      @abdullamasud4278 5 months ago

      He cut that part out of the video. After downloading the file, he simply copy-pasted it into the folder.

  • @sslaia
    @sslaia 3 years ago +1

    Excellent. It would be great if you could make a tutorial on how to train your own model. The big players have already done that for well-known languages. In contrast, this one could help with neglected languages like mine. So a tutorial on how to train your own model in a new language would be very helpful.

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      Thank you! Unfortunately, I don't know the model that well...

  • @sebastianochipocomancini1853
    @sebastianochipocomancini1853 3 years ago +2

    What should I do if I want to use an application like this one for another language, like Spanish?

    • @stefang5639
      @stefang5639 3 years ago

      You can download the language model for other languages as well from the source shown in the video.

  • @yacinemamdouh1271
    @yacinemamdouh1271 3 years ago

    Great Video, I had some problems but now it works. Thank you

  • @wellingtonfurtado2074
    @wellingtonfurtado2074 3 years ago +1

    Can you do a tutorial on how to use DeepSpeech in Unreal Engine?

  • @chaitanyamalpure6226
    @chaitanyamalpure6226 3 years ago +2

    Thank you for the video. Nice tutorial to get familiar with! Also, I have found a German pre-trained model. Could you please explain how to work with German or any other pre-trained model?

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      You should be able to simply pass the German model and scorer and you should be ready to go :)

    • @chaitanyamalpure6226
      @chaitanyamalpure6226 3 years ago

      @FedericoTerzi Thanks a lot. It worked!!!

  • @niharjani9611
    @niharjani9611 2 months ago

    Hey, please answer my question: how many languages does it support? English, Spanish, and so on; could you provide a list? I tried to find it on GitHub and Reddit, but was unsuccessful!

  • @yohannesayana9456
    @yohannesayana9456 2 years ago

    How can we build a speech to text model from scratch in other less resourced languages using deepspeech?

  • @khalidelgazzar
    @khalidelgazzar 2 months ago

    Great video. Thank you 😊

  • @hitlab
    @hitlab 4 years ago

    Thanks for making this man!

  • @dhanushabuddhikasandaruwan2677
    @dhanushabuddhikasandaruwan2677 2 years ago +1

    I think DeepSpeech is not compatible to run on Windows (Windows 10). Try in a Linux environment.
    pip install deepspeech -----> Did not work successfully on Windows 10
    pip3 install deepspeech -----> Did not work successfully on Windows 10
    ERROR: Could not find a version that satisfies the requirement deepspeech (from versions: none)
    ERROR: No matching distribution found for deepspeech

    • @Steven-jf4cs
      @Steven-jf4cs 2 years ago

      I just ran 'pip3 install deepspeech' from cmd and it ran like a champ. What version of Python are you running? Depending on your version, you may need to upgrade or downgrade Python accordingly. I'm running Python 3.8.5.
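
A minimal sketch of the version check this exchange points to, assuming only what commenters on this page report: Python 3.6 and 3.8 worked for them, while 3.10 found no matching distribution. `REPORTED_WORKING`, `REPORTED_FAILING` and `describe_python` are hypothetical names, and the sets are anecdotes from this thread, not an official compatibility table.

```python
import sys

# Assumption: (major, minor) pairs taken from reports in this thread only.
REPORTED_WORKING = {(3, 6), (3, 8)}
REPORTED_FAILING = {(3, 10)}


def describe_python(version_info=None):
    """Classify a Python version against the reports collected above."""
    major, minor = tuple(version_info or sys.version_info)[:2]
    if (major, minor) in REPORTED_WORKING:
        return "reported working"
    if (major, minor) in REPORTED_FAILING:
        return "reported failing"
    return "unreported"


if __name__ == "__main__":
    current = f"{sys.version_info.major}.{sys.version_info.minor}"
    print(f"Python {current}: {describe_python()}")
```

Running this inside the virtual environment before `pip install deepspeech` shows which interpreter pip is actually bound to, which is the first thing to rule out when pip answers "from versions: none".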

  • @shampoo1296
    @shampoo1296 2 years ago

    help
    Import Error: DLL load failed: no se puede encontrar el modulo especificado
    (Spanish for "the specified module could not be found")

  • @ramsimmha8672
    @ramsimmha8672 3 years ago

    It's really cool! I tried this and it's working, but it's not printing the text it listened to. Has anyone here faced this? Please help me fix it.

  • @tobiaskarl4939
    @tobiaskarl4939 3 years ago

    1) Python 3.6.5 doesn't work. I updated to 3.6.7.
    2) activate gives an error. Edit activate.bat in the Scripts folder and put a '.' after "delims=:" on line 4, then execute Scripts\activate.bat explicitly.

  • @sayyidumarshiddiq2397
    @sayyidumarshiddiq2397 2 years ago

    What should I do if my laptop has Python 3.8 installed?

  • @liamblu
    @liamblu 3 years ago

    I get stuck at installing the requirements.txt
    ERROR: Could not find a version that satisfies the requirement deepspeech~=0.8.0
    ERROR: No matching distribution found for deepspeech~=0.8.0
    Edit: I already downgraded to Python 3.9.0 which is said to be compatible...

  • @tobiaskarl4939
    @tobiaskarl4939 3 years ago

    Conflicting numpy version requirements make it fail for me.
    deepspeech 0.9.3
    numpy 1.14.4
    pip 10.0.1
    PyAudio 0.2.11
    scipy 1.5.4

  • @Luc_Skywalker
    @Luc_Skywalker 2 years ago

    ERROR: Cannot install deepspeech==0.9.3 and numpy>=1.15.1 because these package versions have conflicting dependencies.
    deepspeech 0.9.3 depends on numpy=1.12.0
    I am unable to get around this. Any ideas?

  • @soulkingdom4600
    @soulkingdom4600 3 years ago

    What is the difference between Deep Speech and Deep Speech 2?

  • @Monsieur.Nobody.
    @Monsieur.Nobody. 4 months ago

    Do you think we can run whisper or fast whisper llm on esp32's? Sort of in a form factor like the carputer or beepberry?

  • @adribmahmud
    @adribmahmud 2 years ago

    Can you please make a video on how to train?

  • @christosangelopoulos
    @christosangelopoulos 3 years ago

    Job nicely done and presented, thank you.

  • @LukeHildreth
    @LukeHildreth 3 years ago +1

    Is it possible to write these commands into a python file and just run that?

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      Sure! You can simply edit that script file to fit your needs :)

  • @doodlearsh739
    @doodlearsh739 2 years ago

    Hi, I can't install requirements.txt with pip. Can you help me?

  • @waquezemerson4863
    @waquezemerson4863 2 years ago

    Hi, can I ask how I can integrate this into my application? My application currently runs in an Ionic environment. Is it possible to integrate this one?

  • @Dumpitzz
    @Dumpitzz 4 years ago

    "Scripts\activate" is not working. I get an error: "parameter wrong -850"

  • @maputo658
    @maputo658 4 years ago

    super nice! was able to follow it successfully, but on a mac.

    • @FedericoTerzi
      @FedericoTerzi 4 years ago

      Glad to hear that :)

    • @LukeHildreth
      @LukeHildreth 3 years ago

      I'm trying this too. How did you activate the script after setting up the virtual environment?

  • @shampoo1296
    @shampoo1296 2 years ago +1

    File "C:\Users\jose\deepspeech\lib\site-packages\deepspeech\__init__.py", line 23, in <module>
    from deepspeech.impl import Version as version
    File "C:\Users\jose\deepspeech\lib\site-packages\deepspeech\impl.py", line 13, in <module>
    from . import _impl
    ImportError: DLL load failed: No se puede encontrar el módulo especificado.

  • @robc3863
    @robc3863 3 years ago +1

    Many users will need to install pipwin and pyaudio, etc. to get this to work by the way.

    • @sauravprashar
      @sauravprashar 3 years ago

      Mine is still giving me a DLL error

    • @nasocha1494
      @nasocha1494 2 years ago +1

      I wish I had read your comment before watching this tutorial. It would have saved me 3 days of trying to fix conflicting dependencies. Thank you so much!

  • @balajicmb1132
    @balajicmb1132 2 years ago

    Is code available for speech-to-text transcription with an open-source library, using Python in PyCharm or another IDE, bro?

  • @DOKOTV
    @DOKOTV 4 years ago +1

    Is this only for the English language?

    • @FedericoTerzi
      @FedericoTerzi 4 years ago

      You can search online for other pre-trained models, try to google "Deepspeech model"

  • @imsteven3044
    @imsteven3044 3 years ago

    What is the function of the scorer?

  • @samuelige9368
    @samuelige9368 3 years ago

    Can you use DeepSpeech for a diacritic system?

  • @rosarangithalagahawatta6300
    @rosarangithalagahawatta6300 2 years ago

    How can I download mic_vad_streaming?

  • @fashadahmedsiddique8412
    @fashadahmedsiddique8412 2 years ago

    Hey, is this possible in a Colab environment?

  • @ariefsaferman
    @ariefsaferman 2 years ago

    does the vad streaming work outside deepspeech? i wanna use it in another ASR framework

  • @bouchradahamni9881
    @bouchradahamni9881 3 years ago

    Very nice. Please make a video on how to train your own model.

  • @abdulbaqi6170
    @abdulbaqi6170 2 years ago

    There is an article on the internet about how to make SRT files for movies with DeepSpeech. I can't get that working on Windows. Can you make a video on how to convert audio files into text or SRT with DeepSpeech, please? It would be very useful and would increase your video views.

  • @ilyasayusuf5447
    @ilyasayusuf5447 3 years ago

    Wow great library thank you

  • @potpu
    @potpu 3 years ago

    Hi Federico, thank you for your video. do you know how to integrate Deepspeech into talon?

  • @DaeOh
    @DaeOh 1 year ago

    Thanks. I can't find the follow-up video though

    • @DaeOh
      @DaeOh 1 year ago

      Nevermind, I used Whisper for this application!

  • @watevakid
    @watevakid 4 years ago +1

    hmmm after I install DeepSpeech into my venv, I do not see "mic_vad_streaming"... any idea on how to install it?

    • @FedericoTerzi
      @FedericoTerzi 4 years ago +1

      You have to download it from the deepspeech examples: github.com/mozilla/DeepSpeech-examples

  • @sibyllasystem1209
    @sibyllasystem1209 1 year ago

    Hope we can use it in the Windows environment so that I can study foreign languages easily someday : )

  • @SivaShankarsss
    @SivaShankarsss 3 years ago

    How to train with an Indian accent?

  • @abhignaconscience358
    @abhignaconscience358 3 years ago

    At 5:04 you said you were going to show a nice little project. What is it?

  • @lemon3335
    @lemon3335 1 year ago

    How to integrate into UE4

  • @murtazahussain8224
    @murtazahussain8224 3 years ago

    Is DeepSpeech compatible with the NVIDIA RTX 3090?

  • @weweweqeqeqe3240
    @weweweqeqeqe3240 2 years ago

    Can this be used for movies?

  • @simgplusnervt4698
    @simgplusnervt4698 2 years ago

    Nice video. Can you make a video about using it on Android?

  • @sebastianochipocomancini1853
    @sebastianochipocomancini1853 3 years ago +1

    Hi! You are using a pre-trained model for this speech-to-text application. But what if you want to train this model with another dataset, for example in Spanish or Italian? What steps would you take to train the model to recognize speech in a language other than English?

    • @ThesongsIlikeThemost
      @ThesongsIlikeThemost 3 years ago +1

      Hi, you can find pre-trained models for Spanish, Italian, German, Polish, and French here: gitlab.com/Jaco-Assistant/deepspeech-polyglot

    • @sebastianochipocomancini1853
      @sebastianochipocomancini1853 3 years ago +3

      @ThesongsIlikeThemost Thank you so much, I finally found the Spanish model here: drive.google.com/drive/folders/1-3UgQBtzEf8QcH2qc8TJHkUqCBp5BBmO (which is a link that was on the URL you sent me). After replacing the .pbmm and .scorer files in the command line, it works fine for Spanish!
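
The swap described above (same script, different .pbmm and .scorer files) can be sketched as a small command builder. This assumes the `mic_vad_streaming.py` example takes `-m` for the model, `-s` for the scorer and `-f` for a WAV file, which is how I remember its argument parser; the script's `--help` output is the authority. `build_command` and the file names are hypothetical.

```python
import sys


def build_command(model_path, scorer_path=None, wav_file=None,
                  script="mic_vad_streaming.py"):
    """Build the argument list for the mic_vad_streaming example script.

    Assumption: -m, -s and -f are the script's flags for the model,
    the scorer and an input WAV file; confirm with --help.
    """
    cmd = [sys.executable, script, "-m", model_path]
    if scorer_path:
        cmd += ["-s", scorer_path]
    if wav_file:
        cmd += ["-f", wav_file]
    return cmd


if __name__ == "__main__":
    # Hypothetical file names; substitute the files you downloaded.
    print(" ".join(build_command("spanish.pbmm", "spanish.scorer")))
```

Passing the resulting list to `subprocess.run(cmd, check=True)` from inside the activated virtual environment would launch the example with the chosen language files.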

  • @stefang5639
    @stefang5639 3 years ago

    Thanks, finally a good tutorial for Deepspeech!

  • @istiyakahamedmilon6512
    @istiyakahamedmilon6512 3 years ago

    Can I use it to generate Bengali language?

  • @techtree1369
    @techtree1369 2 years ago

    Thank you!

  • @ragnov3286
    @ragnov3286 2 years ago

    Can you also integrate DeepSpeech into a web app with some API? Thanks

    • @FedericoTerzi
      @FedericoTerzi 2 years ago

      If you're using Chrome or Safari, you might want to check out the Web Speech API, which is much simpler for web apps :) developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API

  • @Cezar-on8lb
    @Cezar-on8lb 7 months ago

    Hello! How does DeepSpeech compare with OpenAI Whisper?

    • @FedericoTerzi
      @FedericoTerzi 5 months ago

      No reason not to use Whisper today! It's amazing

  • @jargolauda2584
    @jargolauda2584 3 years ago

    IBM Via Voice worked perfectly already in 1998, I wonder what happened to it? With IBM Via Voice you could speak and the text was fed into text editor.

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      There are a ton of great (commercial) speech-to-text products out there. The biggest selling point of DeepSpeech, even though it doesn't perform as well as commercial alternatives, is that it's open source and free to use, which opens up a ton of possibilities by itself! Unfortunately, the future for DeepSpeech is uncertain at the moment, as Mozilla is cutting all non-essential projects...

    • @tamgaming9861
      @tamgaming9861 3 years ago

      @FedericoTerzi Can you make a tutorial for Python 3.8 or higher? I can't downgrade Python, and the higher DeepSpeech versions have different file types now. It would be awesome if you could also show how to train your own model. I would love to do it on Ubuntu, because it's also free.

    • @murtazahussain8224
      @murtazahussain8224 3 years ago

      @FedericoTerzi Fed, can you help me with my project? I'm willing to pay or hire any developer if you can help.

  • @CULTURE_dz
    @CULTURE_dz 4 years ago

    Hello, I installed everything like you did, but in the end a missing DLL message appears:
    ImportError: DLL load failed: Une routine d’initialisation d’une bibliothèque de liens dynamiques (DLL) a échoué.
    (French for "A dynamic link library (DLL) initialization routine failed.")
    Can you help me please? Thanks

    • @at-ro9217
      @at-ro9217 4 years ago

      same issue here

    • @FedericoTerzi
      @FedericoTerzi 4 years ago +1

      Hey guys, try with these steps: github.com/tensorflow/tensorflow/issues/23683#issuecomment-532522740

  • @drin1drin
    @drin1drin 2 years ago

    How can I implement an Italian Recognizer?

    • @FedericoTerzi
      @FedericoTerzi 2 years ago

      You might prefer Vosk with an Italian model for that :) alphacephei.com/vosk/

  • @ritwikghorui2731
    @ritwikghorui2731 3 years ago

    Thank you so much. If anyone has done this in a Python file, please share the link. I'm facing some problems and I have a deadline coming up, so please help me.

  • @robc3863
    @robc3863 3 years ago +1

    Thanks for the video! Is there any guidance on how to integrate DeepSpeech into an application on Windows? I'm sure that would be very useful for developers! Thanks!

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      Hey, if your app is written in Python, the integration would be pretty easy. Otherwise, your best bet is to look at "tensorflow-lite deepspeech", although I don't have any experience with that.

    • @robc3863
      @robc3863 3 years ago +1

      @FedericoTerzi Hi, thanks, but our app is C++, and so far we have not found any example of binding DeepSpeech to it. We also don't have many clients with NVIDIA GPUs...

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      NVIDIA GPUs are really not needed (as long as you are not training the model on the client's PC); the CPU will handle inference fine for most use cases. Regarding the lack of examples, I'm sorry about that; the recent Mozilla layoffs probably did not help the project...

  • @user-xk4sj2lz9h
    @user-xk4sj2lz9h 3 years ago

    What should I add to change the voice recognition language?

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      If you are lucky, you might be able to find a pretrained model for your language online. At that point, you can simply point the script to the other model. If you can't find it, then you could create your own model in theory, but that is very difficult in practice

  • @sauravprashar
    @sauravprashar 3 years ago +2

    Hi, I am new to coding and I am getting an error (ImportError: DLL load failed while importing _impl: A dynamic link library (DLL) initialization routine failed.). I have tried searching on Google but found no solution so far. I am using PyCharm. Please help, and please try to make things clear for a beginner.
    I want to use it for making a voice assistant, so any other suggestions would be appreciated.

    • @princessobuzor1992
      @princessobuzor1992 2 years ago

      I'm stuck on that too! please help :(

    • @sauravprashar
      @sauravprashar 2 years ago

      @princessobuzor1992 I didn't find a solution, so I eventually dropped it. Also try going to their forum.

    • @princessobuzor1992
      @princessobuzor1992 2 years ago

      Yeah, I'm gonna try that. The last forum I went on wasn't very helpful :( or friendly 😅

  • @mytop5602
    @mytop5602 3 years ago

    Amazing, thank you. Can you please make a new video on how to install it on Debian and train it?

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      Thank you! The installation process should be pretty similar on Debian, as long as you have the right python version. Regarding training the model, that's very difficult and expensive to do...

  • @explorefoodculture
    @explorefoodculture 2 years ago

    Hi Terzi, can this software run on a Mac? And can it translate movie videos into any language? Thanks in advance!

  • @droidsons1371
    @droidsons1371 3 years ago

    Nice tutorial!
    So I have a custom-trained language model with a .model extension. How do I convert it into a .scorer file?

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      Thanks! They are two different things, you can't convert one into the other :)

  • @jeongwonkim247
    @jeongwonkim247 3 years ago

    was there a video on how to transcribe the audio files into text? Please let me know and thank you!

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      Yes, you can use the script to transcribe audio files as well, but be prepared for some not-so-good results. Check the script's "--help" option.

  • @ahmedsaeed5149
    @ahmedsaeed5149 2 years ago

    Thank you thank you thank you

  • @waveNiaC
    @waveNiaC 4 years ago

    Can we somehow play with the energy (loudness) levels at which audio is captured, triggering the transcription? I mean, every little sound triggers DeepSpeech, while we want it to be triggered only when a person speaks. Can an energy threshold somehow be set? I'm working on it, but I could save some time if there is already a solution. There seems to be a condition in vad_collector() that I am finding hard to understand. Thank you

    • @FedericoTerzi
      @FedericoTerzi 4 years ago

      Hey, yes that's almost surely possible by playing around with the audio stream. I don't know exactly how though
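
One way to experiment with the energy threshold asked about here is a loudness gate applied to each raw audio frame before it reaches the voice activity detector. The sketch below is not part of the example script: `rms`, `loud_enough` and the threshold of 500 are hypothetical, and it assumes frames are 16-bit mono PCM bytes in native byte order.

```python
import array
import math


def rms(frame_bytes):
    """Root-mean-square amplitude of a 16-bit mono PCM frame.

    Assumption: samples are signed 16-bit integers in native byte order.
    """
    samples = array.array("h", frame_bytes)
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def loud_enough(frame_bytes, threshold=500.0):
    """Return True when the frame's RMS reaches the (hypothetical) threshold."""
    return rms(frame_bytes) >= threshold


if __name__ == "__main__":
    quiet = array.array("h", [10, -10] * 160).tobytes()
    loud = array.array("h", [3000, -3000] * 160).tobytes()
    print(loud_enough(quiet), loud_enough(loud))
```

Frames that fail `loud_enough` could be treated as silence before the check inside `vad_collector()`. Separately, the webrtcvad library accepts an aggressiveness mode from 0 to 3, so raising that value is worth trying before adding a custom gate.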

  • @1979gian
    @1979gian 3 years ago

    Hi Federico, thanks for the fantastic tutorial! I was wondering if you could kindly make one with the Italian model for beginners like me.

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      Hi Gianluca, thanks for the compliments! I can't promise anything since it's not my area of expertise, but I'll make a note of it :)

  • @Piriponzolo
    @Piriponzolo 3 years ago

    Hi, Federico. Congratulations on the video, very nice and interesting! Does DeepSpeech also work for Italian?

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      Thanks a lot! Yes, there is an Italian model. The performance isn't the best, but it works: github.com/MozillaItalia/DeepSpeech-Italian-Model

    • @Piriponzolo
      @Piriponzolo 3 years ago

      @FedericoTerzi Hi and thanks, Federico. I unpacked the zip, but then I got stuck. How do I proceed?

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      After that, the process should be similar to the one in the video, although I have never tried running it directly (I only ran some tests with the Telegram bot that uses it). You'd be better off looking at the examples in the repo or contacting the maintainer, who seems very knowledgeable about it :)

    • @Mr_Yod
      @Mr_Yod 3 years ago

      @FedericoTerzi Thanks: I'll try it.
      The other systems I tried in Python are atrocious or require an internet connection (Google's).
      Still, being compatible only with Python 3.6 when we've been on 3.9 for a while now... =(
      EDIT: The link you posted says "Requirements: 'Python 3.7+'"

  • @at-ro9217
    @at-ro9217 4 years ago

    Everything looks fine, except that it is not working:
    Traceback (most recent call last):
    File "C:/Users/Administrator/PycharmProjects/Deepspeech/mic_vad_streaming/mic_vad_streaming.py", line 9, in <module>
    import deepspeech
    File "C:\ProgramData\Anaconda3\envs\DeepSpeechEnv\lib\site-packages\deepspeech\__init__.py", line 23, in <module>
    import deepspeech.impl
    File "C:\ProgramData\Anaconda3\envs\DeepSpeechEnv\lib\site-packages\deepspeech\impl.py", line 13, in <module>
    from . import _impl
    ImportError: DLL load failed: A dynamic link library (DLL) initialization routine failed.
    Process finished with exit code 1

    • @FedericoTerzi
      @FedericoTerzi 4 years ago

      Hey, are you sure you are using Python 3.6?

    • @at-ro9217
      @at-ro9217 4 years ago

      @FedericoTerzi Yes. Maybe you could make a video about how you set up your env from bare metal?

  • @maxge8504
    @maxge8504 3 years ago

    Interesting topic. I expected to use it in my project, but when I tested it, it didn't recognize my voice as well as yours :(
    It catches 50% of my words, and most of the time it writes the wrong one :(
    But thank you anyway!

    • @FedericoTerzi
      @FedericoTerzi 3 years ago +1

      Yeah, I've experienced the same problem myself. The model is not comparable with cloud-based solutions as of now, especially for non-native speakers like me :)

  • @purushothaman2783
    @purushothaman2783 3 years ago

    Please show how to use it as a Python API.

  • @freegsbox
    @freegsbox 3 years ago

    Awesome!! can it recognize from files too? and how, please?

    • @FedericoTerzi
      @FedericoTerzi 3 years ago +2

      If I'm not mistaken, the script used in the video also accepts an argument for WAV files :)

  • @elhammadjidi3038
    @elhammadjidi3038 2 years ago

    How do we fine-tune it with our own dataset?

  • @patataboom2645
    @patataboom2645 3 years ago

    I'm working on a college project and I need to make the speech-to-text in my language. Any idea how to use deepspeech in Romanian? I saw the language is available

    • @mozes_ma
      @mozes_ma 3 years ago

      Hey, similar challenge here, any ideas so far?

  • @parinaypanwar2027
    @parinaypanwar2027 3 years ago

    Bro, I am getting this error: Could not find a version that satisfies the requirement deepspeech

  • @mo9204
    @mo9204 2 years ago

    How much work and time does it need to create a library for new language with its own rules which are not in these libraries?

    • @FedericoTerzi
      @FedericoTerzi 2 years ago +1

      A lot of time, effort and computational power :) You might also want to check out Vosk alphacephei.com/vosk/models

    • @mo9204
      @mo9204 2 years ago

      @FedericoTerzi Are there tutorials for creating and training your own model?

  • @mouradtoumi7296
    @mouradtoumi7296 3 years ago

    I have no skills in Python. I'm trying to read from a WAV file instead of the mic and display metadata. I tried the -f arg but it didn't work :( Any help?

    • @tamgaming9861
      @tamgaming9861 3 years ago +1

      I haven't got it to work because I can't install Python 3.6; my Python is already newer. But from what I read, you need a specific WAV format. If I remember correctly it was 8-bit, mono and 16 kHz, but I'm not sure. MP3 does not work so far. There are some online tools that can convert MP3 to WAV. Hope it helps.
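
The format question above can be checked with Python's standard `wave` module before handing a file to the script. As far as I know, the released English DeepSpeech models expect 16 kHz, 16-bit, mono PCM (the bit depth is the part the comment above was unsure about), so treat `EXPECTED` as an assumption and adjust it for other models. `wav_format` and `format_problems` are hypothetical helper names.

```python
import wave

# Assumption: 16 kHz, 16-bit (2 bytes per sample), mono PCM input.
EXPECTED = {"channels": 1, "sample_width_bytes": 2, "sample_rate_hz": 16000}


def wav_format(path):
    """Read channel count, sample width and sample rate from a WAV header."""
    with wave.open(path, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": wav.getframerate(),
        }


def format_problems(path):
    """List every header field that differs from EXPECTED (empty list if none)."""
    actual = wav_format(path)
    return [
        f"{key}: expected {value}, got {actual[key]}"
        for key, value in EXPECTED.items()
        if actual[key] != value
    ]
```

An empty list from `format_problems("speech.wav")` means the header matches; otherwise each entry names the field to fix when converting the audio.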

  • @anujsharma-my5ll
    @anujsharma-my5ll 3 years ago

    Hello, I am a visually impaired person. How can I get a setup file of Mozilla TTS for the screen reader called NVDA? Is it possible?

    • @FedericoTerzi
      @FedericoTerzi 3 years ago

      Hey, unfortunately, I don't think the deepspeech project is good enough yet for your needs...

  • @fahrul8025
    @fahrul8025 3 years ago

    Awesome video! I followed the instructions step by step, but at the last step I have a problem: "ModuleNotFoundError: No module named 'webrtcvad'". Can you help me with this problem? Thanks.

    • @FedericoTerzi
      @FedericoTerzi 3 years ago +1

      You might need to install the package with: "pip install webrtcvad"

    • @fahrul8025
      @fahrul8025 3 years ago

      @FedericoTerzi Awesome! It works now. But when I'm speaking there are no results. I'm sure my microphone works well. Any suggestions?
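
A missing module like the one in this exchange can be spotted before running the script. The sketch below only checks whether each import name resolves in the current environment; `THREAD_MODULES` lists packages named by commenters on this page (PyAudio imports as `pyaudio`), not the example's full requirements file, and `missing_modules` is a hypothetical helper.

```python
import importlib.util

# Import names of packages mentioned in this thread; not a complete list.
THREAD_MODULES = ["deepspeech", "webrtcvad", "pyaudio", "numpy", "scipy"]


def missing_modules(names=THREAD_MODULES):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All listed modules were found.")
```

Run it with the same interpreter that runs `mic_vad_streaming.py`; anything it lists can then be installed with pip inside that virtual environment.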

  • @zy-blade
    @zy-blade 3 years ago

    Now I need to find out how to build the native client (C++) version of this. Has anyone already done that? I couldn't find any good information.

    • @imsteven3044
      @imsteven3044 3 years ago

      Why do you want to do that?

    • @zy-blade
      @zy-blade 3 years ago +1

      @imsteven3044 For an Unreal Engine implementation. But I already did some prototyping and will use another STT library for that.

    • @imsteven3044
      @imsteven3044 3 years ago

      @zy-blade Oh, that is awesome. If you have videos or a GitHub repo, I would like to see your project.

  • @Luc_Skywalker
    @Luc_Skywalker 2 years ago

    You know, it would have been nice if you had mentioned in your title that this would be in Python!