Creating a Speech to Text Program with Python

CS Coach

Published 10 Sep 2024

COMMENTS • 88

@Knot2goodAtIt 2 months ago ⁺²
I had nooooo idea this was so straightforward! Thank you! I want to create a translator and I think this is the perfect base!
@danoconnell5292 5 months ago ⁺¹
I've spent 5 days trying to figure this out, knowing literally nothing about virtual assistants you can talk to. Given the learning curve, I wish I had watched your video first. You're the best to learn from so far, my man. Nice job.
@CSCoach 5 months ago ⁺³
That means a lot, man. Thanks so much for the feedback.
@ymhtpat 6 months ago ⁺²
Nvm, figured it out. For those of you asking about the last step on Windows: open Notepad and type " C:\Users\[your computer]> " then the first prompt, then do the same for the second. Save the file as a .txt. It should work from there.
@File_corupt 7 months ago ⁺²
Yo, this tutorial is great. As a noob I have tried to watch other tutorials, but they talk like I already know the stuff, and I don't. You explain it to me really well.
@DevonAIPublicSecurity 10 months ago ⁺⁶
Hey Oscar, this was a great tutorial. Keep doing the work you are doing; this was very clear and it made perfect sense.
@CSCoach 5 months ago
Thanks for the feedback. It means a lot. Going to ramp up the channel with more content in the coming months :)
@artetridimensionale 9 months ago ⁺⁵
OK, so it's a bit simple. The problem is when you stop talking: you have to make sure the text is justified and doesn't end up at the end of a line every time you interrupt yourself.
@goner007 6 months ago
This worked so well. I never knew it was that easy until you explained it. Hope to learn more from you, thanks.
@ColinTimmins 1 year ago ⁺⁵
Cool stuff! I have struggled so much with language as I’m extremely dyslexic. ChatGPT has opened up the door 🚪 for me! 😊
@CSCoach 1 year ago
That's awesome :D So glad ChatGPT is able to help you :)
@ymhtpat 6 months ago ⁺³
I'm hung up on the last step. I'm on the latest Windows OS. Are you running touch output.txt and tail output in the Mac's equivalent of Command Prompt? I ran it on mine and got " 'touch' is not recognized as an internal or external command, operable program or batch file."
Where did I go wrong?
@joyaljijimon3419 4 months ago
same here bruh
@poisoned_durian8 5 months ago ⁺¹
I'm using Windows. I'm not sure if the problem is the commands "echo. > output.txt" and "type output.txt | more".
@mmjuuno 1 year ago ⁺²
How do I see the appended messages in the terminal? It worked, and I could open the output.txt file and see what I was saying, but I'm not sure how to see it in real time (using Windows 10 with Kali Linux installed through the Windows Subsystem for Linux, then opening bash in cmd). Also, when I stop the script (using PyCharm) it spits back a bunch of errors and I'm not sure why, or at least I think they're errors.
@CSCoach 1 year ago
I did this by running the tail command in another terminal. You could also change line 43 to be print(text) rather than print("Wrote text").
@kavito5947 8 months ago
@CSCoach what's the name of the terminal you used?
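For Windows readers without tail, the same effect can be approximated in Python. This is a rough sketch, not code from the video: it assumes the script appends one line per phrase to output.txt, and the function name follow and its parameters (poll_seconds, max_idle_polls, from_start) are made up for illustration. In PowerShell, Get-Content output.txt -Wait does a similar job, and in Command Prompt type nul > output.txt creates the empty file.

```python
import os
import time
from pathlib import Path


def follow(path, poll_seconds=0.5, max_idle_polls=None, from_start=False):
    """Yield lines as they are appended to `path`, roughly like `tail -f`.

    max_idle_polls and from_start are hypothetical knobs: the first lets the
    loop end after that many empty reads in a row, the second replays lines
    that are already in the file.
    """
    file_path = Path(path)
    file_path.touch(exist_ok=True)  # creates the file if missing, like `touch`
    idle_polls = 0
    with file_path.open("r", encoding="utf-8") as handle:
        if not from_start:
            handle.seek(0, os.SEEK_END)  # skip what is already in the file
        while max_idle_polls is None or idle_polls < max_idle_polls:
            line = handle.readline()
            if line:
                idle_polls = 0
                yield line.rstrip("\n")
            else:
                idle_polls += 1
                time.sleep(poll_seconds)  # nothing new yet, wait and retry


# Example use in a second terminal (runs until interrupted with Ctrl+C):
# for new_line in follow("output.txt"):
#     print(new_line)
```

Run the transcription script in one terminal and this loop in another; each new phrase should show up shortly after it is written.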
@Yajnco 4 months ago
My language, Hmong, is not available for voice to text. I have been searching for programs or guides that show how it's done, but can't find any. Not sure if Python is the solution. It's frustrating not knowing where to start.
@fransuacordero5407 1 month ago
One question: does this work only for English, or can it be used with other languages?
@pattuchiitu8978 7 months ago
Thank you for the tutorial. It works now 👋
@Kenoki-yi3gf 3 months ago
I am doing this on Windows and I can't seem to find where the text is saved. Can someone please help?
@mohitpandya_2228 6 months ago
This works fine on the first run. After the first run, the generated text takes about 10 minutes to reach the output. How can I fix that and make it as fast as the first run?
@alexanderkartvelian4274 8 months ago
You are awesome! It would be great if you taught us how to send the generated text from the recognizer into "Text to speech". Thanks for the video!
@SirMrMystery 6 months ago
He already did something like that, where he created a Jarvis-like program: ua-cam.com/video/BEw5EFqCCEI/v-deo.htmlsi=wEeSWa18kFPppBUF
@MrScgaming28 10 months ago ⁺²
Can you tell me how to do the last step of making that output file and tailing the outputs on Windows?
@anderson3889 6 months ago
Did you find a way?
@arvindh13 5 months ago
On Windows it will record your audio and write it to the file you open in Notepad, but you need to restart Notepad to see it.
@sandrasajeev8640 1 month ago
@arvindh13 Hey! I'm also stuck at this step. Can you please explain where I have to put the commands to touch and tail the output?
@aotrakstar 1 year ago ⁺¹
Hi Coach. Do you think using OpenAI's Whisper would give a more accurate transcription?
@CSCoach 1 year ago
I found the Python library to be enough personally. Though I would imagine Whisper to be better, given that it's made by OpenAI. I'd expect it to be a better-trained AI.
@Ddetektiv 1 year ago
How do you get the terminal? When I clicked on Terminal, a new one opened showing PS C:\Users\[my name]>, but as far as I can see, yours doesn't show that. Any way to fix it?
@snaxsammy6472 10 months ago
At the top bar of the terminal, where it says "Problems", click the drop-down arrow next to the "+" on the far right of that bar, then click "Command Prompt". You should be able to run the code there. At least that's what I did, and it worked for me.
@user-uh7cu6nk5d 21 days ago
Could you please post a link to the full file?
@Hbdisaster_28 2 months ago
For me, whenever I say something, the terminal shows "Wrote text", but I don't know where it writes the text (I'm on Windows, btw). Can anyone please help?
@arpitv2003 2 months ago ⁺¹
It should be creating a text file named "output.txt" and writing into it.
@Hbdisaster_28 2 months ago ⁺¹
Thank you
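If output.txt seems to be missing, it may simply be in a different folder than expected. The sketch below assumes the script opens "output.txt" as a relative path, in which case the file lands in the folder the script was started from (in an editor this is often the project folder). The helper name resolve_output_path is hypothetical.

```python
from pathlib import Path


def resolve_output_path(filename="output.txt"):
    """Return the absolute path a relative filename would be written to."""
    # Relative paths are resolved against the current working directory,
    # which is not necessarily the folder that contains the script.
    return Path.cwd() / filename


print(f"Transcript file: {resolve_output_path()}")
```

Adding a print like this at the top of the script shows exactly which folder to look in.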
@tugpsx640 1 year ago
This is great, thanks for sharing your tips and tricks.
@CSCoach 1 year ago
You bet!
@user-ve8pd3lg9t 4 months ago
@CSCoach what is the name of the app you used in the video?
@mohamed_Sultan1504 4 months ago
Does anyone know how to put it in a user-friendly app so anyone can use it easily?
@Illogical. 9 months ago
I need a thing that does a few steps less than what this does. I only want it to record phonetics, maybe spacing between words, maybe intonation, and maybe recognize when I'm pausing to remember a word.
@OmarAbdelrazek-tn9du 9 months ago
I still don’t understand how to download the libraries. I have an M2 Mac and the documentation you gave didn’t help me. Can you or anyone else help me?
@felixforster5836 8 months ago
You have to type "pip3 install ...." in the terminal.
@thunderchills640 21 days ago
thanks
@manchishirisha8013 1 month ago
Where are you writing the code?
@usus8420 4 months ago
Hi, is it a desktop-only solution?
@anirudhsrisai3397 1 year ago
Think about integrating it with ChatGPT and getting answers just by using the text we get from speech to text.
@CSCoach 1 year ago ⁺¹
Haha, did you see this video I made? :) I believe I do what you suggested in the comment.
ua-cam.com/video/BEw5EFqCCEI/v-deo.html
@snaxsammy6472 10 months ago ⁺¹
Hi Coach! MrScgaming28 and I, from the comment section, seem to have the same issue. Could you re-explain how to do the last step of making that output file and tailing the outputs on Windows?
@arvindh13 5 months ago
Create a text document, copy the source, and paste it into the program.
@story-4-you-man 3 months ago
@arvindh13 can you explain this? program?
@mohamed_Sultan1504 4 months ago
A real king
@user-db4nm2rp2w 6 months ago
If I only want to do speech to text, then I don't need to install PyAudio, right?
@arvindh13 5 months ago
Yes, you do need PyAudio and the Speech recognition fork library.
@Anilkumar-zs1pq 6 months ago
How do I stop recording the voice?
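One common way to stop a script that loops forever is Ctrl+C in the terminal. That raises KeyboardInterrupt, and if nothing catches it Python prints a traceback, which may be the "bunch of errors" mentioned in an earlier comment. The sketch below assumes the script runs an endless record-and-write loop; run_until_interrupted and step are hypothetical names, with step standing in for one round of listening and writing.

```python
def run_until_interrupted(step):
    """Call step() repeatedly until the user presses Ctrl+C.

    Returns how many times step() finished before the interrupt.
    """
    completed = 0
    try:
        while True:
            step()
            completed += 1
    except KeyboardInterrupt:
        # Ctrl+C lands here, so the script exits without a traceback.
        print("Stopped listening.")
    return completed
```

Wrapping the main loop this way keeps the behavior the same but makes stopping the script look intentional rather than like a crash.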
@JamesDonkor-edu 7 months ago ⁺¹
Hey, great video! Would this work for other languages?
@danielcasas7343 4 months ago
I also have the same question
@dungnguyen-si2sz 4 months ago
I am having trouble with this error:
1 error generated.
error: command '/usr/bin/clang' failed with exit code 1
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building wheel for pyaudio
Failed to build pyaudio
ERROR: Could not build wheels for pyaudio, which is required to install pyproject.toml-based projects
I can't install the last two things you gave us. Hope you reply soon. Thank you for the nice work, sir.
@mikethompson6455 4 months ago
Getting the same error. Hope someone can help.
@mikethompson6455 4 months ago ⁺¹
Did this. I have macOS, so:
If you're using macOS, you can install the necessary tools using Homebrew. First, make sure you have Homebrew installed, then run:
brew install portaudio
After installing portaudio, you can try installing PyAudio again using pip3 install pyaudio.
@gleful262 3 months ago
Had this issue on Windows; fixed it by running as admin.
@furkanezgin432 10 months ago
Hi Oscar, does it work for French?
@jimjones26 1 year ago
I am going to work on adding a "trigger" word that will let the program know I want to act.
@CSCoach 1 year ago
That's cool!! If you get that to work, I'd love to know how
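One simple way a trigger word could work is to check each transcribed phrase for the word and act only on what follows it. This is an illustrative sketch, not something shown in the video: the function name extract_command and the trigger "computer" are made up, and it assumes the recognizer returns plain text without punctuation.

```python
def extract_command(transcript, trigger="computer"):
    """Return the words spoken after the trigger word, or None if it is absent."""
    words = transcript.lower().split()
    trigger = trigger.lower()
    if trigger not in words:
        return None  # no trigger word heard, so ignore this phrase
    start = words.index(trigger) + 1
    return " ".join(words[start:])


# Example: only the part after the trigger word is treated as a command.
print(extract_command("hey computer write this down"))
```

A phrase without the trigger returns None, so the main loop can skip it and keep listening.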
@adrio1569 8 months ago ⁺¹
How can we add languages to it?
@arvindh13 5 months ago ⁺¹
You need pre-designed language libraries, or you need to make a new one with a lot of data.
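For the several questions about other languages: if the script uses the SpeechRecognition library's recognize_google method (an assumption, since the full source is not posted here), that method accepts a language argument holding a language tag such as "fr-FR" or "es-ES". Whether a particular language is supported depends on the speech service. The wrapper name transcribe below is hypothetical.

```python
def transcribe(recognizer, audio, language="fr-FR"):
    """Transcribe audio with the Google Web Speech backend in the given language.

    `recognizer` is expected to be a speech_recognition.Recognizer instance
    and `audio` the AudioData returned by recognizer.listen().
    """
    return recognizer.recognize_google(audio, language=language)


# Example use with a microphone (needs SpeechRecognition and PyAudio installed):
# import speech_recognition as sr
# r = sr.Recognizer()
# with sr.Microphone() as source:
#     audio = r.listen(source)
# print(transcribe(r, audio, language="es-ES"))
```

Leaving the argument out falls back to the library's default, which is US English.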
@sicfrydred 1 year ago ⁺¹
What program did you use to do this?
@CSCoach 1 year ago ⁺²
Did it in Python :)
@Bartdu59Gaming 1 year ago
He used "VS Code" and the language used for the program is Python.
@kavito5947 8 months ago
@Bartdu59Gaming what's the name of the terminal he used?
@simthatweek 6 months ago
Why import pyttsx3? It is never used.
@arvindh13 5 months ago
Yes, it's not required.
@user-qj3ig7qz3y 11 months ago
I couldn't import pyttsx3. It says unused and there are a lot of errors 😂
@arvindh13 5 months ago
You need to install it; it's not a built-in library.
@suissegarantiegaming2100 9 months ago
Could this also work with another language?
@CSCoach 6 months ago
Programming language? Or the text being output in a different language? Either way, I'm pretty sure the answer would be yes :)
@WolFX_FPS 1 year ago ⁺¹
I have an error with the speech_recognition library:
r.adjust_for_ambient_noise(source2, duration=0.2)
TypeError: Recognizer.adjust_for_ambient_noise() missing 1 required positional argument: 'source'
Any ideas?
@arvindh13 5 months ago
You need to provide the source before that. Please check the program again, and if it does not work, install the Speech recognition fork library.
@gleful262 3 months ago
Got this error; fixed it by adding () to Recognizer.
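The fix of adding () makes sense given how Python methods work: without the parentheses the variable holds the class itself, so the first argument passed is taken as self and source appears to be missing. The sketch below reproduces that with a small stand-in class rather than the real library, so it runs without a microphone; the stand-in's return string is invented for the demo.

```python
class Recognizer:
    """Stand-in with the same method shape as the one in the traceback."""

    def adjust_for_ambient_noise(self, source, duration=1):
        return f"calibrated on {source} for {duration}s"


r_class = Recognizer       # missing parentheses: this is the class itself
r_instance = Recognizer()  # with parentheses: an actual instance

try:
    # "mic" is bound to `self`, so `source` is reported as missing.
    r_class.adjust_for_ambient_noise("mic", duration=0.2)
except TypeError as error:
    print(f"Without (): {error}")

# With an instance, `self` is filled in automatically and the call works.
print(r_instance.adjust_for_ambient_noise("mic", duration=0.2))
```

So if the script has r = sr.Recognizer, changing it to r = sr.Recognizer() is the likely cure.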
@user-ve8pd3lg9t 4 months ago
What app did he use in this video?
@stylloofdreams975 4 months ago ⁺¹
Visual Studio Code
@hebataha5197 3 months ago
Does this work offline?
@brianmckeown70 14 days ago
^
@odogbolahan8148 3 months ago
my name is oscar too haha
@soham0726 4 months ago
source code
