Speech To Text using ESP32

Поділитися
Вставка
  • Опубліковано 4 жов 2024
  • This video will guide you with how to convert any speech to text which you can further use for any of your projects.
    This step is very useful for our upcoming project which is making our own voice assistant based on ChatGPT. Stay Tuned.....
    To try out free trial version of Altium, click here - www.altium.com...
    Get the best quality components from Octopart:
    octopart.com/
    ⚡️⚡️ Order all components required for making this project using this single link - techiesms.com/...
    You can support me🙏🏻 and also get benefitted on Patreon :- / techiesms
    __________________________________________________________
    👨🏻‍💻 Important Links 👨🏻‍💻
    ESP32 ChatGPT Series Playlist - • Running ChatGPT on ESP...
    Google Cloud URL - cloud.google.c...
    Code used in this video - github.com/tec...
    __________________________________________________________
    Buy the components for your projects and support techiesms💡🔌
    ⚡️⚡️ Order all components required for making this project using this single link - techiesms.com/...
    Or if you want to order only specific components then here is the list and link,
    ESP32 Dev board - www.techiesms....
    INMP441 Microphone -www.techiesms....
    and many more such components are available at www.techiesms.com
    __________________________________________________________
    ⏱ Time Stamps ⏱
    __________________________________________________________
    🎥 🎙 My Video Gear 🎙🎥
    Camera - Canon 750d
    Lens - 50mm f/1.8 STM Lens (amzn.to/32gqnsi), 24mm f/2.6
    Key Light - Godox SL60W (amzn.to/3522mHe)
    Fill Light/Back Light - Osaka Lightning Setup (amzn.to/2I5cdDi)
    Tripod - amzn.to/3uNhC4H
    Microphone - Zoom H1N(amzn.to/3Osh12A)
    Editing - Final Cut Pro X on MacBook Pro ( amzn.to/3vHcSyJ )
    __________________________________________________________
    Get connected with techiesms everywhere..
    Subscribe Main Channel :- bit.ly/techiesm...
    Subscribe Shorts Channel :- bit.ly/techies...
    Facebook :- / techiesms
    Twitter :- / imtechiesms
    Instagram :- / techiesms
    Website :- www.techiesms.com
    Telegram - @techiesms
    __________________________________________________________
    techiesms
    explore | learn | share

КОМЕНТАРІ • 165

  • @gauravdashora5757
    @gauravdashora5757 Рік тому +16

    In order to transcribe long form of Audio more than 1 minute. We need to use Speech-to-Text API and asynchronous speech recognition. In order to use that we need to enable the Google Cloud Storage Bucket. In this storage bucket, we need to store the long form of Audio digitally in the bucket and process those digital audio in the same way what you are doing for 2-3 second Audio . Excited to see the final outcome.

  • @Michael-lo3ht
    @Michael-lo3ht Місяць тому +2

    I have to slow video down to 75%. I didn't have enough caffeine to keep up. Thanks for these great videos.

  • @grahamehorner174
    @grahamehorner174 Рік тому +7

    At the start you say this is a stand alone? It's not it is dependent on the cloud service 😢

  • @MichaelWeeks-ox5ep
    @MichaelWeeks-ox5ep 2 місяці тому

    I really enjoy your videos in the way you present various information, thank you. I am a faithful follower

  • @bassjunias439
    @bassjunias439 Рік тому +1

    I'm asking why the newest version of Esp32 library shouldn't work

  • @shreyasbr5955
    @shreyasbr5955 10 місяців тому +3

    I have issue that is in output where in json data i get only totalbilltime and request id but i won't get the words what i spoke
    Please help me with my issue

    • @OmarMashal1
      @OmarMashal1 10 місяців тому

      did you solve this problem? i have the same issue

    • @karthikn1126
      @karthikn1126 7 місяців тому

      Did you fix it

    • @DropThatGOAL
      @DropThatGOAL 3 місяці тому

      @@karthikn1126 Same here, What should we do?

    • @manfredmichael_3ia097
      @manfredmichael_3ia097 2 місяці тому

      Me too? Any solution?

    • @DropThatGOAL
      @DropThatGOAL 2 місяці тому

      @@manfredmichael_3ia097 Yes, Change the L/R Cable and connect it to the VDD instead :)

  • @fahimi.saiyed8939
    @fahimi.saiyed8939 Рік тому +1

    Hi Sir, Can i use 1 NodeMCU for microphone, and Other ESP32 Wrooms board for Chatgpt etc?

  • @snehachandrashekhar3746
    @snehachandrashekhar3746 7 місяців тому +1

    what if in Arduino IDE ESP32 latest version didn't work , there is no old versions of esp32 available to install

  • @reevestephen9889
    @reevestephen9889 Рік тому +1

    Does the google cloud cut money from later on??

  • @shijilt
    @shijilt Рік тому

    off topic, I want to control a 12V DC LED using ESP 32 and Google Assistant, what are my options ?

  • @0Itsuke
    @0Itsuke 10 місяців тому

    why do I keep getting PK - The pubkey tag or value is invalid (only RSA and EC are supported) while trying to connect ?

  • @atiqa5908
    @atiqa5908 11 місяців тому +1

    Hi, may I know if it is possible to send the captured voices/sentences to any IoT Platform?

  • @icecream-so7cd
    @icecream-so7cd 3 місяці тому

    It shows the error nested too deeply is there any solution for it ?

  • @Monsieur.Nobody.
    @Monsieur.Nobody. 5 місяців тому

    G'day I am wondering if it can do Speech to Text, and how small can it really get? I saw the XIAO Esp32 being tiny.
    I have a project that requires wireless, real-time Speech to Text capabilities, and 4 programmable buttons on it... in the smallest form factor possible... Max one index finger size.
    Do you reckon it can be done with Esp32's or am I better off looking somewhere else?
    what would you suggest?

  • @robrita
    @robrita Рік тому

    HOW ABOUT STT VIA WEB PAGE. SO YOU CONVERT SPEECH FROM A WEBPAGE HOSTED IN ESP32.
    GOOGLE API CAN BE DONE ON WEBPAGE AS WELL SO THAT SPEECH TIME IS NOT LIMITED

  • @Dazza_Doo
    @Dazza_Doo Рік тому

    Oh this is Crazy, Speech to Text is really hard, especially if you make it Stand alone (aka no internet). Altium has amazing Free Videos, taught me how Electricity really flows, thanks Rick Hartley.
    I'm glad you have a sponsor, sorry can't fork out 1/5 of my annual income for 1 the software

  • @uayshady
    @uayshady Рік тому +1

    hi, is there a way not to press the RS button to restart the voice? I want to be like "Ok, google" and the ESP32 will listen and convert it to text

  • @ebrahemkhalifa3675
    @ebrahemkhalifa3675 7 місяців тому

    Hey Sachin, how can i increase the time of record into 5 second?

  • @gams628
    @gams628 4 місяці тому +1

    the connection failed, how can I fix it ?

  • @apurbakhanra2464
    @apurbakhanra2464 Рік тому

    ❤the king of home automation love you

  • @rudawabdulrahman1010
    @rudawabdulrahman1010 Рік тому +1

    I’m waiting for it, thank you

  • @fitraanantioramadhani8695
    @fitraanantioramadhani8695 11 місяців тому +1

    hello sachin! I have problem about JSON API, my code don't answer this, do you have solved?

    • @techiesms
      @techiesms  11 місяців тому

      Please let me know your problem with a bit detail

    • @fitraanantioramadhani8695
      @fitraanantioramadhani8695 11 місяців тому +1

      @@techiesms My problem is that the Speech to Text JSON file is not read or the Speech to Text output does not work. So when I tried to speak, there was no answer. What do you think?

    • @oantuannhat6816
      @oantuannhat6816 11 місяців тому

      i have a same problem? Do you fix it? tks@@fitraanantioramadhani8695

    • @statusguru3412
      @statusguru3412 11 місяців тому +1

      ​@@fitraanantioramadhani8695 same problem what should I do ?

    • @fitraanantioramadhani8695
      @fitraanantioramadhani8695 10 місяців тому

      ​@@statusguru3412i can't solve too

  • @shlokrp
    @shlokrp Рік тому +1

    Excited for the next video!!

  • @waynesouth7129
    @waynesouth7129 Рік тому

    Can you make a server using lora so that chatgpt ( current project )can connect to the internet so we can use voice to text / chatgbt text to voice

  • @brijrajkacha9839
    @brijrajkacha9839 7 місяців тому

    In file included from c:\Users\brkac\OneDrive\Documents\Arduino\libraries\ESP32-audioI2S-master\src\Audio.cpp:15:0:
    c:\Users\brkac\OneDrive\Documents\Arduino\libraries\ESP32-audioI2S-master\src\opus_decoder/opus_decoder.h:7:20: fatal error: memory.h: No such file or directory
    compilation terminated.
    exit status 1

  • @yashashwiwahie4014
    @yashashwiwahie4014 5 місяців тому

    Uploaded the code successfully but on running it shows the following error:
    E (533) I2S: i2s_check_set_mclk(253): ESP32 only support to set GPIO0/GPIO1/GPIO3 as mclk signal, error GPIO number:1073423152
    E (534) I2S: i2s_set_pin(314): mclk config failed.
    please help

  • @nakuffy8848
    @nakuffy8848 4 місяці тому

    when i run this code it says:C:\Users\home\Documents\Arduino\sketch_stem\Speech_To_Text_ESP32.ino:2:31: fatal error: CloudSpeechClient.h: No such file or directory
    compilation terminated.
    exit status 1
    Compilation error: CloudSpeechClient.h: No such file or directory
    what should i do to fix this

  • @patrickrios99
    @patrickrios99 10 місяців тому

    Hello, I have problems installing the librery, help!

  • @NURSYAHIRUNNIZABINNURDZAINUDDI
    @NURSYAHIRUNNIZABINNURDZAINUDDI 10 місяців тому

    do have the video ESP 32 capture photo able to turn on led and have save data on adafruit

  • @nehalgupta1616
    @nehalgupta1616 Місяць тому

    bro it is not working because the libaries are updatd and i can t put

    • @techiesms
      @techiesms  Місяць тому

      Just wait for tomorrow’s video
      It will be a great update for STT

  • @coroy6377
    @coroy6377 8 місяців тому

    How can i extract the text and put it in the firebase?

  • @lyricaladda8522
    @lyricaladda8522 10 місяців тому

    Where is the description bro 😑

  • @reevestephen9889
    @reevestephen9889 Рік тому

    Will google cloud cut money from the bank account later??

  • @NURSYAHIRUNNIZABINNURDZAINUDDI
    @NURSYAHIRUNNIZABINNURDZAINUDDI 10 місяців тому

    hye i am going to make the project that using ESP 32 that connect to adafruit ...

  • @imnaveen.r.s5893
    @imnaveen.r.s5893 5 місяців тому

    Hi guys I have a doubt, what if the 90 days google cloud services gets over and the amount will be auto debited ? Or we just provide the card details to get the $300 ?

    • @manfredmichael_3ia097
      @manfredmichael_3ia097 2 місяці тому

      Dont worry its 300 free credit, and you wont be charged after 90 days, it just stops. Me and my friends have tried it

  • @sadamhussain816
    @sadamhussain816 Рік тому

    Is Speech to text charge while free trial??

  • @saydiy1528
    @saydiy1528 Рік тому

    good job. did you try the offline module?

  • @lyricsgo6873
    @lyricsgo6873 8 місяців тому

    I am a student trying to learn these kind of stuff. these are a lot of codes, should i upload the code one by one or at the same time in one code?

    • @sparshsinghg
      @sparshsinghg 8 місяців тому +1

      You just need to upload .ino code, the rest of the code will automatically be included as they are separate header files included in main .ino code. You just need to make sure all of these codes are in same directory.

  • @nikhilxkashyap
    @nikhilxkashyap Рік тому

    Bro, This project is not working I have tried 3-4 times but not able to convert speech to text

  • @limweekian4146
    @limweekian4146 Місяць тому

    can i have it as an audio output to my earpiece ?

    • @AyomidePopoola-kr7qz
      @AyomidePopoola-kr7qz 13 днів тому

      Maybe if you configure the code and hardware with a module that has an audio jack on it.

  • @nitishjoshi3
    @nitishjoshi3 4 місяці тому

    I am not able to get the transcribe using the code deployed on the board , after recording it is giving mein Json without transcribed data into it , Please help for the same if anyone faced the similar issue

    • @DropThatGOAL
      @DropThatGOAL 3 місяці тому

      I have same problem, did u fix it?

    • @nitishjoshi3
      @nitishjoshi3 3 місяці тому

      @@DropThatGOAL no its buggy , there is some problem with the library. I build the code on python and deployed it on raspberry pi

  • @betty_like_butarrr6247
    @betty_like_butarrr6247 11 місяців тому

    In my output i am not getting any results promt only totalbilled time. Can u help me??

    • @OmarMashal1
      @OmarMashal1 10 місяців тому +1

      did you solve this problem? i have the same issue

  • @ValirAmaril
    @ValirAmaril 9 місяців тому

    esp32 can run chatgpt? thats amazing!

  • @exploreworld366
    @exploreworld366 Рік тому

    We can use vosk api

  • @arijitroy9512
    @arijitroy9512 Рік тому +3

    Hi brother 👋 how r U ?

    • @techiesms
      @techiesms  Рік тому +1

      All good
      Hope you enjoyed the video…

  • @reevestephen9889
    @reevestephen9889 Рік тому

    I can’t downgrade the Esp32 to 1.0.6

  • @tinemaris6343
    @tinemaris6343 5 місяців тому

    The program can't do anything with the json data

  • @brijrajkacha9839
    @brijrajkacha9839 7 місяців тому

    [HTTPS] GET... failed, error: connection refused
    how to solve this? ( I have tested chatgpt api in postman, it is working there but not in code)

    • @raghujalnila120
      @raghujalnila120 6 місяців тому

      try with some other model of the open ai as text-davinci-003 is no more supported by open ai

    • @karthikn1126
      @karthikn1126 4 місяці тому

      Did you solve it bro

  • @adityanathpandey4060
    @adityanathpandey4060 Рік тому +3

    Please integrate it with chat gpt

    • @Vishal-pm3vv
      @Vishal-pm3vv Рік тому

      I can do that but how much you are ready to pay for it?

    • @techiesms
      @techiesms  Рік тому +1

      Coming soon on our channel….

  • @Sysshad
    @Sysshad 6 місяців тому

    Is it possible to get voice recog on esp32 without being connected to online services?

    • @josephgeorge3204
      @josephgeorge3204 2 дні тому

      No :)

    • @Sysshad
      @Sysshad 2 дні тому

      ​@@josephgeorge3204elechouse have done it with a simple IC, Google it. So why shouldn't it be possible on the more advanced ESP32? in the 1990s people did it with basic stamps with some extra IC..

  • @rudawabdulrahman1010
    @rudawabdulrahman1010 Рік тому +2

    Speech to text- send to AI- the Text to speech

  • @firdoshdiakus4841
    @firdoshdiakus4841 Рік тому

    can't wait awesome project

  • @d.trubre5216
    @d.trubre5216 3 місяці тому

    Ty for the video

  • @drelectronics13
    @drelectronics13 Рік тому

    Btw why don't u use new ide

  • @adityanathpandey4060
    @adityanathpandey4060 Рік тому +2

    Wow 😳

  • @1337sh337
    @1337sh337 8 місяців тому

    Why not use OpenAI whisper?

    • @techiesms
      @techiesms  8 місяців тому

      Can you share some links

  • @SkyExperiment.
    @SkyExperiment. 9 місяців тому

    can you make this video in hindi , i need this information ,

  • @architlal
    @architlal Рік тому

    Hey Sachin, I tried this out but it is only giving me the request id and total billed time as output. Their is no "result" output that I can see in the json data. Please help me out. Thank you.

    • @bakershub6171
      @bakershub6171 Рік тому

      I'm having the same problem too, please could you help @Sachin

    • @Nocturnal3D
      @Nocturnal3D Рік тому +3

      Try wiring the jumper wire from l/r to VDD rather than GND. I think the wiring schematic in this video is incorrect. In a later project the schematic is wired differently. I swapped the l/r to GND(black-) around to l/r to VDD(red+) to match the later project (The virtual assistant I think) and it worked fine after that.

    • @sadamhussain816
      @sadamhussain816 Рік тому

      Solved???

    • @sadamhussain816
      @sadamhussain816 Рік тому

      Please tell me

    • @bakershub6171
      @bakershub6171 Рік тому +3

      @@Nocturnal3D Thank you very much it worked, I really appreciate it 🙏

  • @idharalfarib5674
    @idharalfarib5674 Рік тому

    Good job brader.

  • @nagarava
    @nagarava Рік тому

    How to use a condenser mic

    • @techiesms
      @techiesms  Рік тому +1

      You need to use MEMS microphone only like this - www.techiesms.com/product/inmp441-microphone-module-i2s/

  • @peemhyyr_1386
    @peemhyyr_1386 9 місяців тому

    Hi Json dont answer the words I speak, How can I fix this ?

    • @peemhyyr_1386
      @peemhyyr_1386 9 місяців тому

      Okie i fix with wire jumper L/R to VDD and it works thx Nocturnal3d's comment

    • @karthikn1126
      @karthikn1126 7 місяців тому

      Can u please explain

    • @peemhyyr_1386
      @peemhyyr_1386 7 місяців тому

      i mean I try to write response and keyword on json.file and it doesnt show the words i speak @@karthikn1126

    • @peemhyyr_1386
      @peemhyyr_1386 7 місяців тому

      so I trried to wire jumper L/R to vdd to ground and it works so fine@@karthikn1126

  • @abhisvibetech6121
    @abhisvibetech6121 Рік тому

    They were asking 1000rs not 2rs 😢

  • @savanladva1197
    @savanladva1197 Рік тому

    please make videos using esp ide

  • @mahsanghufran
    @mahsanghufran Рік тому

    Amazing hack 👏 ❤

  • @TheTurmanDreams
    @TheTurmanDreams 10 місяців тому

    great !!!!

  • @9e92iqiwdiw89xls
    @9e92iqiwdiw89xls 3 місяці тому +2

    Connection failed😢

    • @ajuxx
      @ajuxx 2 місяці тому +1

      Same

    • @Hybroid
      @Hybroid 2 місяці тому

      Me too, is google the problem but i dont knlw how to solve it

    • @StevFoz
      @StevFoz 2 місяці тому

      its solved or no?

    • @Hybroid
      @Hybroid 2 місяці тому

      @@StevFoz no

    • @josephgeorge3204
      @josephgeorge3204 2 дні тому

      did u rectify this ?

  • @firedot69
    @firedot69 Рік тому

    Love you bro ❤

  • @drelectronics13
    @drelectronics13 Рік тому

    Acchi video hai

  • @ramavtarram818
    @ramavtarram818 Рік тому

    Is Prem Vivah mein Chandigarh mein sab koi aaega apna Ghar devasi ka idhar hai apna Bharat Desh ka sab rahega

  • @drelectronics13
    @drelectronics13 Рік тому

    Hello Sachin

    • @techiesms
      @techiesms  Рік тому

      Hello buddy
      Do share the video if you like it ✌️

  • @drelectronics13
    @drelectronics13 Рік тому

    Hi sachin

    • @techiesms
      @techiesms  Рік тому

      Hi DrElectronics
      Hope you enjoyed the video

  • @drelectronics13
    @drelectronics13 Рік тому

    Kese ho sachin

    • @techiesms
      @techiesms  Рік тому

      Me badhia Hu
      Video share jarur krna

  • @SkyExperiment.
    @SkyExperiment. 9 місяців тому

    please

  • @SeattleShelby
    @SeattleShelby 4 місяці тому

    I didn’t understand a single word you said even though I think it was supposed to be English.

  • @MantuSharma-l8j
    @MantuSharma-l8j 8 місяців тому +1

    my output.......speech to text not show.....please help
    Record start!
    Recording Completed. Now Processing...
    My Answer - HTTP/1.1 200 OK
    Content-Type: application/json; charset=UTF-8
    Vary: X-Origin
    Vary: Referer
    Date: Tue, 30 Jan 2024 18:30:24 GMT
    Server: ESF
    Cache-Control: private
    X-XSS-Protection: 0
    X-Frame-Options: SAMEORIGIN
    X-Content-Type-Options: nosniff
    Alt-Svc: h3=":443"; ma=2592000,h3-29=":443"; ma=2592000
    Accept-Ranges: none
    Vary: Origin,Accept-Encoding
    Transfer-Encoding: chunked
    44
    {
    "totalBilledTime": "3s",
    "requestId": "8554249765596014235"
    }
    0
    394
    Json daata--{
    "totalBilledTime": "3s",
    "requestId": "8554249765596014235"
    }
    0

    • @karthikn1126
      @karthikn1126 7 місяців тому

      Bro did you solve it?

    • @karthikn1126
      @karthikn1126 7 місяців тому

      Did solve it

    • @ebrahemkhalifa3675
      @ebrahemkhalifa3675 7 місяців тому

      @@karthikn1126
      I solved it
      your wiring may be the problem

    • @ebrahemkhalifa3675
      @ebrahemkhalifa3675 7 місяців тому

      @@karthikn1126
      try to connect L\R pin to vcc instead of Gnd

    • @MinhNguyen-fw5dv
      @MinhNguyen-fw5dv 7 місяців тому

      @@ebrahemkhalifa3675 what's wrong with you😑

  • @yjsanty4461
    @yjsanty4461 Рік тому

    bro i am at the edge of my project and need some assistance , i am receiving an error
    error: macro "pgm_read_byte" passed 4 arguments, but takes just 1
    return pgm_read_byte(I2S_NUM_0, (char *)data, numData, portMAX_DELAY);
    /tmp/3378378928/Speech_To_Text_ESP32/I2S.cpp: In member function 'int I2S::Read(char*, int)':
    /tmp/3378378928/Speech_To_Text_ESP32/I2S.cpp:58:10: error: 'pgm_read_byte' was not declared in this scope
    return pgm_read_byte(I2S_NUM_0, (char *)data, numData, portMAX_DELAY);

  • @purpleboba8
    @purpleboba8 Рік тому

    Hey! This is regarding sim800l with esp8266. My nodemcu board resets whenever I send message via sms using the gsm module. Do you know a reason for this?🥲