Create a Transcription Service with Make.com and OpenAI's Whisper API

  • Published 25 Nov 2024

COMMENTS • 28

  • @automate_all_the_things
    @automate_all_the_things  10 months ago +2

    If you found my video helpful, maybe also check out this one by Chais Meyer. In his automation, he creates a mini blog post and a LinkedIn post from his transcription: ua-cam.com/video/eRJSXiXv8oE/v-deo.html

  • @unlimited_leads
    @unlimited_leads 11 months ago +3

    I was running against a wall because I didn't download the file in the 2nd step. Appreciate your content - works like a charm now.

    • @automate_all_the_things
      @automate_all_the_things  11 months ago +2

      Yes, that took me some time to figure out as well 😃 Glad you found this useful!

  • @skin.pharma
    @skin.pharma 10 months ago +1

    Thanks for the video, Philipp 😊👍
    Based on your more recent video about string splitting, if you use 0CodeKit, you could produce a transcription service more cost-effectively by using asynchronous Python code.
    OpenAI developed Whisper as open-source software, so it can be used from a Python library within your own code and without charge.
    It is also possible to split audio files larger than 25 MB into smaller pieces and concatenate the individual transcriptions 👍
    Viv

    • @automate_all_the_things
      @automate_all_the_things  10 months ago +1

      Thanks for this great comment, Viv! It's certainly more cost-effective to run the Whisper transcription locally. And a Python script to do this wouldn't be terribly complicated. With my channel, I am trying to show ways to do this with no code at all, to make it accessible to more people. I'm constantly impressed by what is possible with tools like Make for non-programmers.
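
The split-and-concatenate idea from this exchange could look roughly like the sketch below. It is not anyone's actual implementation: `plan_chunks`, `transcribe_in_chunks`, and the `transcribe_chunk` callback are hypothetical names, the chunk plan assumes a roughly constant bitrate, and the 10% safety margin is an arbitrary choice. The callback stands in for whatever does the real work, for example cutting the audio and running a locally installed Whisper model.

```python
import math
from typing import Callable, List, Tuple

# 25 MB is the Whisper API limit mentioned in this thread.
MAX_BYTES = 25 * 1024 * 1024


def plan_chunks(duration_s: float, file_bytes: int,
                max_bytes: int = MAX_BYTES,
                safety: float = 0.9) -> List[Tuple[float, float]]:
    """Return (start, end) times in seconds for equal-length chunks.

    Assumption: bitrate is roughly constant, so size scales with duration.
    The safety factor keeps each chunk a little below the limit.
    """
    if duration_s <= 0 or file_bytes <= 0:
        raise ValueError("duration_s and file_bytes must be positive")
    n = max(1, math.ceil(file_bytes / (max_bytes * safety)))
    step = duration_s / n
    return [(i * step, min((i + 1) * step, duration_s)) for i in range(n)]


def transcribe_in_chunks(chunks: List[Tuple[float, float]],
                         transcribe_chunk: Callable[[float, float], str]) -> str:
    """Transcribe each chunk in order and join the pieces into one text.

    transcribe_chunk is a hypothetical callback: it receives the start and
    end time of a chunk and returns that chunk's transcript.
    """
    parts = [transcribe_chunk(start, end).strip() for start, end in chunks]
    return " ".join(p for p in parts if p)
```

Cutting at fixed times can split a word in half, so a real pipeline might prefer to cut at silences; that refinement is left out here.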

  • @saulosoares5636
    @saulosoares5636 9 months ago +2

    Great video!!! Thanks a lot!

  • @ricardotaipe
    @ricardotaipe 6 months ago

    You are amazing. Great tut, perfectly explained!

  • @In2DeepDaDeejay
    @In2DeepDaDeejay 5 months ago

    Hey Philipp! Love your content. Could you make a similar flow that uses Whisper, takes the transcript, creates headings for the main points, and creates variables to be captured in a Google Sheet? That would be great!

  • @chrisder1814
    @chrisder1814 4 months ago

    Hello, could you help me understand what result I could get if I add the Jungle Scout API for Amazon sellers to my scenario?

  • @JeffHarrison1960
    @JeffHarrison1960 9 months ago +1

    Philipp, thank you for your tutorials. I find them extremely helpful, as I am just getting started with Make and Bubble. Curious if there is a rapid API that could be inserted before the audio file hits Whisper to cut it into smaller pieces (25 MB) and then reassemble the output as it comes out of Whisper, so one has only one transcription and one summary per audio file?

    • @automate_all_the_things
      @automate_all_the_things  9 months ago +2

      Thank you, very glad to hear you found this helpful! If the 25MB filesize limit is a problem, you could use the "Cut Audio File" action from 1001fx to create smaller audio files. It's built into Make. If your source audio file is a giant .wav format, you could also try to convert it to a smaller .mp3 file with a CloudConvert "Convert a File" action.

    • @DrivenSpice
      @DrivenSpice 8 months ago

      @automate_all_the_things That's super helpful! I had the same questions. I was trying to transcribe a Zoom recording and kept getting a file size error. Grateful for the content you created. Would you consider showing the process with a third-party service that cuts the file easily in a Make automation? Many of the tutorials only show transcription with short files.

    • @DrivenSpice
      @DrivenSpice 8 months ago

      Also, will cutting the file affect the SRT file?
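
On the SRT question: if each piece is transcribed separately, the timestamps in each piece's SRT start again from zero, so the pieces need to be shifted and renumbered before they are joined. A minimal sketch of that step, assuming you know the start time of every piece in the original recording (`merge_srt` and its helper are hypothetical names, not part of Make or Whisper):

```python
import re
from typing import List, Tuple

# Matches SRT timestamps such as 00:01:02,345
_TS = re.compile(r"(\d{2}):(\d{2}):(\d{2}),(\d{3})")


def _shift(match: re.Match, offset_ms: int) -> str:
    """Add offset_ms to one SRT timestamp and format it again."""
    h, m, s, ms = (int(g) for g in match.groups())
    total = ((h * 60 + m) * 60 + s) * 1000 + ms + offset_ms
    h, rem = divmod(total, 3_600_000)
    m, rem = divmod(rem, 60_000)
    s, ms = divmod(rem, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def merge_srt(chunks: List[Tuple[str, float]]) -> str:
    """Merge per-piece SRT texts into a single SRT.

    chunks: (srt_text, piece_start_seconds) pairs in playback order.
    Cue numbers are reassigned so they count up across all pieces.
    """
    out = []
    counter = 1
    for srt_text, start_s in chunks:
        offset_ms = round(start_s * 1000)
        for block in re.split(r"\n\s*\n", srt_text.strip()):
            lines = block.strip().splitlines()
            if len(lines) < 2:
                continue  # skip empty or malformed cues
            timing = _TS.sub(lambda m: _shift(m, offset_ms), lines[1])
            out.append("\n".join([str(counter), timing, *lines[2:]]))
            counter += 1
    return "\n\n".join(out) + "\n"
```

For example, `merge_srt([(first_srt, 0), (second_srt, 600)])` would treat the second piece as starting ten minutes into the recording.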

  • @chrisder1814
    @chrisder1814 3 months ago +1

    Hello, I had some ideas for automation. Can you tell me what you think of them?

    • @automate_all_the_things
      @automate_all_the_things  3 months ago

      Sure, let's hear them :) You can also reach me via email: philipp@lazy-android.com

  • @AIEntusiast_
    @AIEntusiast_ 8 months ago +1

    Very nice, but when I run this again with a different file in the same format as the first successful test, it fails with an "invalid format" error. Any suggestions?

    • @automate_all_the_things
      @automate_all_the_things  8 months ago +1

      Hmm, maybe the second file is too big? Whisper can only handle files up to 25 MB in size.

    • @AIEntusiast_
      @AIEntusiast_ 8 months ago

      @automate_all_the_things It's smaller and in the same format. For some reason, after I deleted the first test file and the Google Doc file, it worked with a new file.

  • @SebastianSchulz-f5n
    @SebastianSchulz-f5n 5 months ago

    Dear Philipp, thank you very much for the workflow. In the video you pointed out yourself that the limit for transcribable files is 25 MB. Files are often larger than that, though. Could you please show how to integrate additional modules into the workflow so that larger files can also be turned into a usable summary? Many thanks in advance!

  • @jimcrow2000
    @jimcrow2000 9 months ago

    Hey, great video! Would it work with a video added to Google Drive, or would we need to extract the audio first? Thanks.

    • @automate_all_the_things
      @automate_all_the_things  9 months ago

      The Whisper API can only handle these file formats (mp3, mp4, mpeg, mpga, m4a, wav, and webm) so you'd have to convert a video file, or detach the audio. There is a service called Transloadit integrated with Make. They have a free tier and offer a "Converts a video to WebM" action. Maybe you could use that to convert your video to webm and then send that to Whisper.
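
The two constraints that come up repeatedly in these comments (the list of accepted formats above and the 25 MB size limit) can be checked before a file is sent anywhere. A small sketch, assuming the file is available on local disk; `check_upload` is a hypothetical helper, and the extension check only looks at the file name, not at the actual audio encoding:

```python
import os
from typing import List

# Formats listed in the reply above.
ALLOWED_EXTENSIONS = {"mp3", "mp4", "mpeg", "mpga", "m4a", "wav", "webm"}
# 25 MB limit as mentioned in this thread.
MAX_BYTES = 25 * 1024 * 1024


def check_upload(path: str, max_bytes: int = MAX_BYTES) -> List[str]:
    """Return a list of problems; an empty list means the file looks fine."""
    problems = []
    ext = os.path.splitext(path)[1].lstrip(".").lower()
    if ext not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported extension: {ext or '(none)'}")
    size = os.path.getsize(path)
    if size > max_bytes:
        problems.append(f"file is {size} bytes, limit is {max_bytes}")
    return problems
```

A file that passes both checks can still be rejected for other reasons, so this is only a first filter.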

  • @redantegabanan1078
    @redantegabanan1078 8 months ago

    The transcription does not include timestamps when using the Whisper module. What's the workaround for this?
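
One possible workaround, sketched under assumptions: OpenAI's transcription endpoint can return segment-level output (response formats such as `srt`, `vtt`, and `verbose_json`), but whether the Make Whisper module exposes that option is not confirmed in this thread, so it may require calling the API through a generic HTTP module. The sketch assumes segments shaped like those in a `verbose_json` response, each with `start` (seconds) and `text`; the function names are hypothetical.

```python
from typing import Dict, Iterable, List


def format_timestamp(seconds: float) -> str:
    """Format a number of seconds as HH:MM:SS (fractions are dropped)."""
    total = int(seconds)
    h, rem = divmod(total, 3600)
    m, s = divmod(rem, 60)
    return f"{h:02d}:{m:02d}:{s:02d}"


def timestamped_lines(segments: Iterable[Dict]) -> List[str]:
    """Turn segments into lines like '[00:01:05] some text'.

    Assumption: each segment is a dict with 'start' and 'text' keys.
    """
    return [
        f"[{format_timestamp(seg['start'])}] {seg['text'].strip()}"
        for seg in segments
    ]
```

Joining the result with newlines gives a transcript that can be pasted into a document with a timestamp at the start of every segment.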

  • @MiniDolarB3
    @MiniDolarB3 11 months ago +1

    Make isn't good for large audio files (> 25 MB, the Whisper limit) or long text inputs. There's no chunking option.

    • @automate_all_the_things
      @automate_all_the_things  11 months ago +1

      Yes, the 25 MB file size limit of the Whisper API can become a problem. You could add a "Cut Audio" module from 1001fx into the scenario. Or convert a large audio into a smaller file format with a CloudConvert module. Is Zapier or some other option better at handling large audio files?

    • @MiniDolarB3
      @MiniDolarB3 11 months ago

      @automate_all_the_things Large text is also an issue, since the API can't process it in one go. It needs to be chunked and then combined.
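
The chunk-and-combine step for long text mentioned here might be sketched as follows. The 4000-character default is an arbitrary placeholder rather than a documented API limit, and `chunk_text` and `process_in_chunks` are hypothetical names; `process` stands in for whatever is applied to each chunk, such as a summarization call.

```python
import re
from typing import Callable, List


def chunk_text(text: str, max_chars: int = 4000) -> List[str]:
    """Split text into chunks of at most max_chars, preferring sentence ends."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        # A single sentence longer than the limit is cut at the limit.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


def process_in_chunks(text: str, process: Callable[[str], str],
                      max_chars: int = 4000) -> str:
    """Apply process to every chunk and join the results with blank lines."""
    return "\n\n".join(process(c) for c in chunk_text(text, max_chars))
```

Limits on text input are usually counted in tokens rather than characters, so a character budget like this is only a rough stand-in.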

    • @hownowLLC
      @hownowLLC 7 months ago

      @automate_all_the_things Respell has a good template for splitting then recombining the audio files for that :)