Phonix: Generate captions with the power of OpenAI's Whisper API

Поділитися
Вставка
  • Опубліковано 7 тра 2023
  • Phonix is a Python program that uses OpenAI's API to generate captions for videos.
    It uses the Whisper model, an automatic speech recognition system that can turn audio into text and potentially translate it too. Compared to other solutions, it has the advantage that its transcription can be "enhanced" by the user providing prompts that indicate the "domain" of the video. This means you may get better results if you use technical terms, acronyms and jargon.
    GitHub repository: github.com/platisd/phonix
  • Наука та технологія

КОМЕНТАРІ • 1