Splitting up long chunks of automatically transcribed text

Поділитися
Вставка
  • Опубліковано 26 чер 2024
  • Stream from May 28, 2024 at / sphaerophoria
    00:00 Intro
    05:45 Find split points
    50:45 Show in UI
    01:16:00 Debugging

КОМЕНТАРІ • 12

  • @rygy409
    @rygy409 29 днів тому +11

    A QoL feature might be highlighting the words on the right panel if they are included in the current clips on the timeline.

  • @gastonmaffei
    @gastonmaffei 28 днів тому +1

    one thing I'd add is an LLM powered search bar to search for nuggets in natural language. So something like "show me where I talk about X", and it jumps to that timestamp

  • @RoryDavidWatts
    @RoryDavidWatts 29 днів тому

    This is looking great

  • @CrossbowBeta
    @CrossbowBeta 28 днів тому

    So much fun statistics can be collected with this too. How many words spoken during a stream. Word occurrence histogram. Average word complexity. Then group the words together to identify a set of tags for the video. This is so cool.

  • @jomojomo123
    @jomojomo123 27 днів тому

    "is it a bug or is it just a little bit of a surprise" 😂😂

  • @distortions
    @distortions 29 днів тому +4

    whats up whats up whats up

  • @CrossbowBeta
    @CrossbowBeta 28 днів тому +1

    Next step: Editing the text and the video gets updated.

  • @flobuilds
    @flobuilds 29 днів тому

    I miss the time where we played trackmania on the school computers. It was fun

  • @bart2019
    @bart2019 29 днів тому

    In your place, i'd try to use a text editing widget for the script text and use it in read-only mode. That may detecting which word a user clicked on easier because it's likely built in.
    Also, you need a way to search in the text, again, just like in a text editor.

  • @Dje4321
    @Dje4321 29 днів тому

    Should highlight the current position in the text window to help with debugging to you know exactly what words are at which specific parts in the timelines

  • @gvteja4908
    @gvteja4908 28 днів тому

    Along with the text highlighting which others have mentioned, maybe change the audio based on text edits in the transcript, although this might be a very intense thing to do

  • @deprecated_channel_
    @deprecated_channel_ 29 днів тому +1

    Hi,
    I really like your project and seeing you go through the dev process.
    I think it would be easier to track if the current spoken word would be in bold, or a different color. A bit like karaoke.
    And from what I understand, Whisper already gives you the ranges of each word.
    While playing the video, iterating through the list would be fast.
    And on seek, since the word & time range list is in a sorted order, doing a binary search should be pretty fast.