Whisper transcription. #2567

TheStalke · 2025-03-31T12:51:12Z

Hello, I'm trying to transcribe a video for subtitles using Whisper by OpenAI, but there's a song at the start of the video and my .srt file starts at "00:00:00,000" even though no one is talking. For example, the first subtitle runs from 0 to 13 seconds, but people only start talking at 8 seconds. Is there a way to make the subtitles automatically start at the moment people start talking in the video?

jonathgh · 2025-03-31T14:11:25Z

Hi @TheStalke, we managed to do this with post-processing, because the timestamps that Whisper produces aren't that accurate. We arrived at a solution that works pretty well: we use VAD (voice activity detection) to detect when a speaker is actually speaking, then automatically trim the segments to match:

Before:

After:

Here's the link if you want to try it out: Automatic Segment Trimming
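
For readers who want to see the general idea in code, here is a minimal sketch of VAD-based trimming. It is not the implementation behind the link above. It assumes you already have speech regions as (start, end) pairs in seconds from some VAD (for example Silero VAD or webrtcvad), and the names `Segment`, `trim_segments`, and `srt_time` are hypothetical helpers made up for this example.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds
    end: float    # seconds
    text: str


def trim_segments(segments, speech_regions):
    """Clamp each segment to the detected speech it overlaps.

    speech_regions: iterable of (start, end) pairs in seconds, assumed to
    come from a separate VAD pass over the same audio.
    """
    regions = sorted(speech_regions)
    trimmed = []
    for seg in segments:
        # Keep only the parts of each speech region that fall inside the segment.
        overlaps = [
            (max(seg.start, s), min(seg.end, e))
            for s, e in regions
            if s < seg.end and e > seg.start
        ]
        if not overlaps:
            # No speech detected for this segment; leave it untouched.
            trimmed.append(seg)
            continue
        # New start is the first detected speech, new end is the last.
        trimmed.append(Segment(overlaps[0][0], overlaps[-1][1], seg.text))
    return trimmed


def srt_time(t):
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(t * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02}:{m:02}:{s:02},{ms:03}"
```

With the case from the question, a segment spanning 0 to 13 seconds and a single speech region of (8.0, 13.0) would be trimmed to start at 8 seconds. Segments with no detected speech are deliberately left as they are, so a missed detection does not drop a subtitle.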

1 reply

TheStalke Mar 31, 2025
Author

Thank you 🌹I will try it out

misutoneko · 2025-03-31T14:40:38Z

Yes, VAD usually helps. Other than that, you could try --word_timestamps True.
There are other parameters that can be tuned (--logprob_threshold, --compression_ratio_threshold, etc.), but their usage isn't very straightforward.
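
The same option is available from Python as `word_timestamps=True` on `model.transcribe`. Below is a rough sketch, assuming the `openai-whisper` package is installed. `retime_to_words` and `transcribe_with_words` are hypothetical helper names for this example. Whisper may already tighten segment boundaries when word timestamps are enabled, so the helper only makes that step explicit.

```python
def retime_to_words(segments):
    """Set each segment's start/end to its first/last word timestamps.

    Expects segment dicts shaped like Whisper's output with word timestamps
    enabled: each has "start", "end", "text" and a "words" list whose items
    carry "start" and "end". Segments without words are returned unchanged.
    """
    retimed = []
    for seg in segments:
        words = seg.get("words") or []
        if words:
            # Copy the dict so the caller's data is not modified.
            seg = {**seg, "start": words[0]["start"], "end": words[-1]["end"]}
        retimed.append(seg)
    return retimed


def transcribe_with_words(path, model_name="base"):
    """Transcribe a file with word-level timestamps and retime the segments."""
    import whisper  # openai-whisper, imported lazily so the helper above works without it

    model = whisper.load_model(model_name)
    result = model.transcribe(path, word_timestamps=True)
    return retime_to_words(result["segments"])
```

Calling `transcribe_with_words("video.mp4")` (the file name is a placeholder) would return segments whose start and end follow the first and last recognized word, which is usually closer to when people actually start talking.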

2 replies

TheStalke Mar 31, 2025
Author

Thank you 🌹I will try it out

TheStalke Apr 1, 2025
Author

It works thank you 🌹
