GaggiX today at 5:43 PM

There is also: https://github.com/linto-ai/whisper-timestamped

It doesn't use an extra model, so it supports every language that works with Whisper out of the box and uses less memory. It works by applying Dynamic Time Warping to the cross-attention weights.
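For anyone unfamiliar with the idea, here is a minimal sketch of Dynamic Time Warping over a token-by-frame attention matrix. It is not whisper-timestamped's actual code: the function names `dtw_path` and `token_frame_spans`, the toy matrix, and the `1 - weight` cost are all assumptions made for illustration, and a real pipeline would also pick attention heads, normalize and smooth the weights, and convert frame indices to seconds.

```python
def dtw_path(cost):
    """Return the cheapest monotonic path through cost[token][frame].

    The path starts at (0, 0), ends at the last cell, and each step moves
    one token forward, one frame forward, or both.
    """
    n, m = len(cost), len(cost[0])
    inf = float("inf")
    # acc[i][j] holds the minimum total cost of a path ending at cell (i-1, j-1).
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best_prev = min(acc[i - 1][j - 1], acc[i - 1][j], acc[i][j - 1])
            acc[i][j] = cost[i - 1][j - 1] + best_prev

    # Walk back from the last cell, always taking the cheapest predecessor.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [
            (acc[i - 1][j - 1], i - 1, j - 1),
            (acc[i - 1][j], i - 1, j),
            (acc[i][j - 1], i, j - 1),
        ]
        _, i, j = min(candidates)
    path.reverse()
    return path


def token_frame_spans(attention):
    """Map each token to the (first_frame, last_frame) it is aligned with.

    Assumption: attention weights lie in [0, 1], so 1 - weight serves as a
    non-negative cost that is low where the token attends strongly.
    """
    cost = [[1.0 - w for w in row] for row in attention]
    spans = {}
    for token, frame in dtw_path(cost):
        first, last = spans.get(token, (frame, frame))
        spans[token] = (min(first, frame), max(last, frame))
    return [spans[t] for t in range(len(attention))]


# Toy example: 3 tokens attending over 5 audio frames (made-up numbers).
attention = [
    [0.9, 0.8, 0.1, 0.0, 0.0],
    [0.1, 0.1, 0.9, 0.1, 0.0],
    [0.0, 0.1, 0.0, 0.9, 0.8],
]
spans = token_frame_spans(attention)
print(spans)  # [(0, 1), (2, 2), (3, 4)]
```

Each tuple is the range of audio frames assigned to one token. Multiplying the frame indices by the frame duration would give approximate timestamps, which is the part that needs no second alignment model.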


Replies

oezi today at 7:35 PM

Just a warning that plain WhisperX is more accurate and Whisper-timestamped has many weird quirks.