Is there anything truly low latency(sub 100ms)? Speech recognition is so cool but I want it to be low latency.
On macbook pro - parakeet.cpp is very low latency, under 100ms (76ms) for 60s audio.
Parakeet does streaming I think, so if you throw enough compute at it, it should be. The closest competitor is whisper v3 which is relatively slow, maybe Voxtral but it's still very new.
Agree about the latency requirement.
There's https://kyutai.org/stt, which is very low latency. But it seems not as hackable.