logoalt Hacker News

d4rkp4tternyesterday at 1:07 PM1 replyview on HN

I use the open source Handy [1] app with Parakeet V3 for STT when talking to coding agents and I’ve yet to see anything that beats this setup in terms of speed/accuracy. I get near instant transcription, and the slight accuracy drop is immaterial when talking to AIs that can “read between the lines”.

I tried incorporating this Voxtral C implementation into Handy but got very slow transcriptions on my M1 Max MacBook 64GB.

[1] https://github.com/cjpais/Handy

I’ll have to try the other implementations mentioned here.


Replies

thethimbleyesterday at 7:41 PM

Handy is great but I wish the STT was realtime instead of batch