I wonder if you could use the same technique (RAM models as ROM) for something like Whisper Speech-t...

briansm • yesterday at 10:20 AM • 1 reply • view on HN

I wonder if you could use the same technique (RAM models as ROM) for something like Whisper Speech-to-text, where the models are much smaller (around a Gigabyte) for a super-efficient single-chip speech recognition solution with tons of context knowledge.

Replies

JLO64 • yesterday at 4:22 PM

Right now I have to wait 10 minutes at a time for the 2+ hour long transcriptions I've uploaded to Voxstral to process. The speed up here could be immense and worthwhile to so many customers of these products.

alt Hacker News

Replies