logoalt Hacker News

mariano54last Thursday at 6:20 PM1 replyview on HN

Yes we do have this issue, but it's improved a bit over chatgpt due to using multiple transcribers.

The models are improving though, and they are at a very good place for English at the moment. I expect by next year we will switch over to full voice to voice models.


Replies

harleslast Thursday at 6:28 PM

This reply seems to miss the question, or at least doesn’t answer it clearly. Is this service overly tolerant of mispronunciations? Foundational models are becoming more tolerant, not less, over time which is the opposite of what I’d want in this case.

show 1 reply