logoalt Hacker News

antirezlast Wednesday at 6:16 PM2 repliesview on HN

Different representations at the same bitrate may have features that make one a lot more resilient to errors. This thing about Italian, you fill find in any benchmark of vastly different AI transcribing models. You can find similar results also on the way LLMs mostly trained on English generalize usually very well with Italian. All this despite Italian accounting for marginal percentage of the training set. How do you explain that? I always cringe when people refute evidence.


Replies

hollowturtlelast Wednesday at 11:04 PM

> All this despite Italian accounting for marginal percentage of the training set.

Evidence?

testdelacc1last Wednesday at 6:42 PM

Where is this evidence you’ve cited for your claims?