AlotOfReading • yesterday at 4:58 PM • 1 reply

What training data? Many of these languages have very little digitized literature. Even if we assume they have sizeable extant corpora (e.g. Tibetic/Bhoti), that's not enough. LLMs are still pretty garbage at English prose, for example, and English has far more training data than any of these languages.

Replies

general_reveal • yesterday at 5:21 PM

!Remind me in 1 year (certainly less than 5).
