logoalt Hacker News

khalictoday at 12:40 PM0 repliesview on HN

I don't know, I'm implementing a translation system right now, and Apertus is very good for the model size. I wished they added some chain of thought training to increase precision and context understanding.