logoalt Hacker News

Ifkaluvatoday at 12:49 AM0 repliesview on HN

You’d think so, but I haven’t seen it explicitly discussed in their papers, and nobody else that I know of trains on that many tokens