logoalt Hacker News

flumes_whims_today at 1:07 PM0 repliesview on HN

The overhead shrinks with larger models. It doesn't seem that bad.

https://arxiv.org/pdf/2409.03992v2