Hacker News

smy20011, yesterday at 5:26 PM

It's interesting to see eval sets becoming more and more expensive. Previously we just needed to evaluate on one test set; now we need to create a lot of diffs and run a lot of tests.