Hacker News

ricardobayes, last Monday at 8:14 PM

I'd say give it some time for the dust to settle. This field badly needs standardized benchmarks before the conversation about model quality can even start.