logoalt Hacker News

b--ltoday at 12:31 AM1 replyview on HN

grok-4.1-fast is the the number 2 model on this benchmark.

~~If you've used this model in real life to do any sort of programming, and have seen its output, you would know that there is something VERY wrong with your benchmark.~~

Edit: Oh sorry, I looked at the questions, I see this is also for SQL specifically. Interesting. Maybe they tuned that grok model for SQL. Cool site. I bookmarked it.


Replies

nltoday at 12:59 AM

Yeah, multi-step SQL generation and debugging.

Some models surprised me and Grok Fast was one of them. It is consistently good at this task though!