logoalt Hacker News

doctobogganlast Thursday at 6:24 PM3 repliesview on HN

This seems like another "better vibes" release. With the number of benchmarks exploding, random luck means you can almost always find a couple showing what you want to show. I didn't see much concrete evidence this was noticeably better than 5.1 (or even 5.0).

Being a point release though I guess that's fair. I suspect there is also some decent optimizations on the backend that make it cheaper and faster for OpenAI to run, and those are the real reasons they want us to use it.


Replies

sebzim4500last Thursday at 10:38 PM

>I suspect there is also some decent optimizations on the backend that make it cheaper and faster for OpenAI to run, and those are the real reasons they want us to use it.

I doubt it, given it is more expensive than the old model.

rat9988last Thursday at 6:31 PM

> I didn't see much concrete evidence this was noticeably better than 5.1

Did you test it?

show 1 reply
BrtBytelast Friday at 3:50 PM

At this point the benchmark soup is so dense that it's hard to tell signal from selective framing