logoalt Hacker News

packetlostyesterday at 8:47 PM2 repliesview on HN

Nah, I model hop constantly as I work with serving GLM and Kimi models and they're not nearly as good as Opus 4.5+ and GPT 5.2+ and it's not particularly close. They're good by standards set a generation or two ago, but they're really not competitive with where the frontier models are at now.


Replies

zozbot234yesterday at 8:55 PM

They compete with "mini" or "nano" model classes quite well given the price of inference. You'd need to "model hop" anyway, using Opus for everything is quite wasteful.

show 1 reply
marioptyesterday at 8:56 PM

Guess it really depends on what you use them for. I've been able to built whole apps with them, not slop. Kimi is quite good at design, for 3D, I noticed Gemini 3.1 is excellent for basic to medium use cases.

I've tried both Opus and GPT 5.4, they also hallucinate just like the rest at a much higher cost.

The more you use a model overtime, the better you become with it. It's really hard to measure, my main metric lately has been tokens per second/time to complete task.

At this point I've the feeling frontier models are optimizing for benchmarks and one shot prompts.