logoalt Hacker News

LaurensBERyesterday at 9:47 PM6 repliesview on HN

Based on my first impressions it's about 6 months behind the frontier labs. So very similar to Opus in January.

That is, pretty damn impressive and very useable. When it comes to architecture or complex problems it does noticeable worse but I don't think anyone expected anything else.

One particular interesting strong point seems to be design and user interfaces. It does seem to punch above it's weight there but that might just be personal preference.


Replies

pastel8739today at 5:08 AM

Opus in January was right about when AI became actually useful for coding for me. So if that’s the case, that is absolutely great.

jstummbilligtoday at 1:29 PM

> When it comes to architecture or complex problems it does noticeable worse but I don't think anyone expected anything else.

So it's not really similar to opus in January?

bywtoday at 3:16 AM

> Opus in January

So pre-nerf Opus?

show 1 reply
becomevocaltoday at 2:39 AM

Appreciate the quick take! Sounds like a keeper to me. I think the Opus and Fable design (that I saw for a short while) have gotten stale

show 1 reply
Lord-Jobotoday at 2:14 AM

It’s insanely impressive and I’m so glad that the space has actual competition

ignoramoustoday at 7:12 AM

> Based on my first impressions it's about 6 months behind the frontier labs. So very similar to Opus in January.

According to this one benchmark, I find it amusing that Qwen3.6 27B beats ALL "frontier lab" models on coding Kotlin: https://archive.vn/RYBCL / https://gertlabs.com/rankings?mode=agentic_coding&language=k...

show 1 reply