Based on my first impressions it's about 6 months behind the frontier labs. So very similar to ...

LaurensBER • yesterday at 9:47 PM • 6 replies • view on HN

Based on my first impressions it's about 6 months behind the frontier labs. So very similar to Opus in January.

That is, pretty damn impressive and very useable. When it comes to architecture or complex problems it does noticeable worse but I don't think anyone expected anything else.

One particular interesting strong point seems to be design and user interfaces. It does seem to punch above it's weight there but that might just be personal preference.

Replies

pastel8739 • today at 5:08 AM

Opus in January was right about when AI became actually useful for coding for me. So if that’s the case, that is absolutely great.

jstummbillig • today at 1:29 PM

> When it comes to architecture or complex problems it does noticeable worse but I don't think anyone expected anything else.

So it's not really similar to opus in January?

byw • today at 3:16 AM

> Opus in January

So pre-nerf Opus?

➕ show 1 reply

becomevocal • today at 2:39 AM

Appreciate the quick take! Sounds like a keeper to me. I think the Opus and Fable design (that I saw for a short while) have gotten stale

➕ show 1 reply

Lord-Jobo • today at 2:14 AM

It’s insanely impressive and I’m so glad that the space has actual competition

ignoramous • today at 7:12 AM

> Based on my first impressions it's about 6 months behind the frontier labs. So very similar to Opus in January.

According to this one benchmark, I find it amusing that Qwen3.6 27B beats ALL "frontier lab" models on coding Kotlin: https://archive.vn/RYBCL / https://gertlabs.com/rankings?mode=agentic_coding&language=k...

➕ show 1 reply

alt Hacker News

Replies