100% agree.
My impression is that the open-weight models have been drawing close to level on coding tasks, while Anthropic and OpenAI have been putting large amounts of effort into developing their models' abilities in other domains: legal, biomedical/science, etc. Anthropic (especially?) has also been putting more obvious resources into optimising their harnesses - from Code to Cowork (which is kinda Code for normies), Design, etc.
On my end, GLM 5.2 has replaced Sonnet and Opus as the backend for "normie" agentic workflows. So I don't know. From what I've seen, the open-weight models are perfectly capable of working agentically.