Qwen3.7-Max: The Agent Frontier

587 points • by kevinsimper • today at 10:35 AM • 235 comments • view on HN

Comments

The non-hallucination rate in AA-omniscience is SOTA, better than Opus 4.7, Gemini 3.1 Pro and GPT5.5! Congrats to the team

➕ show 5 replies

briga • today at 3:23 PM

I was getting dangerously close to my weekly Claude Code limit last night so I had Claude set up Qwen3.6 with llama.cpp and OpenCode. Honestly it's a great (free!) alternative to Claude Code--certainly more than good enough for a lot of smaller less complex tasks. I'm excited to try this new version. The fact that open-source models are so close to the frontier is very impressive.

➕ show 10 replies

tekacs • today at 12:56 PM

As they start to release more proprietary models, I so wish that they partnered with one of the major US hyperscalers to allow using these models through something US-domiciled.

Totally understand why it may not be reasonable or in their best interest (and that the US is _absolutely_ not doing the same reflexively). But it would be lovely to be able to try these out on production workloads in earnest.

➕ show 7 replies

goyozi • today at 11:10 AM

These are very good numbers. I still don’t get why they don’t compare against latest competitor versions in these posts, it’s not like we’re all not going to notice.

➕ show 8 replies

maxdo • today at 8:20 PM

No opus 4.7 , gpt5.5 , Gemini flash 3.5 in benchmarks

tarruda • today at 12:24 PM

Looking forward to more open weight releases from Qwen, especially 122B and 397B.

➕ show 5 replies

flakiness • today at 3:57 PM

I'm using pi agent and love to try qwen models (hosted). What are the good options? The official provider doesn't include Alibaba. Is OpenRouter etc. fast enough?

(As a reference, DeepSeek v4 is severely throttled on these proxy services.)

➕ show 1 reply

ndom91 • today at 2:20 PM

Is this one of those ones where they'll drop the huggingface release a week later? Or do we know for sure that this is staying proprietary?

➕ show 1 reply

slicktux • today at 10:48 PM

I just started messing with local LLMs and honestly I’m pretty impressed. I have a workstation laptop with an NVIDIA A1000 (6GB VRAM) and 96GB of RAM. I rarely used my gpu. Occasional CAD design or Machine Learning with OpenCV.

I ran llama3:latest and it ran pretty fast! I’m curious to see how Qwen would run on my system.

eddyaipt • today at 2:10 PM

The pattern I trust most is adding a small verification artifact after every external action. Agents usually fail from silent state drift faster than from lack of reasoning depth.

➕ show 1 reply

jdw64 • today at 2:49 PM

QWEN really hits the sweet spot it's cheap, fast, and actually good.

eleventen • today at 7:42 PM

Checking openrouter (it's not available yet) and, uh, what's up with the spike in Qwen usage from early april here? https://openrouter.ai/qwen

Is this normal humans kicking the tires on a new model, or a few whales doing serious benchmarks?

➕ show 2 replies

bratao • today at 12:14 PM

It is super strange that all last (3?) releases they keep comparing older models such as Opus-4.6.

➕ show 3 replies

bsenftner • today at 12:29 PM

Any reports from people using their coding agent(s)?

➕ show 2 replies

XCSme • today at 12:59 PM

Any info on pricing and latency?

➕ show 1 reply

aliljet • today at 4:21 PM

Where can a user reasonably host this in an affordable way to access the local LLM revolution?

➕ show 3 replies

LAC-Tech • today at 10:17 PM

Trying to buy Qwen credits and get an API key is a challenge all in itself. So many site redirects.

hmaddipatla • today at 3:11 PM

The tokenomics and value for capability, context and latency look like they could deliver super competitive offer - what would it take for you to switch??

xiaoluolyg • today at 4:12 PM

congrats to qwen teams, remarkable

cft • today at 4:32 PM

Downloading this and cancelling Google Antigravity Pro at the same time:

I had a Google Pro account that I inherited from buying a Pixel 9 XL - it's free for a year after a flagship Pixel phone purchase. After a year they started charging for it, and i tolerated it, because Flash was usable in Antigravity for dumb auxiliary tasks that I did not want to waste GPT/Opus on. It had a separate generous quota from Gemini 3.1 Pro. Now with Flash 3.5 they combined the quotas with Pro, such that on a Google pro account you can work 4-5 hours per week in Flash. And by the way, 3.1 Pro is useless for programming, compared to Codex/Opus

➕ show 1 reply

indigodaddy • today at 4:08 PM

Is it multimodal/vision?

joshjob42 • today at 4:53 PM

I really like what Qwen are doing, and a lot of these Chinese labs, but until I can ask their models what happened during the student protests in 1989 or why human rights groups are upset about the Uighurs and the model gives me a straight answer I'm just not able to trust these models with anything of substance.

➕ show 2 replies

esafak • today at 1:29 PM

Does anyone have experience with the Alibaba Cloud Model Studio that serves these qwen models?

wolvoleo • today at 7:42 PM

[dead]

spacebacon • today at 2:39 PM

[flagged]

hydra-f • today at 1:16 PM

[dead]

storus • today at 5:43 PM

[dead]

tonyspiro • today at 4:06 PM

[flagged]

kevinsimper • today at 11:12 AM

[flagged]

nikhilpareek13 • today at 1:02 PM

[dead]

DeathArrow • today at 9:03 PM

[dead]

howmayiannoyyou • today at 12:58 PM

I can't bring myself to use any model that trains or sends telemetry back to my country's primary competitor/adversary. I don't care how much money is saved.

➕ show 4 replies

dfansteel • today at 12:58 PM

Can anyone check its knowledge base for me? I’m honestly not able to run it and the Qwen models I can run censor information critical towards the Chinese government.

Tiananmen Square is the first place to start.

➕ show 2 replies

alt Hacker News

Qwen3.7-Max: The Agent Frontier

Comments