logoalt Hacker News

mtct88yesterday at 2:07 PM5 repliesview on HN

Nice release from the Qwen team.

Small openweight coding models are, imho, the way to go for custom agents tailored to the specific needs of dev shops that are restricted from accessing public models.

I'm thinking about banking and healthcare sector development agencies, for example.

It's a shame this remains a market largely overlooked by Western players, Mistral being the only one moving in that direction.


Replies

lelanthranyesterday at 2:36 PM

> It's a shame this remains a market largely overlooked by Western players, Mistral being the only one moving in that direction.

I've said in a recent comment that Mistral is the only one of the current players who appear to be moving towards a sustainable business - all the other AI companies are simply looking for a big payday, not to operate sustainably.

show 1 reply
Aurornisyesterday at 3:36 PM

I play with the small open weight models and I disagree. They are fun, but they are not in the same class as hosted models running on big hardware.

If some organization forbade external models they should invest in the hardware to run bigger open models. The small models are a waste of time for serious work when there are more capable models available.

NitpickLawyeryesterday at 2:14 PM

I agree with the sentiment, but these models aren't suited for that. You can run much bigger models on prem with ~100k of hardware, and those can actually be useful in real-world tasks. These small models are fun to play with, but are nowhere close to solving the needs of a dev shop working in healthcare or banking, sadly.

kennethopsyesterday at 2:10 PM

I love the idea of building competitor to open weight models but damn is this an expensive game to play

smrtinsertyesterday at 2:26 PM

How true is this? How does a regulated industry confirm the model itself wasn't trained with malicious intent?

show 1 reply