villgax • yesterday at 10:13 AM • 2 replies

Got nuked on day zero by Qwen models at a tenth or so of the params.

Does not handle critical inputs even for moderation tasks

These guys did not even bother with an official huggingface space

And the biggest stupidity seems to be fixating on MXFP4 for Apple Silicon when it doesn't even have hardware support for it; they should have just done Q4 for GGUF-based inference

Replies

gyan • yesterday at 11:21 AM

> These guys did not even bother with an official huggingface space

https://huggingface.co/sarvamai

petesergeant • yesterday at 11:50 AM

Got to start somewhere.

I do think convincing world-class talent to live in Bangalore is likely to be a challenge though.
