logoalt Hacker News

villgaxyesterday at 10:13 AM2 repliesview on HN

Got nuked on day zero by Qwen models at tenth or so of params.

Does not handle critical inputs even for moderation tasks

These guys did not even bother with an official huggingface space

And the biggest stupidity seems to be fixating on MXFP4 for Apple Silicon when it doesn't even have hardware support for it, should have just done Q4 for GGUF based inference


Replies

gyanyesterday at 11:21 AM

> These guys did not even bother with an official huggingface space

https://huggingface.co/sarvamai

show 1 reply
petesergeantyesterday at 11:50 AM

Got to start somewhere.

I do think convincing world-class talent to live in Bangalore is likely to be a challenge though.

show 3 replies