Hacker News

DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles

59 points by mji, yesterday at 11:44 PM | 6 comments

Comments

Palmik, today at 4:48 AM

Similar article for vLLM: https://vllm-website-pdzeaspbm-inferact-inc.vercel.app/blog/...

Benchmarks from InferenceX (for whatever reason, they do not have apples-to-apples setups for comparing the different engines): https://inferencex.semianalysis.com/inference?i_hc=1&g_model...

I find it odd that sglang, vLLM, TRTLLM don't seem to want to publish benchmarks comparing each other. They used to, but now there seems to be some unspoken rule against it.

At least we get a comparison against "other OSS engine" this time, but that could be HF's Transformers as well :)

palata, today at 12:39 PM

Yet another website where I don't know what they do, so I go to the homepage that has a marketing sentence explaining what they do, and I still don't understand.

Something with LLMs, obviously.