Similar article for vLLM: | alt Hacker News

Palmik • today at 4:48 AM • 2 replies • view on HN

Bechmarks from InferenceX (they do not have apples-to-apples setups to compare the different engines for whatever reason): https://inferencex.semianalysis.com/inference?i_hc=1&g_model...

I find it odd that sglang, vLLM, TRTLLM don't seem to want to publish benchmarks comparing each other. They used to, but now there seems to be some unspoken rule against it.

At least we get comparison against "other OSS engine" this time, but that could be HF's Transformers as well :)

Replies

imjonse • today at 5:13 AM

They're OSS projects in a friendly competition, both working towards the goal of having alternatives to big closed source players. No need for jabs.

➕ show 1 reply

mirekrusin • today at 6:58 AM

Too early, they simply didn't yet have access?