Does this mean I should switch to SGLang? How hard would it be to add support for these types of models to vLLM? Or does it already handle them?