logoalt Hacker News

Gareth321yesterday at 12:54 PM4 repliesview on HN

I think the next major innovation is going to be intelligent model routing. I've been exploring OpenClaw and OpenRouter, and there is a real lack of options to select the best model for the job and execute. The providers are trying to do that with their own models, but none of them offer everything to everyone at all times. I see a future with increasingly niche models being offered for all kinds of novel use cases. We need a way to fluidly apply the right model for the job.


Replies

nylonstrungyesterday at 1:12 PM

Agree that routing is becoming the critical layer here. Vllm iris is really promising for this https://blog.vllm.ai/2026/01/05/vllm-sr-iris.html

There's already some good work on router benchmarking which is pretty interesting

condimentyesterday at 3:31 PM

At 16k tokens/s why bother routing? We're talking about multiple orders of magnitude faster and cheaper execution.

Abundance supports different strategies. One approach: Set a deadline for a response, send the turn to every AI that could possibly answer, and when the deadline arrives, cancel any request that hasn't yet completed. You know a priori which models have the highest quality in aggregate. Pick that one.

show 1 reply
monoosoyesterday at 2:08 PM

I came across this yesterday. Haven't tried it, but it looks interesting:

https://agent-relay.com/

eshaham78yesterday at 1:03 PM

[dead]