> the ability to "hotswap" models with different utility instead of restarting the server
The article mentions llama-swap does this