Got a link to that API inference provider?
Just look up OpenRouter, OpenCode Go/Zen, Together, Fireworks, Cerebras, etc.
The DeepSeek Platform API is worth checking out too, because of its insanely good caching and low token costs.
I'm on Ollama Cloud, which has a coding-plan-style pricing model but without restrictions on the harness or on direct API calls from your code.
I use Novita AI.