logoalt Hacker News

verdvermtoday at 3:13 PM5 repliesview on HN

You can access Claude models with Google Cloud reliability via VertexAI. The caveat is that you cannot use your subscription, per-token pricing only.

I personally prefer per-token, it makes you more thoughtful about your setup and usage, instead of spray and pray.

You can also access the notable open weight models with VertexAI, only need to change the model id string.


Replies

Scene_Cast2today at 3:26 PM

I also use them per-token (and strongly prefer that due to a lack of lock-in).

However, from a game theory perspective, when there's a subscription, the model makers are incentivized to maximize problem solving in the minimum amount of tokens. With per-token pricing, the incentive is to maximize problem solving while increasing token usage.

show 1 reply
limatoday at 6:10 PM

We tried this, but the quota for Opus models defaults to 0 on VertexAI and quota increase requests are auto-rejected.

Any tips?

perfmodetoday at 3:42 PM

You can use your subscription for Anthropic-hosted Claude models?

show 2 replies
chewbachatoday at 3:35 PM

You mean Google Chaos Services as we call them?

joe_mambatoday at 3:18 PM

I saw a funny skit where if free Claude instance was down for you, you could just ask Rufus, Amazon's shopping AI assistant, your math/coding question phrased as a question about a product, and it would just answer lol.

show 1 reply