logoalt Hacker News

obblekktoday at 12:52 AM2 repliesview on HN

This is the first time in recent memory that software has had high variable costs so the surprise at these rules is understandable.

In this case, a the difference in context cache hit rate between openclaw and antigravity.

For example if openclaw starts every message with the current time hh:mm:ss at the top of the context window, followed by the full convo history, it would have a cache hit rate if ~0. Simply moving the updated time to each new message incrementally would increase hit rate to over 90%. Idk if openclaw does this but there’s many many optimizations like this. And worse, thrashing the cache has non linear effects on the server as more and more users’ cached contexts get evicted from cache due to high cardinality. The cost to serve difference could be >10x.

Google is the furthest behind on coding agent adoption and has all the incentives to allow off policy use to grow demand. But it would probably be better to design their own optimized openclaw and serve that for free than let any unoptimized requests in.


Replies

martinaldtoday at 1:06 AM

It's a fair point, but I think people are thinking too much about 'cost' and 'subsidies' and just the fact that everyone is so compute stretched.

While it's sort of the same thing, I think it's much more a symptom of not enough compute vs some 'dump cheap tokens' on the market strategy.

One related thought I had was that given OpenAI is the only one _not_ doing this of the big3, it probably indicates they have a lot more spare compute.

It doesn't make sense to me that given the absolutely brutal competition any of these companies would block use of 3rd party apps unless they had to. They clearly have enough cash, so I don't think it's about money - I think it's that an indicator that Google and Anthropic are really struggling with keeping up with demand. Given Anthropics reliability issues last week this does not surprise me.

show 2 replies
goodmythicaltoday at 1:03 PM

>This is the first time in recent memory that software has had high variable costs

Running software has always had a variable cost.

Why should I be surprised if [cloud provider] were upset that I were running a thousand free tier servers? Or utilizing any paid plan at all to somehow effect utilizations far exceeding the clearly documented limitations of my plan?

Using the torrent network protocol on a VPN that doesn't support it, or fork bombing an email server, or using that one popular free video hosting service to host nigh unlimitted arbritrary data, or hosting content that is illegal to the server operator regardless of its legality to me, etc, etc, etc

It's all the same thing: TOS violation.

No one is being forced to use these products without reading and signing the terms of service. In this particular instance, you can even use the free version of the provided service to analyze the terms of service for the paid plan if you were really so lazy.

I really am genuinely confounded as to why people are so regularly surprised that they can't just do whatever they please with proprietary solutions. Like "oh what do you mean I can't lie about the date of my injury in order to get it covered by insurance?".

It's almost like people just assume that everything ever works exactly as they would deam it to (in their benefit), rather than the much more sane assumption that every company is going to be naturally inclined to cater to their own benefit before the users'.