logoalt Hacker News

Computer Use is 45x more expensive than structured APIs

419 pointsby palashawasyesterday at 4:34 PM242 commentsview on HN

Comments

ipunchghostsyesterday at 8:53 PM

I have a similar finding for a website I made that collates college town bar specials and live music. Using agents with vision models works but it's not as straightforward as one would initially think. U can check out the results here. https://www.nittanynights.com

sanderjdyesterday at 6:50 PM

Only 45x?

doctorpcgumtoday at 3:43 AM

Bh

taorminayesterday at 5:06 PM

The interface designed for humans is poor for AI needs? And the interface designed for programmatic use is easier for the AI to use? In other news, the sky is blue and water is wet.

show 1 reply
creatonezyesterday at 9:26 PM

Browser agents / vision agents are a menace and ISPs should outright ban subscribers who run them on the public internet.

gowldyesterday at 5:39 PM

Confusing title? "Computer Use" is actually "Browser vision"?

deafpolygonyesterday at 8:50 PM

This is missing the point that AI training probably costed boatloads more to achieve to get here.

theabhinavdasyesterday at 8:38 PM

For now.

azyctoday at 9:23 AM

[flagged]

sneefletoday at 7:42 AM

[flagged]

WhoffAgentsyesterday at 9:16 PM

[flagged]

jacktutoday at 5:50 AM

[dead]

momo26today at 2:34 AM

[flagged]

Amber-chentoday at 2:43 AM

[dead]

lacymorrowyesterday at 9:08 PM

[dead]

rgilliotteyesterday at 7:36 PM

[dead]

show 1 reply
BionicAItoday at 8:46 AM

[flagged]

volume_techyesterday at 6:06 PM

[flagged]

overlord1109today at 3:25 AM

[dead]

doctorpcgumtoday at 3:43 AM

[flagged]

faangguyindiayesterday at 5:29 PM

I saw Codex was screenshotting, then clicking around. I just stopped it and never used that again.

Using CLI tools is much faster and token-efficient. I developed ten apps in the last two months. One reached 10,000+ monthly active users.

I ask Codex to generate SVG line by line and backtrack edit, ask it to use Inkscape to generate icons, etc...

I developed all this on $20 codex sub.

show 2 replies
bottlepalmyesterday at 9:44 PM

There's no way this is true. I would argue in some cases computer use is less expensive. First for APIs that don't even exist, it's a non starter. Second most APIs are not designed for agents and are verbose as hell - returning the entire DTO and tons of unnecessary properties burns tokens. Second computer use is not as token hungry as you think it is - a single screenshot may be just 1000 tokens, it's actually competitive and beats API workflows in many cases.

0xWTFyesterday at 10:25 PM

So, to make this concrete, Akasa uses computer vision to read medical records to replace medical coders because there aren't enough medical coders to get all the billing right and medical systems leave like $1T a year on the table.

The EHRs could give companies like Akasa API access so Akasa could then just run NLP, but the EHR vendors don't grant various third parties API access for various reasons, so instead Akasa gets a seat license for each medical system they service and uses computer vision to read the screen (a cadre of Akasa medical coders review errors to stay up to date with unannounced changes from the EHR vendors) and then runs the NLP to figure out which CPT codes to assign to actually put in a bill and send the payer so the hospitals can stay afloat.

So this 45x delta is how much more the medical systems pay Akasa because Epic won't work with Akasa.

This is but one example of why US medical bills are outrageously high.