logoalt Hacker News

mattmanseryesterday at 1:42 PM0 repliesview on HN

Try doing some inference with local models.

I'd be surprised if they're making money on inference just from that. There's no way someone paying $20 p/m and using it all day is not spending way more on even just the electricity for tokens, let alone the capex.