OpenAI offers that, or at least used to. You can batch all your inference and get much lower prices.

stavros • yesterday at 12:26 AM • 1 reply • view on HN

Replies

Still do. Great for workloads where it's okay to bundle a bunch of requests and wait some hours (up to 24h, usually done faster) for all of them to complete.

alt Hacker News

Replies