Hacker News

zackify, today at 3:23 PM

I ran GLM 5.2 on a rented 8x H200 node. It could only do 2x concurrency, at a cost of $40 an hour. It felt great, but dang, I wish it was cheaper... It needs 750 at FP8.


Replies

zackangelo, today at 5:26 PM

What was the concurrency limitation? That node should be able to support a lot more.
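
For anyone who wants to sanity-check the numbers in this exchange, here is a rough back-of-envelope sketch. It assumes the "750" above means 750 GB of weights at FP8 and that "2x concurrency" means 2 simultaneous requests; neither is confirmed in the thread. The 141 GB per H200 is the published HBM capacity. The function names are made up for illustration, and the result says nothing about how much KV cache each request actually needs, since that depends on the model and context length.

```python
# Back-of-envelope numbers for the node discussed above.
H200_MEMORY_GB = 141   # HBM capacity per H200 (public spec)
NUM_GPUS = 8           # from the comment: 8x H200
WEIGHTS_GB = 750       # ASSUMPTION: the "750" in the comment read as GB at FP8
COST_PER_HOUR = 40.0   # USD per hour, from the comment
CONCURRENCY = 2        # ASSUMPTION: "2x concurrency" read as 2 simultaneous requests


def headroom_gb(num_gpus: int, per_gpu_gb: int, weights_gb: int) -> int:
    """Memory left over after loading the weights (KV cache, activations, overhead)."""
    return num_gpus * per_gpu_gb - weights_gb


def cost_per_stream(cost_per_hour: float, concurrency: int) -> float:
    """Hourly cost divided evenly across concurrent requests."""
    return cost_per_hour / concurrency


if __name__ == "__main__":
    total = NUM_GPUS * H200_MEMORY_GB
    spare = headroom_gb(NUM_GPUS, H200_MEMORY_GB, WEIGHTS_GB)
    print(f"total memory: {total} GB")
    print(f"headroom after weights: {spare} GB")
    print(f"cost per concurrent request: ${cost_per_stream(COST_PER_HOUR, CONCURRENCY):.2f}/hour")
```

Under those assumptions the node has a few hundred GB left after the weights are loaded, which is consistent with the reply's point that memory alone probably was not what capped concurrency at 2.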