logoalt Hacker News

aurareturnyesterday at 11:30 AM1 replyview on HN

Don’t forget that the 8B model requires 10 of said chips to run.

And it’s a 3bit quant. So 3GB ram requirement.

If they run 8B using native 16bit quant, it will use 60 H100 sized chips.


Replies

dust42yesterday at 11:38 AM

> Don’t forget that the 8B model requires 10 of said chips to run.

Are you sure about that? If true it would definitely make it look a lot less interesting.

show 1 reply