nickpsecurity today at 3:23 AM

You're thinking in a provably-useful direction:

https://arxiv.org/pdf/2312.11514


Replies

Tuna-Fish today at 2:42 PM

HBF is not that. The paper you linked is about using existing flash memory to boost LLM performance through all kinds of optimization tricks. HBF is about making flash memory that doesn't require any of those tricks and simply has the read throughput that inference needs.