Tuna-Fish • today at 2:42 PM

HBF is not that. The paper you linked is about using existing flash memory to boost LLM performance, with all kinds of optimization tricks. HBF is about building flash memory that doesn't need any of those tricks and simply has the read throughput that inference requires.
