logoalt Hacker News

mathisfun123yesterday at 7:45 PM1 replyview on HN

You don't know what you're talking about: an enormous amount of TOPs now runs through quantized (read: integer) kernels. Many GPUs don't have even FP64 or even FP32 support.


Replies

jmalickiyesterday at 8:07 PM

EDIT: I was completely wrong, I have mostly worked with GGUF and related quantizations that are LUTs, thank you for correcting me.

show 1 reply