Nice! I'm interested in your cubecl-wgpu patches. I've been struggling to get lower than...

scronkfinkle • yesterday at 1:12 PM • 0 replies • view on HN

Nice!

I'm interested in your cubecl-wgpu patches. I've been struggling to get lower than FP32 safetensor models working on burn, did you write the patches to cubecl-wgpu to get around this restriction, to add support for GGUF files, or both?

I've been working on something similar, but for whisper and as a library for other projects: https://github.com/Scronkfinkle/quiet-crab

alt Hacker News