Hacker News

nl, yesterday at 5:58 AM

Llama 3.1 8B is pretty useful for some things. I use it to generate SQL pretty reliably, for example.

They are doing an updated model in a month or so anyway, then a frontier-level one "by summer".


Replies

numeri, yesterday at 3:24 PM

But Taalas had to quantize Llama 3.1 8B to death to get it to fit. It can't produce coherent non-English text at all.