logoalt Hacker News

dangusyesterday at 7:00 PM1 replyview on HN

Sounds to me like there’s potential to use these for established models to provide cost/scale advantage while frontier models will run in the existing setup.


Replies

yunohnyesterday at 7:13 PM

IME llama et all require LoRA or fine-tuning to be usable. That's their real value vs closed source massive models, and their small size makes this possible, appealing, and doable on a recurring basis as things evolve. Again, rendering ASICs useless.

show 1 reply