logoalt Hacker News

justinhjyesterday at 6:26 PM1 replyview on HN

We see the same with Google's Flash models. It's easier to make a small capable model when you have a large model to start from.


Replies

karmasimidayesterday at 6:34 PM

Flash models are nowhere near Pro models in daily use. Much higher hallucinations, and easy to get into a death sprawl of failed tool uses and never come out

You should always take those claim that smaller models are as capable as larger models with a grain of salt.

show 1 reply