logoalt Hacker News

agentdev001today at 1:53 PM1 replyview on HN

I find papers/articles which discuss solutions that rely heavily on a model in the middle unreadable, if the models used are not discussed.

The data you need to get into context for a small model, vs a big boy frontier model, vs a fine tuned open weight big boy- are all very different. I can understand what they're doing here, and most of the 'why', but- not all of the why.


Replies

sarangk90today at 2:50 PM

[flagged]