logoalt Hacker News

braplast Friday at 8:44 PM1 replyview on HN

I get what you’re saying, and you’re right, however I can also see where they’re coming from:

Empirically, agents (especially the coding CLIs) seem to be doing so much better with files, even if the tooling around them is less than ideal.

With other custom tools they instantly lose 50 IQ points, if they even bother using the tools in the first place.


Replies

tensorlast Friday at 8:48 PM

Sorry, this still makes no sense. LLMs don't care about files. The way most codings systems work is that they simply provide the whole file to the LLM rather than a subset of it. That's just a choice in how you implemented your RAG search system and database. In this case the "record" is big, a file. No doubt that works for code, but it's nonsensical outside that.

E.g. for wikipedia the logical unit would likely be an article. For a book, maybe it's a chapter, or maybe it's a paragraph. You need to design the system around your content and feed the LLM an appropriate logically related set of data.

show 5 replies