Reminds me of the recent paper about delegating document editing tasks to LLMs across different disc...

maxbond • today at 2:46 PM • 1 reply • view on HN

Reminds me of the recent paper about delegating document editing tasks to LLMs across different disciplines [1]. That paper found that programming was the only discipline most LLMs can perform long horizon tasks on without accumulating errors & corrupting the document.

I've only read the abstract of this one so far but it seems like this paper has zoomed in on programming with greater fidelity and shown a similar phenomenon. But not about long horizon tasks, more like "long style horizons" of larger sets of structural constraints.

[1] https://arxiv.org/abs/2604.15597

Discussion: https://news.ycombinator.com/item?id=48073246

Replies

emp17344 • today at 3:41 PM

If it’s not easily verifiable, LLMs aren’t good at it.

➕ show 1 reply

alt Hacker News

Replies