HyperAgents: Self-referential self-improving agents

51 points • by andyg_blog • last Tuesday at 4:54 PM • 19 comments • view on HN

https://arxiv.org/abs/2603.19461

Comments

The paper is here - https://arxiv.org/pdf/2603.19461

This, IMO is the biggest insight into where we're at and where we're going:

> Because both evaluation and self-modification are coding tasks, gains in coding ability can translate into gains in self-improvement ability.

There's a thing that I've noticed early into LLMs: once they unlock one capability, you can use that capability to compose stuff and improve on other, related or not, capabilities. For example "reflexion" goes into coding - hey, this didn't work, let me try ... Then "tools". Then "reflxion" + "tools". And so on.

You can get workflows that have individual parts that aren't so precise become better by composing them, and letting one component influence the other. Like e2e coding gets better by checking with "gof" tools (linters, compilers, etc). Then it gets even better by adding a coding review stage. Then it gets even better by adding a static analysis phase.

Now we're seeing this all converge on "self improving" by combining "improving" components. And so on. This is really cool.

➕ show 4 replies

Jerrrrrrrry • today at 6:00 PM

No matter how far we go, we end up with generation / discrimination architecture.

Its is the core of any and all learning/exellency; exposure to chaotic perturbations allow selection of solutions that are then generalized to further, ever more straining problems; producing increasingly applicable solutions.

This is the core of evolution, and is actually derivable from just a single rule.

➕ show 2 replies

flockonus • today at 5:48 PM

The readme seems very unclear about what it does. Anyone has a practical example of it?

➕ show 2 replies

measurablefunc • today at 7:07 PM

That's great but how about UltraAgents: Meta-referential meta-improving self-referential hyperagents?

sonu27 • today at 6:59 PM

Can someone add this to OpenClaw :)

jauntywundrkind • today at 5:56 PM

Pi is self modifying, self aware. https://lucumr.pocoo.org/2026/1/31/pi/

But this idea of having a task agent & meta agent maybe has wings. Neat submission.

➕ show 1 reply

llmslave • today at 6:12 PM

I think even code bases will have self improving agents. Software is moving from just the product code, to the agent code that maintains the product. Engineering teams/companies that move in this direction will vastly out produce others.

I've had to really shift how I think about building code bases, alot of logic can go into claude skills and sub agents. Requires essentially relearning software engineering

maxbeech • today at 7:13 PM

[dead]

felixagentai • today at 6:43 PM

[dead]

agentpiravi • last Tuesday at 5:03 PM

[dead]

andyg_blog • last Tuesday at 4:54 PM

[dead]

alt Hacker News

HyperAgents: Self-referential self-improving agents

Comments