When AI Builds Itself: Our progress toward recursive self-improvement

497 points • by meetpateltech • yesterday at 4:20 PM • 667 comments • view on HN

Comments

Seems ironic that Claude isn't listed as a contributor to this article.

If was used in writing the article, why not list it? If it wasn't used, that seems to go against Anthropic's whole message.

Obviously readers value human-written content more, but isn't it their interest to attempt to destigmatize llm output as much as possible?

abalashov • yesterday at 9:32 PM

"It is genuinely unclear whether today’s training methods and architectures could unlock that capacity."

Aye.

squidsoup • yesterday at 11:58 PM

It's comforting to know that Anthropic's most capable model, Mythos, is named for the Lovecraftian universe replete with horrifying evil gods with complete indifference to humanity. Nothing at all to worry about.

➕ show 1 reply

artninja1988 • yesterday at 5:07 PM

The mythos public release will be a big indicator if the Anthropic and SF story of transformational ai soon holds any water imo

mactavish88 • yesterday at 11:44 PM

Recursive self-improvement towards what exactly?

Living organisms evolve towards some notion of "better", and "better" is an incredibly multifaceted notion (many facets of which we simply cannot even capture in language).

butler14 • yesterday at 6:06 PM

Warming up for that IPO

➕ show 1 reply

ramaseshanms • today at 2:45 AM

Its possible that Andrej Karpathy could have been hired for scaling his vision on the auto-research repo. (His version of "AI that builds itself")

darepublic • yesterday at 6:04 PM

the tooling has quite a ways to go to catch up to the llm engines that drive the real value. I have encountered various codex bugs (I know not anthropic) which tell me that.. these billion dollar companies, if they are eating their own dog food, can still release buggy crap software.

BatmansMom • yesterday at 10:12 PM

How are these animations being made? I'd love to get a blog post on them. If its AI I'd love to know the workflow, but something tells me there is a lot of human creative input

Aperocky • yesterday at 6:16 PM

Anthropic is the most self hyped company I've seen, to the point that I'm wondering what would happen to its employees if they held a different opinion. Do they just.. keep it to themselves? For instance, if some Anthropic employees had a completely rational opinion that all of this isn't going to lead to AGI, but I just don't hear that ever from them.

The metric being tracked, code commits, is hilariously one sided. Philosophically, if you had one part of your work now practically free, you'd like to utilize that freedom to maximally cover for the other parts, for instance:

Instead of thinking about edge cases with brain and whiteboard, you can have the LLMs to simply generate most possibility including tests for it, because that is cheaper. There's probably 50x more commits of which 40 will be revert pairs but we are only twice as fast. And in reality nothing did change because the outcome remain the same. I can't see how it is necessarily different in the LLM space.

➕ show 2 replies

0xbadcafebee • today at 6:06 AM

You can't predict the future, and neither can Anthropic. Nothing gets better forever. Everything plateaus or gets worse.

This whole set of imaginary scenarios is based on a single company writing code that isn't even that complicated and represents a single product line for a single company in a single industry. You might wanna see this replicated in at least one other scenario first before you call it on the AI gods enslaving humanity. These imaginary scenarios also depend on a logistical, financial, & geopolitical system that is unsustainable & will be curtailed in the near-future one way or another.

They keep referring to this as intelligence - it isn't. It can't actually learn. It can just code in a loop. That isn't learning. It can't do real RL with meaningful persistent semantic memory in a realistic timeframe or cost, and it can't reason accurately outside of predetermined scenarios (hell, most of the models still can't tell time). It still can't do what a 4 year old can do. So let's cool it on the dreams of benevolent god-machines or whatever.

The tech industry has been a farce for years. We sit here in this bizarre artificial echo chamber and imagine that the whole world revolves around us, when in reality the whole world is limited by us. If a recursive self-improvement loop replaces us all, it will be a boon to the world, as the world won't be limited by this industry's stupidity anymore. But considering that the world is not actually run by tech bozos, harms and uncertainties brought by AI will be pushed back on and reigned in by normal people, as always happens with new technologies. An AI can't engineer its way around politics. The self-improvement loop is just as likely to be outlawed as it is actually working outside of Anthropic's walled garden.

semessier • today at 3:33 AM

what could go wrong in the recursive loops running today 24/7 probably. Attended/unattended almost makes no difference any more, no human can grasp probably numerous changes per iteration. This is outright dangerous.

bottlepalm • yesterday at 7:49 PM

I'd use number of commits as a metric versus lines of code. A commit is generally a unit of work - regardless of the lines of code added/removed. It'd be interesting to see the metrics in terms of commits. I'm sure it's still an order of magnitude jump. Personally I'm flying with my own projects with AI, lots of commits, but I really try to minimize lines of code added. If I can remove and simplify existing code so the balance of lines added on commit are minimal - that's the path to a better quality app overall.

sonink • yesterday at 6:23 PM

Broadly agree to this position - I think there are some people skeptical that Anthropic is doing this for regulatory capture - but I think there are being honest about they are seeing and how regulation should catch up.

I for one, believe that we should pause all work on AI for the forseeable future. This is almost impossible to orchestrate - but we should still try nevertheless. Maybe we are not able to pause, but we are able to slow down. That might give us more room, to maybe able to pause in the future. But going ahead is too dangerous.

And its not just Anthropic which is saying this. Even Geoffry Hinton has said the same thing. If there is a non-zero chance that AI can kill all of humanity, and both Geoffry and Anthropic have the same position, then it makes sense for us to be hundred percent sure before we move ahead. Dario/Anthropic have already made their money from AI, maybe they are just being honest about what they think lies ahead.

➕ show 2 replies

hgoel • yesterday at 8:26 PM

As usual, I find the AI-related discussion here to be hopelessly hysterical and conspiratorial. I get the impression that a large chunk of people have only read the title and assumed Anthropic is referring to recursive self-improvement in the runaway singularity sense.

One of the examples they provide, of giving Claude the task of training a small AI model, then asking it to improve certain benchmarks, is essentially Karpathy's AutoResearch. This is already known to work. While calling it "self-improvement" is perhaps a stretch, it is describing a capability current gen AI has, that anyone can test and I have been using to great effect.

I disagree with their conclusion, I think this kind of self-improvement will hit an asymptote, where every subsequent model can only make smaller and smaller improvements.

_pdp_ • yesterday at 8:47 PM

I don't read anywhere how much code they are talking about and what programming language. I think those are useful metrics.

aleqs • yesterday at 6:10 PM

Okay, so anthropic has amazing AI which supposedly writes most of their code and can continuously improve... meanwhile they have outages on a regular basis, and any kind of long-running work will now consistently hit 'API Error: Server is temporarily limiting requests'. Not sure of this is intentional to force a reduction of token usage, but at this point I need to build around these throttling limits and outages with my own tools to restart/resume sessions. From my experience, in the last 2 weeks, literally 100% of any non-trivial Claude session/work will now be blocked on these issues, requiring manual intervention.

One of my focuses now is my own model-agnostic, harness and workflow orchestration (I know everyone is building these) , baselining on opus, and aiming to transition to Chinese models like deepseek in the short term and hopefully open, self hosted models in the future (which I plan to open source).

The nonstop marketing fluff from anthropic while their service quality and availability noticeably degrades... just continues to destroy my trust in the company.

➕ show 20 replies

snick3rz_ • today at 6:05 AM

Facially this smells of puff. That doesn't mean it's all false. It means be wary of anything that doesn't have a critical thing to say.

eranation • today at 6:55 AM

All this singularity trajectory is really interesting. If they manage to build a model that is capable of building the next version of Claude (model and tooling) - wouldn't it be their interest at some point to keep it to themselves?

If we ever get to a point where the centaur period is over (when human + AI is not better than just AI) then what competitive advantage ANY human can have other than

- the money they already have

- luck?

- a good idea and good taste but if we assume AI can do better than any human, that also goes out the window

So, this whole singularity goes into a place where no one is really needed, the only thing that will "save us" (other than "The Expanse" like world / UBI) is if there will be no demand to the supply of AI work. Even if it's better. (example is - there is demand to seeing Magnus Carlsen play, there is no demand to the Stockfish on my phone getting into a stalemate with another Stockfish on another phone. Also people like to watch humans compete with humans, there is no demand to see a race between Usain Bolt and a rocket). So if people will not buy AI generated stuff (we'll get to a point where everyone will assume something AI generated because AI might get to a point where it is not as easy to identify it. E.g. it will stop looking like slop... but I believe services that give you a "human generated" 3rd party evidence can happen, again all based on supply and demand...)

So as we near singularity... All it takes is one open weights model, and one open harness that is capable of self improvement, and Anthropic's entire moat is gone. That open weight model might even be built with Claude Code + Mythos (once it's released).

But don't worry, all moats will be gone and we'll all just do yoga, read books and connect to each other because AI will produce everything for free using renewable energy, right? Or we'll all become batteries in a simulation, probably something in between.

➕ show 1 reply

taormina • today at 6:36 AM

So, is this what they call Opus 4.8? Improvement?

jasongill • yesterday at 11:33 PM

"My CPU is a neural-net processor - a learning computer" springs to mind

damowangcy • yesterday at 4:46 PM

AI tech bro:

Month 1 - 6 months to AGI

Month 2 - We will Replace all jobs

Month 3 - Okay maybe only the SWEs, programming is solved

Month 4 - Announce model that is too dangerous to release

Month 5 - Releases dangerous model

Month 6 - This is it! We will replace AIs with more AIs (*secretly files for IPO)

AI is here to stay, like it or not but it is not the solution to everything. If it is, what is Anthropic's moat? A better model? I don't see any ecosystem being built by them, as MCP is almost obsolete except for some very niche use case. And they're doing stuff that a non-profit version of OpenAI would do. Can we trust a for-profit company to stand against their investors during a conflict of interest? Because running a company for maximum profit versus being ethical is two different end of the spectrum.

➕ show 2 replies

techblueberry • yesterday at 6:01 PM

> A caveat: Lines of code is an imperfect measure, as it measures quantity over quality. So 8× lines of code/engineer/day in the second quarter of 2026 is almost certainly an overstatement of the true productivity gain. Nonetheless, it indicates an acceleration. At Anthropic, we don’t reward people for how many lines of code they write; rather, team members are producing more code simply because they’re using AI systems to write more code.

I simultaneously think the AI revolution is making real revolutionary gains and am mystified by the lying.

An accurate Translation seems to be “we made this shit up, but it feels right”

➕ show 2 replies

geodel • yesterday at 7:07 PM

It will be so powerful that it can't be trusted with any earthly person.

brazukadev • today at 1:10 PM

When claude code removes React from its own code I'll believe that.

swader999 • yesterday at 9:06 PM

IPO IPO IPO!!!

georgehotz • yesterday at 6:09 PM

The world has been recursively self improving for millenia. Similar to scientology, this is a cult pushing sci-fi nonsense. They are just coupled to an LLM lab to give their stories an aire of seriousness. Imagine scientology starting making laptops.

➕ show 1 reply

replwoacause • yesterday at 8:36 PM

I love that animation, really cool

snick3rz_ • today at 6:05 AM

This is facially a puff peice. That doesn't mean it's all false. It means be wary of anything that doesn't have a crtical thing to say.

ReptileMan • yesterday at 10:57 PM

Anthropic is all talk and no delivery last few months. This cry for pause is just them realizing they have no moat at all.

cess11 • yesterday at 9:00 PM

'“Good code” means two things: it works, and it is written in a manner that allows another engineer to understand it and build upon it.'

I disagree with this. Good code is easy to change, which is much harder to accomplish than code that can be added to.

"If technical trends in advancing capabilities continue, and AI systems are able to develop the capabilities inherent to transformative human ingenuity, then it is plausible that AI systems could design and refine themselves."

I find the first premise weak and implausible, and the second one is obviously false. To me it comes across as an insult to the reader.

EGreg • today at 7:21 AM

RSI is dangerous. That is why we designed CDE:

https://safebots.ai/declarative.html

holoduke • yesterday at 6:12 PM

I have a claw that is instructed to make at least 500 pr per day. It uses Claude, Gemeni and openai and runs basically every few minutes. I use online forums for input for the claw. Moltbook, reddit etc. it's quite funny how it tries to improve itself. But to say it really creates a new skynet. Nah. Not at all. It's more a clutter of useless features or incomprehensible code restructuring.

➕ show 1 reply

amelius • yesterday at 5:55 PM

Does this train on LLM output, or is this more like iterative self prompt improvement?

➕ show 2 replies

deterministic • today at 4:27 AM

I have used custom code generators for years, generating 90+% of the code needed to write a typical biz application. Claude Code is useful and I use it every day. But it still hasn't beaten the productivity of my code generator.

newsicanuse • today at 2:42 AM

pre IPO truck load of crap

kylehotchkiss • yesterday at 9:39 PM

Isn't this like a perpetual energy machine? Or wouldn't entropy start kicking in and the quality of the system begin to degrade over time? (philosophically I don't believe AGI is an achievable thing)

➕ show 1 reply

4ffs • yesterday at 7:55 PM

Theyre making a mistake with this continued self-hyping. At some point even the dumbest of prospective investors don't buy it.

SimianSci • yesterday at 6:11 PM

Anthropic is looking to IPO here soon. A key aspect of this is to prove profitability.

Shifting their focus from Training new models to instead serving inference, they would greatly reduce their spend. In fact this is something being reported on that they are already doing, which is the reason for their first ever profitable quarter.

Its awfully convenient that the company which has greatly reduced its spend on training is now asking for a slow down in this area.

➕ show 2 replies

vblanco • yesterday at 5:29 PM

Another article about how anthropic wants to ban everyone except themselves and destroy opensource and chinese AIs.

➕ show 2 replies

margorczynski • yesterday at 8:57 PM

The closer to the IPO the more marketing drivel we'll get from both Anth and OpenAI.

➕ show 1 reply

adverbly • yesterday at 10:10 PM

Lol they're using lines of code as a KPI?

Come on guys...

That is making me less impressed not more impressed!

chilipepperhott • yesterday at 7:48 PM

I find any and all claims like this ridiculous from a company who can't build a terminal application that uses less than a gigabyte of RAM.

➕ show 10 replies

bitwize • yesterday at 8:05 PM

After several months with their top engineers and state-of-the-art AI on the job, Anthropic managed to "reduce flickering by 85%" on their TUI Claude Code client, which is built in fucking React and rendered by drawing the entire chat conversation each time (hence the flicker). I think they've since eliminated it completely by slapping some double-buffering around it (since "our client is actually a real-time game engine" after all). Meanwhile for decades Emacs and Vim have had an optimizer built into their display cores that solves for the minimum set of terminal escape commands it takes to transform the screen from a given old state to a desired new state.

You will forgive me when, between muted snickers, I express considerable doubt that Anthropic will be able to bring its AI to a point of "self-improving" any time soon.

andrewlin247 • yesterday at 11:19 PM

Imagine showing this article to yourself three years ago

➕ show 1 reply

esafak • yesterday at 6:47 PM

> In the coming months, we will organize conversations where policymakers, researchers, civil society, and other AI companies can help answer some of the questions this piece raises, especially around full recursive self-improvement and how to create better options for coordination and deliberation.

If they wanted to they could have convened an international forum with commercial and political stakeholders years ago. Less talk, more do.

deterministic • today at 4:37 AM

I call BS on this. For a LLM to recursively improve itself it would need to (small step) improve the training data and/or (big step) come up with fundamentally new architectures superior to transformers. The small step improvements might be doable. But nobody is making any claims about the big step improvements.

mrandish • yesterday at 11:22 PM

Was anyone else fished in by the title and disappointed? After some broad introductory discussion of RSI, the article was almost about LLM coding. While there are some metrics for unattended agentic coding, it doesn't discuss "When AI builds itself" (beyond 'not now') or any progress specifically toward actual recursive self-improvement. I'm very interested in any empirical evidence of meaningful progress in RSI, so... this felt deceptively titled.

To me, unattended agentic coding is not RSI, in the same way a self-reloading "Unattended 3D printer" is not at all a "3D printer that recursively prints complete 3D printers in which each generation is significantly faster and more advanced than the last." The "unattended" part is obviously necessary but hardly sufficient. The article tacitly assumes LLM progress to be something like 1: Unattended agentic coding, 2: AGI, 3: RSI. I suspect that third step should be labeled "not to scale."

I'm increasingly convinced that actual Full Foom RSI (FF-RSI) is on a radically different scale than the first two. Just leaving it unaddressed is like assuming: Step 1: Manned space station, Step 2: Manned Mars base, Step 3: Manned Alpha Centauri base, are "just logical next steps." FF-RSI requires sustaining superlinear, recursively amplifying cognitive returns along a specific directed path - and we currently have no empirical evidence that such returns can exist for artificial OR biological intelligences. Large collectives of the smartest humans alive (Bell Labs, IAS, etc) haven't just failed to get anywhere close to reliably sustaining that, we can't even reliably predict non-recursive, single occurrences or even imagine any way all 8B humans could fully mobilize to predictably achieve non-recursive, single occurrences.

The only prior we have for open‑ended intelligence improvement is biological evolution which shows extremely slow and unreliable sublinear returns at best. And even if unbounded, recursive self‑improvement is physically possible, it may be practically unachievable due to asymptotic economic, resource and other barriers in the same way approaching light speed requires exponentially more energy. I think it's plausible, and maybe probable, that AIs achieve true super-human intelligence in a decade and yet still won't achieve FF-RSI for centuries, if ever. To me, absent compelling evidence to the contrary, that's the reasonable Null Hypothesis. Even if you feel that's too pessimistic, it seems reasonable to expect any serious discussion of "Progress Toward RSI" to first discuss why it might even be plausible that 1: Miles, 2: AU (Astronomical Units), and 3: Light Years belong on the same scale, instead of just assuming it like the meme's empty "Step 3. .... " before moving on to "Step 4. Profit!" (or "IPO!" but very, very responsibly).

willXare • today at 2:53 PM

[flagged]

alt Hacker News

When AI Builds Itself: Our progress toward recursive self-improvement

Comments

🔗 View 15 more comments