Until the context window gets superceded with some groundbreaking new architecture, not ever.
Even if LLMs become incredibly, undeniably brilliant 1000000 IQ, they cannot keep track of what's going across long horizons. Imagine a supergenius, but in Memento.
No amount of MD scribbling or embeddings will remove that limitation, but it may obfuscate it further and make it seem like progress is being made.
At the end of the day, being fully autonomous means that something can keep track of context, goals, complex and shifting relationships, over LONG time horizons without drift. If you need to be there to prompt, it is not truly automating. Until the continuity becomes real instead of simulated, context no longer has to compact, and weights update on-demand, you will always need a prompt wrangler leading the effort. And prompt wrangling is cognitive labor.