Agree, and I'd add: the feedback loop between decision and consequence got dramatically shorter. You can test an architectural hypothesis in hours instead of weeks. That part is genuinely powerful.
But faster feedback also means bad decisions propagate faster. The skill isn't generating three implementations in parallel -- it's knowing which one won't page you at 3am. That judgment comes from having been paged at 3am enough times to recognize the patterns.
> The skill isn't generating three implementations in parallel -- it's knowing which one won't page you at 3am.
100% this.