logoalt Hacker News

jbonatakisyesterday at 12:32 AM0 repliesview on HN

The past few weeks I've been building Blackbird

https://github.com/jbonatakis/blackbird

At a high level it's my take on how the execution aspect of spec-driven development should be handled. Where as most tools that are popular right now break a spec down into a task list and instruct your agent to work through it in a single session, I am treating agents as stateless. By this I mean a separate (headless) session is started with selected context for each task. This avoids context exhaustion, compaction (and the resulting confusion that can occur), and means that Blackbird can work through effectively an arbitrarily large task list.

Right now it's BYO-spec, but then it:

* breaks the spec down into a dependent-aware plan (DAG) composed of parent and child tasks

* executes tasks one at a time based on their status (ready to execute if all dependencies are marked as completed)

* allows you to (optionally) pause execution after each task to review, approve and continue, approve and quit, or reject the changes altogether

* (soon) treats parent tasks as an automated reviewer for all child tasks and optionally auto-resume those sessions to address the feedback

* and more

It's entirely bootstrapped, and so far I'm quite pleased with it. I also wrote a post[1] today about some of the concepts I had in mind as I was defining the architecture.

[1] https://jack.bonatak.is/blah/killer-context/