They are utterly trivial tools, and most of them are over-engineered to the point of absurdity (that was the most embarrassing thing about the Claude Code source map leak, imho). That a for loop concatenating LLM output from POST calls into an internal buffer takes more code than the software doing the actual training or inference should tell them something. Hermes is the only one that tried to do something novel (low bar), and at least when I looked at it, it hadn't succumbed to the vibes yet.