Does this actually relate to the code quality being observed by the agent? The readme isn't very clear on that IMO. I have some projects I'd love to try this out on, but only if I get an accurate representation of the LLM's suffering.
I'm very open to suggestions, but currently it's a very simple scan of the code. Check the Python scripts.
You could have the actual output of the agent turned into TTS using the model of your choice with TalkiTo… or listen to whatever weird sounds this makes. Seems like this is copying that viral Mac moan app. 2026 is weird.
https://github.com/AndrewVos/endless-toil/blob/main/plugins/...
So it is left up to the agent to decide.