Hello! I’m one of the three engineers who write this piece. Happy to answer questions.
Fantastic job!
Can you share what type of project that was? On the spectrum from a database engine to cat picture sharing web site (very high demand for correctness vs very lax).
Very cool article!
- are other teams adopting this approach? What’s the blockers if not?
- have there been problems where the models alone were not enough to debug and the devs had to fix it themselves?
- as the rate of changes has increased with more devs how have you dealt with concurrent writers with merge conflicts?
- if there was anything you could change in the approach you started with, what would it be?
Have you been satisfied with the quality of code generated by the model? Or did you have to tweak some rule file or skill to improve it? Or is human-readable code not even a goal at this point?
Were those em dashes you, or GPT
Interesting write up!
Have you been able to extract libraries or tools from this project yet? If so how was that experience?
That is, do you see yourself releasing a metric harness, or sub-projects that are equivalent of ActiveRecord, zod, or similar open source tooling that frequently originate in a large in-house project - and then is exported out as a stand-alone toll, utility, library or framework?
Because while ai can reimplement minor tools, it's utility entirely depends on the existence of solid tools, libraries and frameworks.