> The approach was the same as Cloudflare’s vinext rewrite: port the official jsonata-js test ...

crazygringo • yesterday at 11:53 PM • 2 replies • view on HN

> The approach was the same as Cloudflare’s vinext rewrite: port the official jsonata-js test suite to Go, then implement the evaluator until every test passes.

This makes me wonder, for reimplementation projects like this that aren't lucky enough to have super-extensive test suites, how good are LLM's at taking existing code bases and writing tests for every single piece of logic, every code path? So that you can then do a "cleanish-room" reimplementation in a different language (or even same language) using these tests?

Obviously the easy part is getting the LLM's to write lots of tests, which is then trivial to iterate until they all pass on the original code. The hard parts are how to verify that the tests cover all possible code paths and edge cases, and how to reliably trigger certain internal code paths.

Replies

jng • today at 12:12 AM

I've found Claude Code with Opus 4.5+ to be excellent at generating test cases that exercise the different features, and even push into the edge cases. You sometimes need to nudge it into generating more convoluted cases when necessary, but then it is just nudging. I now routinely generate more LOCs of test cases than actual core code, while I used to only write very limited test cases just for the most complex areas amenable to automated testing.

I've been successful at using Claude Code this way:

1. get it to generate code for complex data structures in a separate library project

2. use the code inside a complex existing project (no LLM here)

3. then find a bug in the project, with some fuzzy clues as to causes

4. tell CC about the bug and ask it to generate intensive test cases in the direction of the fuzzy clues

5. get the test cases to reproduce the bug and then CC to fix it by itself

6. take the new code back to the full project and see the issue fixed

All this using C++. I've been a pretty intensive developer for ~35 years. I've done this kind of thing by hand a million times, not any more. We really live in the future now.

faangguyindia • today at 10:00 AM

Imagine how many tests were frauded and fake passed by claude on this project

alt Hacker News

Replies