I would be interested to see how exactly the agent helped. How was it used, where did it lead to the given improvement and in how far would it have taken a human to come to the same solution.
The blog post has many links to papers and preprints discussing this exact question.
seems like `karpathy/autoresearch` on steroids
The blog post has many links to papers and preprints discussing this exact question.