I've been doing the same: take papers, define a high-level goal, then let it iterate. I have access to DGX boxes, and it was great watching the model rewrite stuff to take NVLink into account once it discovered it :-)