Is there a reason why this has to be done at training time? Could the system prompt tell the model to convert the output to a different format?