These models are getting crazy good at examining things like core dumps and disassembly. I've been using an agent to write compiler logic, and its amazing the kind results you can get by having the agent examine the raw binary outputs. I would not be surprised to see agents excel at identifying and labeling patterns for decompilation.
Really? Could you share your techniques that get you there?
Inspired by https://github.com/scosman/cursed_browser, I have a little art project going where the CPU of a virtual machine is entirely LLM-powered. But even though the ISA is well known and clearly in the LLM's training data (it answers question about it mostly fine), I can rarely get it to even decode a handful of instructions in a row correctly. It'll e.g. do 10 instructions right (even execute right!), then just lose the ability to do bit manipulation all of a sudden and fail miserably at even decoding the 11th. If I try to help it along it'll apologize profusely, do it wrong in five novel ways, before it gaslights me saying I'm in fact mistaken.