Really? Could you share the techniques that get you there?
Inspired by https://github.com/scosman/cursed_browser, I have a little art project going where the CPU of a virtual machine is entirely LLM-powered. But even though the ISA is well known and clearly in the LLM's training data (it answers questions about it mostly fine), I can rarely get it to decode even a handful of instructions in a row correctly. E.g. it'll do 10 instructions right (even execute them right!), then all of a sudden lose the ability to do bit manipulation and fail miserably at even decoding the 11th. If I try to help it along, it'll apologize profusely, get it wrong in five novel ways, and then gaslight me by saying I'm in fact the one who's mistaken.