ONNX is horrible for anything that has variable input shapes and that is why nobody uses it for LLMs. It fundamentally is poorly designed for anything that doesn't take a fixed size image.
ANE itself is also limited to fixed computation "shapes" so I'm not sure how much that would matter practically.
ANE itself is also limited to fixed computation "shapes" so I'm not sure how much that would matter practically.