My bet is that the last item is what we’ll end up leaning on most heavily - it feels like the path of least resistance.
Throw in some simulated user interactions in a staging environment, with a bunch of agents acting like customers à la StrongDM, so you can catch bugs earlier.
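
Rough sketch of what I mean by agents acting like customers. To be clear, this is not StrongDM's actual setup, just a toy version under my own assumptions: `StagingCart` is a made-up stand-in for whatever the staging service is (with a bug planted on purpose), the "agents" are plain seeded random walkers rather than LLM-driven ones, and `check_invariants`, `run_customer` and `run_swarm` are hypothetical names.

```python
import random


class StagingCart:
    """Hypothetical stand-in for a staging service, with a planted bug."""

    def __init__(self):
        self.items = {}

    def add(self, sku, qty):
        self.items[sku] = self.items.get(sku, 0) + qty

    def remove(self, sku, qty):
        # Planted bug: nothing stops the quantity from going negative.
        self.items[sku] = self.items.get(sku, 0) - qty


def check_invariants(cart):
    """Return a list of human-readable problems; empty means the cart looks sane."""
    return [
        f"negative quantity for {sku}: {qty}"
        for sku, qty in cart.items.items()
        if qty < 0
    ]


def run_customer(seed, steps=20):
    """One simulated customer: random actions, invariants checked after each step."""
    rng = random.Random(seed)  # seeded so any failure can be replayed
    cart = StagingCart()
    for step in range(steps):
        action = rng.choice(["add", "remove"])
        sku = rng.choice(["A", "B"])
        qty = rng.randint(1, 3)
        getattr(cart, action)(sku, qty)
        problems = check_invariants(cart)
        if problems:
            return {"seed": seed, "step": step, "action": action, "problems": problems}
    return None  # this customer never hit a problem


def run_swarm(n_agents=10, steps=20):
    """Run several simulated customers and collect the failures they found."""
    reports = [run_customer(seed, steps) for seed in range(n_agents)]
    return [r for r in reports if r is not None]


if __name__ == "__main__":
    for report in run_swarm():
        print(report)
```

The seeds are the useful part: when a simulated customer trips an invariant, the report carries the seed, so you can replay that exact session instead of guessing what the agent did.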