What if a tool call produces an error and then a retry? Is the retry loop now part of the log?
Flaky tools and unreliable systems aren't that different. "What if my database crashes while recovering? What if it crashes while recovering from a crash-while-recovering?" That's when you pull out the database playbook: ARIES-style recovery, with its UNDO / REDO logging.
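One way to picture "yes, the retry loop is part of the log" is a rough sketch like the one below. It is not ARIES, only a toy in the same spirit: every attempt is appended to a log, and a finished operation is replayed from the log rather than re-run, so recovery can itself be interrupted and restarted. The names `Log`, `call_with_retries`, `flaky_tool` and the record fields are all made up for illustration, and the in-memory list stands in for a durable append-only log.

```python
from typing import Any, Callable

# Hypothetical in-memory stand-in for a durable, append-only log.
Log = list[dict[str, Any]]


def call_with_retries(
    log: Log, op_id: str, tool: Callable[[], Any], max_attempts: int = 3
) -> Any:
    """Run `tool`, appending one record per attempt so retries live in the log."""
    # Redo from the log: a completed operation is never executed again, so
    # calling this after a crash (or a crash during recovery) is harmless.
    for rec in log:
        if rec["op_id"] == op_id and rec["status"] == "ok":
            return rec["result"]

    # Continue attempt numbering from whatever the log already holds.
    prior = sum(1 for rec in log if rec["op_id"] == op_id)
    for attempt in range(prior + 1, prior + max_attempts + 1):
        try:
            result = tool()
        except Exception as exc:
            log.append(
                {"op_id": op_id, "attempt": attempt, "status": "error", "error": repr(exc)}
            )
            continue
        log.append(
            {"op_id": op_id, "attempt": attempt, "status": "ok", "result": result}
        )
        return result
    raise RuntimeError(f"{op_id} failed after {max_attempts} attempts")


# Usage: a tool that fails once, then succeeds.
calls = {"n": 0}


def flaky_tool() -> str:
    calls["n"] += 1
    if calls["n"] == 1:
        raise TimeoutError("transient failure")
    return "payload"


log: Log = []
first = call_with_retries(log, "fetch-1", flaky_tool)  # one error, then ok
again = call_with_retries(log, "fetch-1", flaky_tool)  # replayed, tool not called
```

The sketch leaves one gap open: if the process dies after `tool()` returns but before the record is appended, the log has no trace of the call. Closing that gap needs either an idempotent tool or a record written before the call, which is roughly where write-ahead logging comes in.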