the debate over whose data this is, misses a practical point for builders. If one run services that handles document, the only way to make AI training go out of context is to design architecture in such a way which make data impossible for to AI access the data. If a server can read even a single byte then privacy is just a myth.
Even i have been exploring client side only processing document workflow. WASM in browser with Zero server contact and then it changes conversation from trust our terms ot literally no one can access it