deepseek has no part of their privacy policy on their API about training. They are 100% training on every single word you give it.
If your customers are fine with that, your IP is not interesting, then you can use it.
I don't believe a single word from AI companies, no matter where they are from. Sourcing their training data is run like genuine criminal enterprises - last year Anthropic settled for 1.5 billion, and and if they settled so quickly it might mean what we would see in court is even worse.
You don’t have to access Deepseek through Deepseek. You can self-host it and your data never leaves your premises.
You can use deepseek through opencode, which says its providers have a no-retention policy.
I self-host Flash actually, but yeah.
When I use their API I use it knowing that they probably train on the data, and knowing that it's probably used to improve future iterations of their models.
But I use their API extremely rarely lately, because local Flash is good enough for me the vast majority of the time
Though with open models you have a lot of choice where to get it from. I see like ~15 providers here with various logging/ZDR policies, so pick whatever mix of price to features you want:
https://openrouter.ai/deepseek/deepseek-v4-flash