logoalt Hacker News

lionkoryesterday at 7:18 AM5 repliesview on HN

deepseek has no part of their privacy policy on their API about training. They are 100% training on every single word you give it.

If your customers are fine with that, your IP is not interesting, then you can use it.


Replies

radicalityyesterday at 7:43 AM

Though with open models you have a lot of choice where to get it from. I see like ~15 providers here with various logging/ZDR policies, so pick whatever mix of price to features you want:

https://openrouter.ai/deepseek/deepseek-v4-flash

nextaccounticyesterday at 10:18 AM

I don't believe a single word from AI companies, no matter where they are from. Sourcing their training data is run like genuine criminal enterprises - last year Anthropic settled for 1.5 billion, and and if they settled so quickly it might mean what we would see in court is even worse.

anon373839yesterday at 10:14 AM

You don’t have to access Deepseek through Deepseek. You can self-host it and your data never leaves your premises.

kzrdudeyesterday at 7:28 PM

You can use deepseek through opencode, which says its providers have a no-retention policy.

show 1 reply
wolttamyesterday at 1:56 PM

I self-host Flash actually, but yeah.

When I use their API I use it knowing that they probably train on the data, and knowing that it's probably used to improve future iterations of their models.

But I use their API extremely rarely lately, because local Flash is good enough for me the vast majority of the time

show 1 reply