logoalt Hacker News

thomasahletoday at 12:09 AM1 replyview on HN

> Ban it from the dataset, add it to the analysis. You can choose your own flavor of noise.

Not sure exactly what you're proposing, but if the noise is added independently to different people, you can just buy multiple copies to reduce it.

There are a lot of ways to do this wrong, which is why so much analysis has gone into differential privacy.


Replies

jmoletoday at 4:01 AM

Sorry, I think you're reading more into this than I intended to say. My point was that the raw data itself doesn't need noise, but the published data necessarily does.