logoalt Hacker News

chapsyesterday at 5:15 PM1 replyview on HN

"obviously wrong" is a never ending rabbit hole and you'll never, ever be satisfied because there will always be something "obviously wrong" with the data.

Messy data is a signal. You're wrong to omit signal.


Replies

GMoromisatoyesterday at 5:22 PM

100%. There is even signal in the pattern of errors. If you remove some errors but not others, you lose signal.