False positives can be eliminated mechanistically by testing if they actually work, in a sufficiently isolated automated test apparatus.
The hard thing is reducing detected crashes to well-formulated test cases that help rather than hinder maintainers.