Discussion about this post

User's avatar
David Muncier's avatar

Thanks for the response Vote Integrity,

I'm a data scientist and absolutely agree that data cleaning is a big part fo the job and is not on it's own suspicious. But data cleaning and the decisions to apply filters to search for final are subjective. Remove some data noise, but not all of it, then calling likely remnants noise a signal of fraud, is the mark of a biased investigation. It's pretty clear that you were selective and subjective in your data cleaning and filtering to find the "pattern mismatches" you claim.

I'm going to continue to respectfully question your work and intentions, because I'm seeing far larger anomalies that you should have been noting if your goal was really to find problematic voting patterns. My hypothesis is that you "cleaned" many of the messy updates, choosing to keep and focus on a few smaller remnant "dirty" points in the data in a few contested states to identify your pattern. More as I peel back the data in my spare time...

Expand full comment
Vote Integrity's avatar

(Copied from your comment on https://votepatternanalysis.substack.com/p/voting-anomalies-2020)

Hi David,

Regarding the vote updates which have the totals dropping to zero, we removed those *before* computing successive vote differences. These are almost certainly either data errors themselves or reflect a bizarre way of issuing corrections. Data-cleaning is a very common and normal part of data analysis and shouldn't be regarded as suspicious. The appendix linked to in this report provides the entirety of the code and data used for this report and you can see exactly what we did if you look there.

On your second point, there are of course other ways to look at the data -- infinitely many of them, in fact. We chose to do it this way because we thought it was the best formalization of a "vote spike" which many online had noticed and which were the motivation for this investigation (and report). While measuring the significance of any particular vote update to the final outcome is interesting, it wasn't the goal here.

Best,

Vote Integrity

One more thing worth noting here is that there has been a great deal of suspicion, and understandably so, around vote updates which involve vote counts for any candidate dropping. Assessing the legitimacy of these is inherently difficult with a data set which is only state-level and was out of scope for our investigation, but we wish you the best in your work.

Expand full comment
3 more comments...

No posts