Limitations of High-Level Analyses Part 2 : Some out there are taking Dr Shiva's analysis and replicating it for Democratic votes to compare and see if the pattern is different. From a couple I've seen, there is a similar downward linear pattern that's less steep.
2. This has led some to conclude that this is in fact a natural pattern that has naturally occurred as the result of the local the electorate, and therefore, there's nothing untoward about the pattern. If it's happening for both Ds and Rs, it can't be a problem. Right? No!
3. When you see the same downward linear slope, it doesn't mean the local electorate is uniquely voting in those areas. It means that manipulation is occurring on both sides. The different in the steepness of the pattern allows to understand the severity of the manipulation.
4. Or said another way, the difference in steepness for Rs and Ds tells us the net / overall effect and direction of the manipulation. It doesn't mean that manipulation is not occurring. As I said in my last thread, these are the tabulated results of the ballots.
5. Well, how do we know for sure manipulation is occurring in both directions. Let's compare two different districts. Last time it was VA-1 and VA-2. Let's look at VA-5 and VA-6 this time. The shifting in VA-5 is slightly in favor of Trump. VA-6 is heavily in favor of Biden. Why?
6. I believe it's to create similar patterns in both directions that will undermine a high-level analysis that compares the two. The is effective when analysts do not have access to batch level detail. Because overall, we see a mixture of results from various tabulation machines.
7. It is not reasonable to assume that all machines would incorrectly tabulate in the exact same way across a state. Even within a district, it's unreasonable to assume that to be the case. Same for a precinct. In my local precinct, there were 3 on election day.
8. This is why high level analysis is only good at showing that manipulation occurs, but it's very limited in its ability to measure the impact. This is why Dr. Shiva says a minimum of X votes were shifted. He doesn't have the granular data (machine / batch) details to make a
9. a more accurate assessment / determination. He receives criticism for this, but it's the only alternative when the data is an amalgamation of manipulated and non-manipulated batches of ballots. His estimate is imperfect to be sure, but it's the only method widely applicable!
10. We need to make something clear about the high-level voting results. It's limited in determining the impact of the manipulation, and it is also limited in it's usefulness as a representation of how the electorate votes. The manipulation belies the critics of Dr. Shiva.
11. What is ultimately needed is public data for each tabulator and the results for each batch it produced. This will tell us where manipulation occurs, when batch after batch produces the same skewed pattern. We will know which machines are affected.
Limitations of High-level Analyses : After the Dr Shiva presentation, there has been a lot of effort dedicated to to understanding the pattern he showed in his graph. Many tweets speculating about the reason such a pattern might be true / reasonable.
2. Some have compared the graphs across elections (2016 and 2020) to understand how the trends compare and what the differences may mean in terms of the voters. While these analyses are interesting from an academic perspective, we should ask what is actually being compared?
3. Let's take a step back from voting results analyses, and consider what these results are and what they represent. Using a random county as an example, an analysis of this county's precincts will show us the outlines / broad pattern of the votes tabulated for the county.
Deep Dive Explanation of Approach / Analysis : I want to provide the context of the work I am doing, and the reasoning behind it. I started this with a question : Why did R's do so well down ballot, and not at the top? In order to answer this question, I needed a suitable dataset
2. The ideal dataset would allow us to see, for each batch of ballots, the landscape of the votes (e.g. which candidates received the votes, and by what proportions). And not just any batch would suffice to perform this analysis.
3. Batches of ballots needed to be large (e.g. many ballots), and must include votes for Prez, Senate, and House Rep for both Dems and Reps (3 races x 2 parties). The batch is a self contained example of voting behavior, that would be random within a district.
1. Change Log Update : Batch Analysis. As far as I know, there isn't a data source available of how batches of ballots voted. We see the batches in the Edison data feed, and can see their impact on the Dem / Rep %s. But that's for the President. What about down ballot? Let's see!
2. Seeing how Dem & Rep candidates performed down ballot compared to the Prez is a powerful insight that can tell us how more about what these batches contain. @va_shiva had a presentation about weighted race voting, and while I'm not familiar, the pattern I'm seeing might
3. be what his analysis showed in MI. Now these batches are in VA. I've focused on CD-1, CD-10, and CD-11. The other criteria was that the batch of votes had to be >1000 for both Biden and Trump. The big batches are more meaningful in terms of trends.
1. Change Log Update : I've spent some time going through the data to get a better understanding of the relationships between the rows, and how to identify batches of votes that were loaded into the system. I focused on the largest batches in CD-1, CD-10, and CD-11.
2. I was particularly focused on batches that had both Dem and Rep candidates for Prez, Senate, and House. The purpose was to compare the votes across the ballots. Expecting to see Prez with the largest numbers, and the Senate / House being lower numbers.
3. Well, this is true for the Dem candidates, but not for Rep candidates. I haven't manually checked all the batches, but this tends to hold true for smaller batches (less than 50K). Here is an example of what I mean for this unusual trend in the votes of these batches.
Found another file on VA website. This one is a "Change Control" log for votes in the system. I've only begun to explore this, but there's some interesting activity for VA-7 for Spanberger. On Oct 30th, a preload change was initiated that would expire at 11/4 @ 4:13AM
At 4:13AM, 66,498 votes are assigned to her with an expiration of 11/5 @ 11:26 AM. When this comes around, the total is adjusted to 63,687. Reason given is "Tabulation Error in Precinct". The last change has no expiration date. These changes affected Chesterfield County.
This lines up with the raw vote total for Spanberger in Chesterfield County. There's also a change records to assign Rashid (VA-1) in Stafford county. See both pics attached :
@WontMarch4Soros@bedivere_knight@ColdPotatoSpud After thinking about the data / analyses I've been doing on the raw data vs website and other reports, I believe I have an explanation for why Rashid has so many more votes than Wittman. This will be a long explanation :
1. This explanation may bounce around a bit, but touches on different aspects of the pics I've posted across different threads. Ask questions if things don't seem to follow, since they are related.
If we think about how elections work, different precincts will have diff ballots
2. The ballots are different because downstream the (house reps) are different in the various districts. There are only two races that will be on every ballot across every district precinct. Senate and President. This is important.