Carl Allen Profile picture
Sports & Poli Data. OH Small Biz Owner. Frmr MLB/NFL data guy. Bilingüe. Profitable v Props. Created new poll metrics #BlackLivesMatter. Threads: RealCarlAllen

Sep 12, 2020, 37 tweets

1/x The polls weren't wrong, you just read them wrong.

The spread in a poll matters some. But a 50-47 lead (+3) is FAR MORE ROBUST than a 46-40 lead (+6)

People see a poll avg at 46-40, and when the result is 47-49, they say "pOlL wAS WrONg"

From +6 to -2?!? Off by 8!!!

Nope.

2/x

If the polls were "off" by 3 or 4 but the result of the election didn't swing because of it, no one would care, except pollsters. We wouldn't be talking about it.

Here's the most important question: what do polls measure?

3/x

This seems like a silly question, but think about it. What do polls measure?

Polls measure preference and plans.

What do polls not measure?

How/if undecided voters will vote.

It seems that people are criticizing polling for not measuring things it doesn't attempt to.

4/x

This goes back to high school. Remember studying validity & reliability?

When we talk about validity, we ask: "does this study actually measure what we're using it to measure?"

In the case of polls: they don't measure preferences or tendencies of undecided voters.

5/x

Polls are intended to measure HOW MANY voters plan to vote for each candidate, and how many are undecided

That's it

For aspiring political and data scientists:

If you figure out a way to calculate or measure how/if undecided voters will vote, I have a blank check for you.

6/x

The problem with the polling wasn't a problem with the polling at all. Simply how people read them.

A 45-40 lead (+5) isn't simply a +5.

More undecideds = more uncertainty = more variability.

2016 had a historic number of undecided voters.

cnbc.com/2016/06/20/25-…

7/x

Clinton's leads in these swing states (+1, +3, even +6) have her polling no higher than 46.8%.

And while I'm not a fan of lumping declared "third party" voters with "undecided" voters, it doesn't take a mathematician to see the gap could feasibly be made up.

8/x

The assumption seemed, that the undecideds would break somewhat evenly, not impact Clinton's lead. Other than the openly and augury @FiveThirtyEight "Trump still has a chance, undecided voters could swing the election"

9/x

2016 introduced me to a term I hadn't heard before: double haters.

People who didn't like EITHER candidate.

These folks were mostly moderate or conservative, older, and low-middle income (and cared about jobs!)

These double haters broke for Trump by a whopping 20 points.

10/x

Now, hindsight bias and all, people say "duh, of course conservative undecided voters would favor Trump"

But that's not the point!

Those undecided voters ended up VOTING for Trump.

Many undecided voters who favored Hillary didn't vote.

washingtonpost.com/news/politics/…

11/x

Major differences, looking at 2020.

Double haters tend to be younger and liberal (disclosure: it me) with a notable uptick in black double haters (disclosure: it not me).

2020 has *far fewer* undecided voters and there's a case that those undecideds ACTUALLY FAVOR BIDEN

12/x

Trump won double haters by a 20 point margin in 2016.

As of April, Biden was winning these "double haters" BY 50 POINTS.

The "I like neither, but I prefer the jobs jobs jobs businessmam outsider to the dynastic (lady) neoliberal" edge seems gone.

bloomberg.com/news/articles/…

13/x

So back to the polls. Stop looking at the MARGIN and actually look at the NUMBERS.

Not all margins are equal. A 7 point margin is great but is 44/37 the same as 50/43? Of course not!

Think of it this way: the undecideds gotta go somewhere. AND POLLS ONLY CAPTURE DECIDEDS.

14/x

When people - analysts included - talk about "polling error" they refer to margin. This is not accurate. A candidate going from +4 in polls to -2 in the election (or vice versa) DOES NOT INSTANTLY INDICATE polling error, and it's wildly erroneous to call it a 6 point one.

15/x

Back to before - what do polls measure?

Preferences of decided voters, and HOW MANY undecided voters.

What do polls NOT measure?

Preferences of undecided voters.

Using polls (with margins that include undecided voters) versus results (which do not) is INTERNALLY INVALID

16/x

What you've implied, by comparing margins in polls to margins in results, is that a poll is an attempt to try and measure not only how decided voters apportion themselves, not only how many undecided voters there are but also the whims of those undecided voters.

17/x

No one would contend that a candidate polling with a 38-34 (+4 margin) lead with 28% undecided is less safe than a candidate with a 48-44 (+4 margin) lead and only 8% undecided.

If you agree that all margins are not the same, you can't use margins to measure poll validity.

18/x

Now for some data.

Here are the 2016 pre-Election poll data, including third party and undecided

@DrewLinzer noted the historic number of undecided voters.

Think of it this way:

Undecideds have to go somewhere!

Not predicting where they go isn't a weakness of polling.

19/x

Here is the overlay of polling results to actual results, with how undecided voters aligned.

Note that third partys lost a small but not insignificant amount in the polls compared to the election.

Nowhere did polls overstate either of the major party candidates support.

20/x

I'll preface this by saying that there's 2 months until the election and things change.

HOWEVER

No candidate underperformed their poll number in 2016.

No candidate underperformed their poll average in 2016.

That yellow undecided bar can only make up for so much.

21/x

No state better illustrates what I mean about margins vs raw numbers like Wisconsin

@RealClearNews had Hillary's margin +6.5 there in 2016, and they have Biden's margin there now

The difference? In 2016 the poll was 46.8-40.3

In 2020 the poll is 49.8-43.3

Not the same!

FAR FEWER undecideds this year

Could make a case (as I briefly touched on earlier) that undecideds could actually break IN BIDEN'S FAVOR this year

But even if we assume undecideds again overwhelmingly break R:

OH, FL, NC close
Biden comfortably wins MI, PA, WI, AZ, MN for 270+

Fixed Trump-leaning forecast.

Earlier version had even distribution of undecideds.

This one distributes undecideds proportionally to 2016. In most swing states, that's 60%-90% Trump, ~ best case scenario for him.

In 2016, that was enough to swing election. Not so in 2020.

Why does this matter for 2020?

Because Hillary's leads in the polls were FAR MORE PRECARIOUS than Biden's, despite comparable margins.

People who look only at margins are missing the point. They're reading the polls wrong.

Consider: PA, WI, OH, NC, MN.

They look comparable!

They look comparable if you only look at margins. But this ignores BIGGEST VARIABLE OF 2016:

Undecided voters. Historically high!

Those undecideds have to go somewhere

When people say "polls were off" it implies that it overestimated support. Nope!

Undecideds ~ half in 2020.

For FL, PA, WI, OH, NC, and MN, Biden polling margin compared to Hillary's:

+1.4, +1.1, +0.6, +0.7, +0, +0 respectively

So to say he's only slightly outperforming Hillary here (including 4 of the most important, which Hillary lost in 16) feels like a valid analysis

It's not!

Despite their vote margins being comparable, their VOTE SHARE is not.

For those same states, FL, PA, WI, OH, NC, MN, Biden's vote share compared to Hillary's in the polls is:

+2.9, +3.5, +3.6, +3.9, +2.5, +5.1

This is enormous! Biden's "base" of support is MUCH higher.

For obvious reasons, the closer a candidate gets to 50%, the more insurmountable their lead becomes.

So, to hammer home the point, a +6 lead of 46-40 is very much NOT the same as a 49-43 one. Stop treating them as the same.

2016 had a lot of undecideds and most went for Trump.

And perhaps an even more important point, in no swing state (where polling is always most robust) did the polls OVERESTIMATE a major party candidate's vote share. None!

What does this mean for 2020?

Biden is polling closer to 50% (even at or above 50%) in many swing states!

Look at MI, PA, WI, AZ, NH, ME, MN.

Undecided voters matter a lot less when "decideds" alone get you to an insurmountable margin!

That is, Trump can't simply sway undecideds and overcome his deficit.

Biden is so close to 50% (i.e. insurmountable) that undecideds, of which there are far fewer, simply couldn't swing the election even w Trump-heavy swing

Biden's worst case scenario currently appears to be a 294-244 victory

His PA, WI, MI, MN, NH, ME, AZ vote share is too high

This is to say nothing of the fact that:

1) Florida actually looks slightly favorable for Biden, even with Trump's 2016 heavy undecided vote proportion, and

2) NC looks like it will be the closest state, with a dead tie if you give Trump his 2016 undecided proportion

That being said, I think there's strong evidence to suggest that undecided voters will FAVOR BIDEN in 2020.

In 2016, voters who didn't like either candidate (who would certainly qualify of undecideds) were overwhelmingly conservative, moderate, and older.

Total opposite in 2020

Would I have predicted Trump would have won *that high* of a proportion of undecided voters? No - but I certainly would have predicted that he'd win a lot MORE undecideds that Hillary.

So, here's my hot take for #2020Election: I think undecided voters will favor Biden.

And to be clear, I don't think undecided voters need to favor Biden for Biden to win. I think the number of undecideds is small enough this year that in order for Biden to lose, it's not enough for Trump to dominate undecideds: Biden would have to actually lose decideds. Tough.

So, in the case undecideds favor Biden to Trump - which I think there's a strong case for - we get something like this.

MI and WI flip like the Ohio of 2016 (from lean one candidate to 8 points other candidate).

Ohio and NC narrowly go to blue. IA, GA, TX too close to call.

This isn't a "prediction" as much as a very possible map.

Biden wins comfortably and IA, TX, GA have margins as small as MI, WI, PA did in 2016.

Share this Scrolly Tale with your friends.

A Scrolly Tale is a new way to read Twitter threads with a more visually immersive experience.
Discover more beautiful Scrolly Tales like this.

Keep scrolling