Mike Hoerger, PhD MSCR MBA Profile picture
Aug 12, 2024 8 tweets 9 min read Read on X
BREAKING: Version 2.0 of the PMC COVID-19 Forecasting Model, August 12, 2024
🧵1/7

The U.S. now tops 1.3 million daily infections. 2.8% of the population (1 in 36) are actively infectious.



Deep Dive on Version 2.0 of the Model...

Welcome to version 2.0 of the PMC Model. The “C” in PMC is for Collaborative, and the work to improve this model is grounded in feedback from readers like you over the past year. Thank you for your support.

What’s New?

In short, the new model has substantial data quality improvements by combining multiple data sources for estimating transmission in unique ways that will hopefully increase forecasting accuracy, provide a truer representation of what has happened and is happening during the pandemic, and linkages to some statistics you will find helpful in day-to-day decision making.

Here is a deeper dive into the changes (skip to next section if desired). The new model is designed to provide a “true” picture of what has happened during the pandemic. It integrates three main data sources: the IHME true case estimation model, Biobot SARS-CoV-2 wastewater surveillance data, and the current CDC NWSS SARS-CoV-2 wastewater data. IHME provided a comprehensive case estimation model through April 1, 2023. Biobot was the CDC wastewater subcontractor through last fall and continues to do extensive non-CDC wastewater work. The CDC NWSS data are currently subcontracted with Verily, a subsidiary of Alphabet, which is the parent company of Google. Over the past year, we have seen Biobot scale back their public data and visualizations, and Verily has made steady improvements in their work with the CDC.

We previously relied solely on Biobot for forecasting and a Biobot-IHME data linkage for case estimation. It was a Biobot-heavy model. The current model is not tied strictly to any data set, but rather the PMC’s best estimate of the truth, a true-case model that uses multiple data sources in the spirit of IHME’s original work in this area. Essentially, we link all three data sources, which have been active over different points of the pandemic to derive a composite “PMC” indicator of true levels of transmission. The indicator is weighted based on which data sources were available and their perceived quality at each point in time. We scale this composite PMC indicator to the metric the CDC uses when helpful for comparisons with their website, and scale it with the true case estimates of the IHME otherwise, as true cases are more relevant than arbitrary wastewater metrics.

A great feature of the model is that it continues to integrate real-time data from Biobot and the CDC. From the perspective of Classical Test Theory, this is a huge advantage, as it provides a much more reliable indicator of what is currently happening with transmission. Both sources often make retroactive corrections for the most recent week’s data, sometimes sizable, and pitting the two indicators against one another reduces measurement error on average, which offers vital improvements in forecasting.

What are the Biggest Improvements in the Model?

· Accuracy in Real-Time Data – In integrating two active surveillance data sources, the real-time data will be more accurate. The biggest predictor of next week’s transmission levels, and the shape of how transmission is increasing or decreasing, accelerating or decelerating, is the current week’s real-time data. If the real-time data are off by 5% or 10%, the big-picture take on the forecast will still be reasonable, but a more precise estimate allows for greater accuracy in estimating the height and timing of waves.
· Regional Statistics – We are already integrating some regional data. Like you, we miss the vast and high-quality regional data and visualizations Biobot provided. We are hoping to take back some of those advantages through the new model and will improve them over time.
· Credibility – Although Biobot and CDC have unique strengths and limitations, a clear strength of adding the current CDC data set is that many people prefer to defer to the credibility of the CDC. The PMC model can be characterized fairly as a “CDC-derived case estimation and forecasting model,” which should lend more credence with those who are not deep enough in the weeds to evaluate the data as critically and prefer appeals to authority. We also provide some statistics that will allow you to draw more useful inferences from the CDC website.

What’s the Same in the Current Model?

The analytic assumptions underlying the forecasting model remain the same. It uses regression-based techniques common across all industries, using a combination of historic data (median levels of transmission for each day of the year) and emerging data from the past four weeks to characterize how transmission is growing or shrinking. Holidays and routine patterns of behavior that map on well to a calendar are “baked in” to the historic data. “New variants” and atypical patterns of behavior are baked into the data on recent patterns of transmission. It’s a top-down big picture model.

What are the Biggest Drawbacks of the New Model?

· Disruptions in Longitudinal Comparisons – You will notice some inconsistencies between the current and prior model that use additional data to form more accurate estimates, which is sometimes frustrating. A few examples. In the early pandemic, we estimated cases linking Biobot to IHME case estimates. Biobot transmission estimates were a bit “hotter” than others during that time period, the IHME estimates “cooler.” Our composite model depicts each of the first 4 waves somewhat smaller, which we believe provides a better picture of the “truth” as we can estimate it, but it is annoying psychologically to re-envision what has happened. This also throws off some of the big-picture statistics; for example, as of August 12, 2024, we estimate that Americans have had about 3.3 infections on average. A few months ago, we estimated nearly 3.5, so this is consistent with “cooler” picture of early-pandemic transmission. Presently, the CDC transmission estimates are running much hotter than those of Biobot, leading to estimates of a larger and earlier peak in the present wave. We would have preferred the CDC re-up with Biobot at the potential contract renewal to promote continuity in the data, but these sorts of changes in model estimation are the expected consequences of such a transition.

· Constantly-updating Historical Data – The CDC updates all of their historical estimates of transmission frequently, any time a new site comes on board, and twice annually to standardize the data longitudinally. This can sometimes create weird issues, where transmission is going up, but real-time values are lower than what was reported in real time the prior week because recent data were corrected downward. It will also throw off some of the helpful statistics we provide. These are minor nuisances, but be aware of them in case you spot something that seems strange.
· Documentation of Accuracy – We have excellent data on the accuracy of the prior model and will submit a report for publication shortly. All prior reports are publicly available. Many report quick facts on longitudinal accuracy, international comparisons, use in news articles, and references to use in peer-reviewed scientific journal articles. We cannot document the real-time accuracy of the new model yet, but know that when using historical data, the model accounts for 98% of the variability in wastewater transmission 1-week into the future, which is 2% higher than our prior model. The vast majority of forecasting errors have been and will continue to be based on inaccuracies in the real-time data wastewater surveillance companies report, and the model changes reduce those issues. We hope you will trust our history and that the methodologic changes represent improvements.

What Improvements Should We Expect in the Future?

There are many improvements we hope to roll out in the future. These include changes based on your feedback, the addition of confidence intervals in some of the graphs, and regional forecasting models. We may incorporate additional data sets if they can improve real-time estimates of current transmission.pmc19.com/data/Figure showing transmission throughout the entire pandemic, shows we are entering a 9th wave and provides CDC levels as well as the PMC estimate of daily infections, now at 3.3 million.
🧵2/7

Our graph of year-over-year transmission shows we have likely never had such high COVID transmission in mid-August.

Many classrooms will have a >50% chance someone is infectious. Expect K-12 schools and universities to be hotbeds for COVID outbreaks unless they are using serious multilayered mitigation.

🔹Indoor air quality that meets ASHRAE Standard 241 (if they have never heard of this or cannot explain how they are meeting the standard, they likely are not meeting the standard).
🔹Surveillance testing.
🔹Free on-demand testing.
🔹Universal masking.

This is uncharted territory in terms of such low mitigation coupled with high transmission with school starting. The possibility of a slightly larger wave than what we forecast remains.Figure showing year-over-year Covid transmission. We are presently just below the largest peak for a summer all-time, and we have never seen transmission this high in mid-August.
🧵3/7

Let's zoom in on the current wave. We're at our highest level of transmission since the winter surge, with 1.3 million daily infections.

Note, our model now combines Biobot and CDC data. Biobot still has the peak coming in early Sept, and so did the CDC until a huge spike this week.

By including two data sources, it helps counterbalance against errors in their real-time reporting, but we could still see some volatility in the size and date of the peak at this point.

Of course, different locations peak at different times.

You'll note that Aug 12 appears in the "forecasted" zone. That's because even wastewater data experience lags in reporting.Figure shows the most recent year of transmission. We're presently at 1.3 million daily infections, and likely closing in on the peak of this summer's wave.   Graph says "past 12 months," but it really shows the past 11 and includes a 1-month forecast. I'll fix that for next week.
🧵4/7
Here are some precise statistics on the current state of the pandemic in the U.S.

We are experiencing higher transmission than during 91% of the pandemic. 1 in 36 infectious. >1.3 million daily infections, nearly 10 million weekly infections, >400,000 resulting weekly Long COVID cases.

In a classroom of 25-30 students, there's over a 50% chance someone would be infectious.Current Levels for Aug 12, 2024 % of the Population Infectious 2.8% (1 in 36) New Daily Infections 1,336,000  New Weekly Infections 9,352,000  Resulting Weekly Long COVID Cases 468,000 to 1,870,000  Monthly Forecast Average % of the Population Infectious 2.7% (1 in 37) Average New Daily Infections 1,295,900 New Infections During the Next Month 38,877,000 Resulting Monthly Long COVID Cases 1,944,000 to 7,775,000  Running Totals Infections Nationwide in 2024 164,431,000 Average Number of Infections Per Person All-Time, U.S. 3.27  How Does Risk Increase with More Social Contacts? Number of Peo...
🧵5/7 CDC Covid-19 heat map  Highest transmission: West coast, Utah, Wyoming, Minnesota, Oklahoma, Texas, Louisiana, Arkansas, Florida, Carolinas
🧵6/7
Out West, about 1 in 24 people are infectious with COVID. The South is close behind.

On pgs 11-12 of the report, I walk you through an example of how to make rough estimates. In Louisiana, about 1 in 26 (or 3.9%) are infectious today.

pmc19.com/data/PMC_COVID…
A figure shows the CDC graph of regional variation in transmission.  A supplemental table states: Estimated Percentage Actively Infectious 	National	2.8% (1 in 36) 	Northeast	1.6% (1 in 63) 	Midwest	2.2% (1 in 45) 	South	3.7% (1 in 27) 	West	4.2% (1 in 24)  Some supplemental statistics are provided, best summarized in the linked report.
🧵7/7
Here's the full PMC COVID-19 Dashboard for Aug 12, 2024

Please share and adapt any of the images for use across other platforms and websites.
#MaskUp

pmc19.com/data/
The dashboard has tables and figures spread out across six sections. Each section is summarized in the Alt text in the prior 6 tweets.   Apologies for typos. I see in one place, I noted in Alt text 3.3 million daily infections, and it should be 1.3, as noted repeatedly elsewhere. Try to do better next time!
Thanks to those who have chatted about data and modeling recently - @MoriartyLab @jlerollblues @AnciraBecky @amethystarlight

Thanks @luckytran for your aesthetic Covid graphics during the winter surge. They inspired me trying to make the main graph nicer.

Thanks all!

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Mike Hoerger, PhD MSCR MBA

Mike Hoerger, PhD MSCR MBA Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @michael_hoerger

Sep 16
PMC COVlD Report, Sep 15, 2025 (U.S.)
🧵1/7

COVlD-19 levels are "Very High" or "High" in the majority of states, per the CDC.

This includes 27 states & D.C.

🔥🔥Very High:
Alaska, Hawai'i, California, Nevada, Idaho, Utah, S. Dakota, Nebraska, Texas, Louisiana, Indiana, Kentucky, Tennessee, Florida, S. Carolina, N. Carolina, D.C., Maryland, and Connecticut.

🔥High:
Washington state, Oregon, Montana, probably N. Dakota (imputed), Arkansas, Alabama, Virginia, Delaware, Rhode Island, and Massachusetts.

PMC estimates 1 in 38 people (2.7%) are actively infectious. Wastewater-derived case estimates suggest 1.3 million new daily infections.Heat map using CDC levels. Key findings summarized in the post.
PMC COVlD Report, Sep 15, 2025 (U.S.)
🧵2/7

Transmission is peaking nationally, but regional variation is common. Know what's happening in your state, and get the word out.

Note that the levels CDC calls "low" are still quite alarming.State | CDC Level | PMC Estimate, % Actively Infectious Alabama	High	1 in 30 (3.4%) Alaska	Very High	1 in 22 (4.6%) Arizona	Moderate	1 in 40 (2.5%) Arkansas	High	1 in 28 (3.6%) California	Very High	1 in 21 (4.8%) Colorado	Moderate	1 in 50 (2.0%) Connecticut	Very High	1 in 19 (5.3%) Delaware	High	1 in 33 (3.0%) District of Columbia	Very High	1 in 17 (6.0%) Florida	Very High	1 in 24 (4.2%) Georgia	Moderate	1 in 46 (2.2%) Guam	Low	1 in 64 (1.6%) Hawaii	Very High	1 in 26 (3.8%) Idaho	Very High	1 in 14 (7.3%) Illinois	Moderate	1 in 44 (2.3%) Indiana	Very High	1 in 16 (6.1%) Iowa	Moderate	1 in 40...
PMC COVlD Report, Sep 15, 2025 (U.S.)
🧵3/7

Note that transmission is increasingly spreading from the South & West toward other areas. Know your state-level risk.

Transmission remains alarming even in areas CDC labels "Very Low" (e.g., Missouri, estimated 1 in 109). State | CDC Level | PMC Estimate, % Actively Infectious Missouri	Very Low	1 in 109 (0.9%) Montana	High	1 in 36 (2.8%) Nebraska	Very High	1 in 18 (5.5%) Nevada	Very High	1 in 17 (5.9%) New Hampshire	Low	1 in 64 (1.6%) New Jersey	Moderate	1 in 56 (1.8%) New Mexico	Very Low	1 in 106 (0.9%) New York	Low	1 in 73 (1.4%) North Carolina	Very High	1 in 17 (5.8%) North Dakota	High*	1 in 32 (3.2%) Ohio	Moderate	1 in 47 (2.1%) Oklahoma	Moderate*	1 in 44 (2.3%) Oregon	High	1 in 31 (3.2%) Pennsylvania	Moderate	1 in 44 (2.3%) Rhode Island	High	1 in 30 (3.3%) South Carolina	Very High	1 in 15 (6.6%) South D...
Read 7 tweets
Sep 13
California COVlD Surge Rages Higher

🔸CDC SARS-CoV-2 wastewater levels "Very High," and up from last week
🔸1 in 21 estimated actively infectious
🔸>250,000 estimated new daily infections statewide

Four figures...
1/4🧵 CDC: Very High: PMC Estimate: 1 in 21 actively infectious
CDC wastewater data in California show COVlD cases increasing from the already "Very High" levels last week.

2/4🧵 Line graph of the past 6 months showing the surge in wastewater viral levels
With an estimated 1 in 21 (or 4.8%) of California residents actively infectious of COVlD, risk increases dramatically in larger and more frequent social gatherings.

Interact with 25 people of average risk of being positive, and that's a >70% chance of exposure.
3/4🧵 How Does Risk Increase with More Social Contacts? Number of People | Chances Anyone is Infectious 1	4.8% 2	9.4% 3	13.7% 4	17.9% 5	21.8% 6	25.6% 7	29.1% 8	32.5% 9	35.8% 10	38.9% 15	52.2% 20	62.6% 25	70.8% 30	77.1% 35	82.1% 40	86.0% 50	91.5% 75	97.5% 100	99.3% 300	99.9%
Read 4 tweets
Sep 10
Let's say you're a dairy farmer. You have 100 cows. Each year, about 5 cows die, and another 5 cows are born. Then, along comes a virus. Let's call it "cowvid"...
1/
Let's say "cowvid" wipes out about half the cows over the course of a couple years. Now, you're down to 50 cows....
2/
The local mayor declares "cowvid" to be over. This surprises you as a farmer because 5 of your cows keep dying annually. 5 of 50 instead of 5 out of 100. Seems like more, but you're not a city slicker...
3/
Read 12 tweets
Sep 7
#DuringCOVID is today.

Image pack 1 of 9 🧵 Graph of the 11 waves of the pandemic in the U.S., tailored to a key message noted in the post.
1 million New Daily Infections.

Today!

Image pack 2 of 9 🧵 Graph of the 11 waves of the pandemic in the U.S., tailored to a key message noted in the post.
Where are the free vaccines, N95s, and tests?

Image pack 3 of 9 🧵 Graph of the 11 waves of the pandemic in the U.S., tailored to a key message noted in the post.
Read 9 tweets
Sep 3
PMC Dashboard Update (U.S.) 🧵1 of 8

The 11th wave is still rising.
🔥23 states/territories High/Very High
🔥Very High: Alabama, DC, Guam, Hawai'i, Louisiana, Nebraska, Nevada, South Carolina, Texas, Utah
🔥1 in 56 estimated actively infectious
🔥876,000 new daily infections CDC heat map, very high states noted in post. PMC estimate of 1 in 56 actively infectious nationwide
PMC Dashboard Update (U.S.) 🧵2 of 8

Note that the CDC has modified 📉 how transmission levels correspond to the categorical bins.

Take California. We estimate 1 in 30 actively infectious statewide. This would have previously been "Very High," now just "High."
#NewNormal CDC heatmap, with PMC estimate of 1 in 30
PMC Dashboard Update (U.S.) 🧵3 of 8

Here are the prevalence estimates for the first half of states/territories.

Notice how high the levels are in some of the "Moderate" states. State	CDC Level Alabama	Very High Alaska	High Arizona	High Arkansas	Moderate California	High Colorado	Moderate Connecticut	High Delaware	High District of Columbia	Very High Florida	High Georgia	Moderate Guam	Very High Hawaii	Very High Idaho	High Illinois	Very Low Indiana	Moderate* Iowa	Low Kansas	Low Kentucky	High Louisiana	Very High Maine	Low Maryland	Moderate Massachusetts	Moderate Michigan	Very Low Minnesota	Moderate Mississippi	High*
Read 9 tweets
Aug 21
During times like these when COVlD transmission heats up in the U.S., expect to see a lot more angry outbursts for three central reasons.

First, "displacement," or people trying to deny the reality of their anxiety by taking it out on other people....
Second, a lot of people can sustain a strong denial of reality about the ongoing pandemic during lulls. They suppress the existence of COVlD waves and excess deaths, disability, and retirements.

During waves, those defenses burst. Loss of control = anger...
Third, a lot of people (many reading this) understand COVlD correctly & experience righteous indignation during COVlD waves. We quite reasonably do not like all of the unjust and gratuitous suffering.

I find it helpful to channel that intensity into helping other people....
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(