Mike Hoerger, PhD MSCR MBA Profile picture
Aug 12, 2024 8 tweets 9 min read Read on X
BREAKING: Version 2.0 of the PMC COVID-19 Forecasting Model, August 12, 2024
🧵1/7

The U.S. now tops 1.3 million daily infections. 2.8% of the population (1 in 36) are actively infectious.



Deep Dive on Version 2.0 of the Model...

Welcome to version 2.0 of the PMC Model. The “C” in PMC is for Collaborative, and the work to improve this model is grounded in feedback from readers like you over the past year. Thank you for your support.

What’s New?

In short, the new model has substantial data quality improvements by combining multiple data sources for estimating transmission in unique ways that will hopefully increase forecasting accuracy, provide a truer representation of what has happened and is happening during the pandemic, and linkages to some statistics you will find helpful in day-to-day decision making.

Here is a deeper dive into the changes (skip to next section if desired). The new model is designed to provide a “true” picture of what has happened during the pandemic. It integrates three main data sources: the IHME true case estimation model, Biobot SARS-CoV-2 wastewater surveillance data, and the current CDC NWSS SARS-CoV-2 wastewater data. IHME provided a comprehensive case estimation model through April 1, 2023. Biobot was the CDC wastewater subcontractor through last fall and continues to do extensive non-CDC wastewater work. The CDC NWSS data are currently subcontracted with Verily, a subsidiary of Alphabet, which is the parent company of Google. Over the past year, we have seen Biobot scale back their public data and visualizations, and Verily has made steady improvements in their work with the CDC.

We previously relied solely on Biobot for forecasting and a Biobot-IHME data linkage for case estimation. It was a Biobot-heavy model. The current model is not tied strictly to any data set, but rather the PMC’s best estimate of the truth, a true-case model that uses multiple data sources in the spirit of IHME’s original work in this area. Essentially, we link all three data sources, which have been active over different points of the pandemic to derive a composite “PMC” indicator of true levels of transmission. The indicator is weighted based on which data sources were available and their perceived quality at each point in time. We scale this composite PMC indicator to the metric the CDC uses when helpful for comparisons with their website, and scale it with the true case estimates of the IHME otherwise, as true cases are more relevant than arbitrary wastewater metrics.

A great feature of the model is that it continues to integrate real-time data from Biobot and the CDC. From the perspective of Classical Test Theory, this is a huge advantage, as it provides a much more reliable indicator of what is currently happening with transmission. Both sources often make retroactive corrections for the most recent week’s data, sometimes sizable, and pitting the two indicators against one another reduces measurement error on average, which offers vital improvements in forecasting.

What are the Biggest Improvements in the Model?

· Accuracy in Real-Time Data – In integrating two active surveillance data sources, the real-time data will be more accurate. The biggest predictor of next week’s transmission levels, and the shape of how transmission is increasing or decreasing, accelerating or decelerating, is the current week’s real-time data. If the real-time data are off by 5% or 10%, the big-picture take on the forecast will still be reasonable, but a more precise estimate allows for greater accuracy in estimating the height and timing of waves.
· Regional Statistics – We are already integrating some regional data. Like you, we miss the vast and high-quality regional data and visualizations Biobot provided. We are hoping to take back some of those advantages through the new model and will improve them over time.
· Credibility – Although Biobot and CDC have unique strengths and limitations, a clear strength of adding the current CDC data set is that many people prefer to defer to the credibility of the CDC. The PMC model can be characterized fairly as a “CDC-derived case estimation and forecasting model,” which should lend more credence with those who are not deep enough in the weeds to evaluate the data as critically and prefer appeals to authority. We also provide some statistics that will allow you to draw more useful inferences from the CDC website.

What’s the Same in the Current Model?

The analytic assumptions underlying the forecasting model remain the same. It uses regression-based techniques common across all industries, using a combination of historic data (median levels of transmission for each day of the year) and emerging data from the past four weeks to characterize how transmission is growing or shrinking. Holidays and routine patterns of behavior that map on well to a calendar are “baked in” to the historic data. “New variants” and atypical patterns of behavior are baked into the data on recent patterns of transmission. It’s a top-down big picture model.

What are the Biggest Drawbacks of the New Model?

· Disruptions in Longitudinal Comparisons – You will notice some inconsistencies between the current and prior model that use additional data to form more accurate estimates, which is sometimes frustrating. A few examples. In the early pandemic, we estimated cases linking Biobot to IHME case estimates. Biobot transmission estimates were a bit “hotter” than others during that time period, the IHME estimates “cooler.” Our composite model depicts each of the first 4 waves somewhat smaller, which we believe provides a better picture of the “truth” as we can estimate it, but it is annoying psychologically to re-envision what has happened. This also throws off some of the big-picture statistics; for example, as of August 12, 2024, we estimate that Americans have had about 3.3 infections on average. A few months ago, we estimated nearly 3.5, so this is consistent with “cooler” picture of early-pandemic transmission. Presently, the CDC transmission estimates are running much hotter than those of Biobot, leading to estimates of a larger and earlier peak in the present wave. We would have preferred the CDC re-up with Biobot at the potential contract renewal to promote continuity in the data, but these sorts of changes in model estimation are the expected consequences of such a transition.

· Constantly-updating Historical Data – The CDC updates all of their historical estimates of transmission frequently, any time a new site comes on board, and twice annually to standardize the data longitudinally. This can sometimes create weird issues, where transmission is going up, but real-time values are lower than what was reported in real time the prior week because recent data were corrected downward. It will also throw off some of the helpful statistics we provide. These are minor nuisances, but be aware of them in case you spot something that seems strange.
· Documentation of Accuracy – We have excellent data on the accuracy of the prior model and will submit a report for publication shortly. All prior reports are publicly available. Many report quick facts on longitudinal accuracy, international comparisons, use in news articles, and references to use in peer-reviewed scientific journal articles. We cannot document the real-time accuracy of the new model yet, but know that when using historical data, the model accounts for 98% of the variability in wastewater transmission 1-week into the future, which is 2% higher than our prior model. The vast majority of forecasting errors have been and will continue to be based on inaccuracies in the real-time data wastewater surveillance companies report, and the model changes reduce those issues. We hope you will trust our history and that the methodologic changes represent improvements.

What Improvements Should We Expect in the Future?

There are many improvements we hope to roll out in the future. These include changes based on your feedback, the addition of confidence intervals in some of the graphs, and regional forecasting models. We may incorporate additional data sets if they can improve real-time estimates of current transmission.pmc19.com/data/Figure showing transmission throughout the entire pandemic, shows we are entering a 9th wave and provides CDC levels as well as the PMC estimate of daily infections, now at 3.3 million.
🧵2/7

Our graph of year-over-year transmission shows we have likely never had such high COVID transmission in mid-August.

Many classrooms will have a >50% chance someone is infectious. Expect K-12 schools and universities to be hotbeds for COVID outbreaks unless they are using serious multilayered mitigation.

🔹Indoor air quality that meets ASHRAE Standard 241 (if they have never heard of this or cannot explain how they are meeting the standard, they likely are not meeting the standard).
🔹Surveillance testing.
🔹Free on-demand testing.
🔹Universal masking.

This is uncharted territory in terms of such low mitigation coupled with high transmission with school starting. The possibility of a slightly larger wave than what we forecast remains.Figure showing year-over-year Covid transmission. We are presently just below the largest peak for a summer all-time, and we have never seen transmission this high in mid-August.
🧵3/7

Let's zoom in on the current wave. We're at our highest level of transmission since the winter surge, with 1.3 million daily infections.

Note, our model now combines Biobot and CDC data. Biobot still has the peak coming in early Sept, and so did the CDC until a huge spike this week.

By including two data sources, it helps counterbalance against errors in their real-time reporting, but we could still see some volatility in the size and date of the peak at this point.

Of course, different locations peak at different times.

You'll note that Aug 12 appears in the "forecasted" zone. That's because even wastewater data experience lags in reporting.Figure shows the most recent year of transmission. We're presently at 1.3 million daily infections, and likely closing in on the peak of this summer's wave.   Graph says "past 12 months," but it really shows the past 11 and includes a 1-month forecast. I'll fix that for next week.
🧵4/7
Here are some precise statistics on the current state of the pandemic in the U.S.

We are experiencing higher transmission than during 91% of the pandemic. 1 in 36 infectious. >1.3 million daily infections, nearly 10 million weekly infections, >400,000 resulting weekly Long COVID cases.

In a classroom of 25-30 students, there's over a 50% chance someone would be infectious.Current Levels for Aug 12, 2024 % of the Population Infectious 2.8% (1 in 36) New Daily Infections 1,336,000  New Weekly Infections 9,352,000  Resulting Weekly Long COVID Cases 468,000 to 1,870,000  Monthly Forecast Average % of the Population Infectious 2.7% (1 in 37) Average New Daily Infections 1,295,900 New Infections During the Next Month 38,877,000 Resulting Monthly Long COVID Cases 1,944,000 to 7,775,000  Running Totals Infections Nationwide in 2024 164,431,000 Average Number of Infections Per Person All-Time, U.S. 3.27  How Does Risk Increase with More Social Contacts? Number of Peo...
🧵5/7 CDC Covid-19 heat map  Highest transmission: West coast, Utah, Wyoming, Minnesota, Oklahoma, Texas, Louisiana, Arkansas, Florida, Carolinas
🧵6/7
Out West, about 1 in 24 people are infectious with COVID. The South is close behind.

On pgs 11-12 of the report, I walk you through an example of how to make rough estimates. In Louisiana, about 1 in 26 (or 3.9%) are infectious today.

pmc19.com/data/PMC_COVID…
A figure shows the CDC graph of regional variation in transmission.  A supplemental table states: Estimated Percentage Actively Infectious 	National	2.8% (1 in 36) 	Northeast	1.6% (1 in 63) 	Midwest	2.2% (1 in 45) 	South	3.7% (1 in 27) 	West	4.2% (1 in 24)  Some supplemental statistics are provided, best summarized in the linked report.
🧵7/7
Here's the full PMC COVID-19 Dashboard for Aug 12, 2024

Please share and adapt any of the images for use across other platforms and websites.
#MaskUp

pmc19.com/data/
The dashboard has tables and figures spread out across six sections. Each section is summarized in the Alt text in the prior 6 tweets.   Apologies for typos. I see in one place, I noted in Alt text 3.3 million daily infections, and it should be 1.3, as noted repeatedly elsewhere. Try to do better next time!
Thanks to those who have chatted about data and modeling recently - @MoriartyLab @jlerollblues @AnciraBecky @amethystarlight

Thanks @luckytran for your aesthetic Covid graphics during the winter surge. They inspired me trying to make the main graph nicer.

Thanks all!

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Mike Hoerger, PhD MSCR MBA

Mike Hoerger, PhD MSCR MBA Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @michael_hoerger

Jan 25
During this 12th COVlD wave, the CDC reports 1-in-3 states have "High" or "Very High" levels.

PMC estimates the proportion of residents actively infectious (prevalence):
◾️USA: 1 in 67
◾️IA: 1 in 27
◾️MI: 1 in 25
◾️IN & CT: 1 in 23
◾️ME: 1 in 21
◾️OK & SD: 1 in 17

🧵1/ Heat map using CDC data. National PMC prevalence estimate noted; estimated incidence of 732,000 new daily infections.
On average, Americans have have 5.0 cumulative SARS-CoV-2 infections.

This week's infections are expected to result in 1/4 to 1 million new #LongCOVID conditions and ≈2,000 excess deaths.
🧵2/ Column 1: Table of state-level prevalence estimates. Highest estimates noted in the thread text.  Column 2:  Proportion Actively Infectious										1 in 67 (1.5%) New Daily Infections										 732,000  Infections the Past Week										 5,220,000  Infections in 2026										 24,000,000  Cumulative Infections per Person										 5.04  										 Long COVID										 Long COVID Cases Resulting								37,000 to 146,000		   from New Daily Infections										 Long COVID Cases Resulting								261,000 to 1,040,000		   from New Weekly Infections										 										 Excess Deaths										 Ex...
The wave peak is now estimated >10% higher than last week at 1.2 million new daily infections, nearly double the Delta wave.

We expect sustained high transmission (≈600,000 to 750,000 new daily infections) the next few weeks as COVlD circulates through schools/families.
🧵3/ Fig 1: Graph of 12 waves  Fig 2: "Barometer" showing above average transmission  Fig 3: Year-over-year graph, which informs the analytic forecast  Fig 4: Forecast described in post
Read 4 tweets
Jan 17
Based on today's CDC & Biobot data, we estimate the following for the week of Jan 19:

🔸1 in 52 people in the U.S. actively infectious
🔸25% chance of exposure in a room of 15 ppl
🔸Nearly 1 million new daily infections
🔸5 cumulative infections per person all-time (avg)
🧵1/5 Heat map from CDC data with PMC estimates. Description of "Very High" states in next post
Transmission estimates have been marginally corrected upward.

11 states have Very High COVlD levels:

🔸PA: 1 in 25 estimated actively infectious
🔸MI: 1 in 23
🔸OH & KY: 1 in 22
🔸SD: 1 in 20
🔸NE & IA: 1 in 18
🔸IL & ME: 1 in 17
🔸IN: 1 in 16
🔸WV: 1 in 11
🧵2/5 Proportion Actively Infectious										1 in 52 (1.9%) New Daily Infections										 941,000  Infections the Past Week										 6,020,000  Infections in 2026										 18,000,000  Cumulative Infections per Person										 5.01  										 Long COVID										 Long COVID Cases Resulting								47,000 to 188,000		   from New Daily Infections										 Long COVID Cases Resulting								301,000 to 1,200,000		   from New Weekly Infections										 										 Excess Deaths										 Excess Deaths Resulting 									270 to 450	   from New Daily Infections										 Excess Deaths Resulting 				...
We're in the middle of a 12th COVlD wave.

The peak has likely passed, but with students headed back to school, transmission is expected to remain high for at least the next several weeks.

🧵3/5 1) Graph of 12 waves 2) Barometer showing above-average transmission 3) Year over year graph 4) Forecast for transmission to decline and then percolate at high levels
Read 5 tweets
Jan 10
The size of the winter COVlD wave has been revised upward as post-holiday data come in.

We estimated 1 in 55 people in the U.S. are actively infectious.

🔥WV: 1 in 14
🔥IN: 1 in 15
🔥MI & OH: 1 in 21
🔥MO: 1 in 22
🔥CT: 1 in 24
🔥KS: 1 in 25
🔥MA & IL: 1 in 27

Quick 🧵 1/4 Heat map and PMC estimates, 1 in 55 infectious and 892,000 new daily infections for Jan 12.  We expedited the report to release it two days early.
Nationally, we are seeing an estimated 892,000 new daily SARS-CoV-2 infections, meaning a 1 in 4 chance of exposure in a room of 15 people. Risk varies considerably by state.

We are approaching an average of 5 infections per person since pandemic onset.
🧵 2/4 Alabama	Moderate Alaska	Very Low Arizona	Very Low Arkansas	High* California	Very Low Colorado	Low Connecticut	Very High Delaware	Moderate District of Columbia	Very Low Florida	Very Low Georgia	Very Low Guam	Very Low Hawaii	Very Low Idaho	Very Low Illinois	Very High Indiana	Very High Iowa	High Kansas	Very High Kentucky	Moderate Louisiana	Moderate Maine	High Maryland	High Massachusetts	Very High Michigan	Very High* Minnesota	Moderate Mississippi	Low* Missouri	Very High* Montana	High Nebraska	High Nevada	Very Low New Hampshire	Moderate New Jersey	Low New Mexico	Moderate New York	High* North Ca...
We are in the 12th COVlD wave of the U.S.

Current transmission is higher than 68% of all days since the pandemic onset in 2020.
🧵 3/4 12 waves of COVlD  Pandemic barometer: Higher than 88% of the past 100 days, 73% of the past year, 68% of the entire pandemic.  Year over year graph  Forecast of slowly declining transmission.
Read 4 tweets
Jan 8
You might not have heard, but the northeastern U.S. is in a COVlD surge.

We use wastewater levels to derive estimates of the proportion of people actively infectious in each state (prevalence), e.g., 1 in 24 people in Connecticut.

Let me walk you through it...

🧵1/8 Colors show CDC levels PMC prevalence estimates noted: -Maine 1 in 38 actively infectious with COVlD -New Hampshire 1 in 35 (limited data) -Vermont 1 in 75 -New York 1 in 44 (limited data) -Pennsylvania 1 in 44 -Massachusetts 1 in 36 -Connecticut 1 in 24 -Rhode Island 1 in 41 -New Jersey 1 in 82
Notice that #Connecticut has excellent SARS-CoV-2 wastewater surveillance. It's "Very High" across much of the state, per CDC.

Based on wastewater levels, we estimate 1 in 24 residents are actively infectious w/COVlD. That's a 66% exposure risk in a room of 25 people.

🧵2/8 Colors show CDC levels PMC estimate of prevalence
The CDC reports "Very High" levels in #Massachusetts.

The surveillance is less robust, but we estimate 1 in 26 residents are actively infectious, similar to our estimate in CT where coverage is better.

In a room of 25 people, that's a 62% chance of an exposure.

🧵3/8 Colors show CDC levels PMC prevalence estimates provided
Read 8 tweets
Jan 8
We're in the middle of a 12th COVlD wave in the U.S., with transmission particularly high in the Midwest and Northeast.

The CDC announced this week that COVlD continues to kill more Americans than breast and prostate cancer combined.

Get boosted & #MaskUp 💉💪😷
1/4🧵 Heat map of CDC data with PMC prevalence estimate
Levels are "Moderate" to "Very High" in 26 states.

However, data reporting is slow, and about 1/3 of states have low-quality data this week due to the holidays and illness.

2/4🧵 National estimates: Number of People		Chances Anyone is Infectious			 1				1.5%	 2				3.0%	 3				4.5%	 4				6.0%	 5				7.4%	 10				14.3%	 15				20.7%	 20				26.5%	 25				32.0%	 30				37.0%	 50				53.8%	 75				68.6%	 100				78.6%	 200				95.4%	 300				99.0%
Barometer: Higher transmission than 90 of the past 100 days (perhaps higher still, due to low data reporting quality)
State	CDC Level	Actively Infectious Alabama	Moderate	1 in 48 (2.1%) Alaska	Very Low	1 in 152 (0.7%) Arizona	Very Low	1 in 201 (0.5%) Arkansas	High*	1 in 36 (2.8%) California	Very Low	1 in 484 (0.2%) Colorado	Moderate	1 in 49 (2.0%) Connecticut	Very High	1 in 24 (4.2%) Delaware	Low*	1 in 70 (1.4%) District of Columbia	Very Low	1 in 5,835 (0.0%) Florida	Very Low	1 in 284 (0.4%) Georgia	Low	1 in 90 (1.1%) Guam	Very Low	1 in 687 (0.1%) Hawaii	Very Low	1 in 874 (0.1%) Idaho	Very Low	1 in 169 (0.6%) Illinois	Moderate*	1 in 56 (1.8%) Indiana	High*	1 in 34 (2.9%) Iowa	Moderate	1 in 41 (2.4%) Kansas...
State	CDC Level	Actively Infectious Missouri	Moderate*	1 in 42 (2.4%) Montana	High	1 in 34 (2.9%) Nebraska	Very High	1 in 26 (3.9%) Nevada	Very Low	1 in 138 (0.7%) New Hampshire	High*	1 in 35 (2.9%) New Jersey	Low	1 in 82 (1.2%) New Mexico	Low	1 in 87 (1.2%) New York	Moderate*	1 in 44 (2.3%) North Carolina	Low	1 in 82 (1.2%) North Dakota	High*	1 in 34 (3.0%) Ohio	Very High	1 in 27 (3.7%) Oklahoma	Moderate*	1 in 62 (1.6%) Oregon	Very Low	1 in 170 (0.6%) Pennsylvania	Moderate	1 in 44 (2.3%) Rhode Island	Moderate	1 in 41 (2.4%) South Carolina	Moderate	1 in 54 (1.9%) South Dakota	Very High	1 in...
If like years 1-4 of the pandemic, the winter wave has peaked. If like last year, we could hover near peak levels for a month.

Forecasting quality is low with 1/3 of states having data issues. Hopefully, we'll know a lot more in a few days.

3/4🧵 12 waves
Proportion Actively Infectious										1 in 65 (1.5%) New Daily Infections										 749,000  Infections the Past Week										 5,390,000  Infections in 2026										 3,000,000  Cumulative Infections per Person										 4.88  										 Long COVID										 Long COVID Cases Resulting								37,000 to 150,000		   from New Daily Infections										 Long COVID Cases Resulting								270,000 to 1,080,000		   from New Weekly Infections										 										 Excess Deaths										 Excess Deaths Resulting 									220 to 370	   from New Daily Infections										 Excess Deaths Resulting 					...
year over year graph
forecast
Read 4 tweets
Jan 5
We told you that 109,000-175,000 Americans would died of COVID (excess deaths) in 2025.

Today, the CDC estimates 101,000 deaths/year (flat from Oct 2022 to Sep 2024), and likely higher when considering more nebulous non-acute excess deaths (heart attack 6 months later).
1/5
The CDC estimates are actually higher than I would have guessed, given their methodology, which models estimates based on easily countable factors in healthcare and expert input on multiplier values. It lends credence to the PMC upper bound of excess deaths of 175,000/yr.
2/5
What's troubling is the CDC has annual mortality flat. My expectation based on mortality displacement and Swiss Re data is that it should be declining. If is stays flat, we're running on something like breast+prostate cancer or lung cancer deaths per year in perpetuity.
3/5
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(