Mike Hoerger, PhD MSCR MBA Profile picture
Aug 12, 2024 8 tweets 9 min read Read on X
BREAKING: Version 2.0 of the PMC COVID-19 Forecasting Model, August 12, 2024
🧵1/7

The U.S. now tops 1.3 million daily infections. 2.8% of the population (1 in 36) are actively infectious.



Deep Dive on Version 2.0 of the Model...

Welcome to version 2.0 of the PMC Model. The “C” in PMC is for Collaborative, and the work to improve this model is grounded in feedback from readers like you over the past year. Thank you for your support.

What’s New?

In short, the new model has substantial data quality improvements by combining multiple data sources for estimating transmission in unique ways that will hopefully increase forecasting accuracy, provide a truer representation of what has happened and is happening during the pandemic, and linkages to some statistics you will find helpful in day-to-day decision making.

Here is a deeper dive into the changes (skip to next section if desired). The new model is designed to provide a “true” picture of what has happened during the pandemic. It integrates three main data sources: the IHME true case estimation model, Biobot SARS-CoV-2 wastewater surveillance data, and the current CDC NWSS SARS-CoV-2 wastewater data. IHME provided a comprehensive case estimation model through April 1, 2023. Biobot was the CDC wastewater subcontractor through last fall and continues to do extensive non-CDC wastewater work. The CDC NWSS data are currently subcontracted with Verily, a subsidiary of Alphabet, which is the parent company of Google. Over the past year, we have seen Biobot scale back their public data and visualizations, and Verily has made steady improvements in their work with the CDC.

We previously relied solely on Biobot for forecasting and a Biobot-IHME data linkage for case estimation. It was a Biobot-heavy model. The current model is not tied strictly to any data set, but rather the PMC’s best estimate of the truth, a true-case model that uses multiple data sources in the spirit of IHME’s original work in this area. Essentially, we link all three data sources, which have been active over different points of the pandemic to derive a composite “PMC” indicator of true levels of transmission. The indicator is weighted based on which data sources were available and their perceived quality at each point in time. We scale this composite PMC indicator to the metric the CDC uses when helpful for comparisons with their website, and scale it with the true case estimates of the IHME otherwise, as true cases are more relevant than arbitrary wastewater metrics.

A great feature of the model is that it continues to integrate real-time data from Biobot and the CDC. From the perspective of Classical Test Theory, this is a huge advantage, as it provides a much more reliable indicator of what is currently happening with transmission. Both sources often make retroactive corrections for the most recent week’s data, sometimes sizable, and pitting the two indicators against one another reduces measurement error on average, which offers vital improvements in forecasting.

What are the Biggest Improvements in the Model?

· Accuracy in Real-Time Data – In integrating two active surveillance data sources, the real-time data will be more accurate. The biggest predictor of next week’s transmission levels, and the shape of how transmission is increasing or decreasing, accelerating or decelerating, is the current week’s real-time data. If the real-time data are off by 5% or 10%, the big-picture take on the forecast will still be reasonable, but a more precise estimate allows for greater accuracy in estimating the height and timing of waves.
· Regional Statistics – We are already integrating some regional data. Like you, we miss the vast and high-quality regional data and visualizations Biobot provided. We are hoping to take back some of those advantages through the new model and will improve them over time.
· Credibility – Although Biobot and CDC have unique strengths and limitations, a clear strength of adding the current CDC data set is that many people prefer to defer to the credibility of the CDC. The PMC model can be characterized fairly as a “CDC-derived case estimation and forecasting model,” which should lend more credence with those who are not deep enough in the weeds to evaluate the data as critically and prefer appeals to authority. We also provide some statistics that will allow you to draw more useful inferences from the CDC website.

What’s the Same in the Current Model?

The analytic assumptions underlying the forecasting model remain the same. It uses regression-based techniques common across all industries, using a combination of historic data (median levels of transmission for each day of the year) and emerging data from the past four weeks to characterize how transmission is growing or shrinking. Holidays and routine patterns of behavior that map on well to a calendar are “baked in” to the historic data. “New variants” and atypical patterns of behavior are baked into the data on recent patterns of transmission. It’s a top-down big picture model.

What are the Biggest Drawbacks of the New Model?

· Disruptions in Longitudinal Comparisons – You will notice some inconsistencies between the current and prior model that use additional data to form more accurate estimates, which is sometimes frustrating. A few examples. In the early pandemic, we estimated cases linking Biobot to IHME case estimates. Biobot transmission estimates were a bit “hotter” than others during that time period, the IHME estimates “cooler.” Our composite model depicts each of the first 4 waves somewhat smaller, which we believe provides a better picture of the “truth” as we can estimate it, but it is annoying psychologically to re-envision what has happened. This also throws off some of the big-picture statistics; for example, as of August 12, 2024, we estimate that Americans have had about 3.3 infections on average. A few months ago, we estimated nearly 3.5, so this is consistent with “cooler” picture of early-pandemic transmission. Presently, the CDC transmission estimates are running much hotter than those of Biobot, leading to estimates of a larger and earlier peak in the present wave. We would have preferred the CDC re-up with Biobot at the potential contract renewal to promote continuity in the data, but these sorts of changes in model estimation are the expected consequences of such a transition.

· Constantly-updating Historical Data – The CDC updates all of their historical estimates of transmission frequently, any time a new site comes on board, and twice annually to standardize the data longitudinally. This can sometimes create weird issues, where transmission is going up, but real-time values are lower than what was reported in real time the prior week because recent data were corrected downward. It will also throw off some of the helpful statistics we provide. These are minor nuisances, but be aware of them in case you spot something that seems strange.
· Documentation of Accuracy – We have excellent data on the accuracy of the prior model and will submit a report for publication shortly. All prior reports are publicly available. Many report quick facts on longitudinal accuracy, international comparisons, use in news articles, and references to use in peer-reviewed scientific journal articles. We cannot document the real-time accuracy of the new model yet, but know that when using historical data, the model accounts for 98% of the variability in wastewater transmission 1-week into the future, which is 2% higher than our prior model. The vast majority of forecasting errors have been and will continue to be based on inaccuracies in the real-time data wastewater surveillance companies report, and the model changes reduce those issues. We hope you will trust our history and that the methodologic changes represent improvements.

What Improvements Should We Expect in the Future?

There are many improvements we hope to roll out in the future. These include changes based on your feedback, the addition of confidence intervals in some of the graphs, and regional forecasting models. We may incorporate additional data sets if they can improve real-time estimates of current transmission.pmc19.com/data/Figure showing transmission throughout the entire pandemic, shows we are entering a 9th wave and provides CDC levels as well as the PMC estimate of daily infections, now at 3.3 million.
🧵2/7

Our graph of year-over-year transmission shows we have likely never had such high COVID transmission in mid-August.

Many classrooms will have a >50% chance someone is infectious. Expect K-12 schools and universities to be hotbeds for COVID outbreaks unless they are using serious multilayered mitigation.

🔹Indoor air quality that meets ASHRAE Standard 241 (if they have never heard of this or cannot explain how they are meeting the standard, they likely are not meeting the standard).
🔹Surveillance testing.
🔹Free on-demand testing.
🔹Universal masking.

This is uncharted territory in terms of such low mitigation coupled with high transmission with school starting. The possibility of a slightly larger wave than what we forecast remains.Figure showing year-over-year Covid transmission. We are presently just below the largest peak for a summer all-time, and we have never seen transmission this high in mid-August.
🧵3/7

Let's zoom in on the current wave. We're at our highest level of transmission since the winter surge, with 1.3 million daily infections.

Note, our model now combines Biobot and CDC data. Biobot still has the peak coming in early Sept, and so did the CDC until a huge spike this week.

By including two data sources, it helps counterbalance against errors in their real-time reporting, but we could still see some volatility in the size and date of the peak at this point.

Of course, different locations peak at different times.

You'll note that Aug 12 appears in the "forecasted" zone. That's because even wastewater data experience lags in reporting.Figure shows the most recent year of transmission. We're presently at 1.3 million daily infections, and likely closing in on the peak of this summer's wave.   Graph says "past 12 months," but it really shows the past 11 and includes a 1-month forecast. I'll fix that for next week.
🧵4/7
Here are some precise statistics on the current state of the pandemic in the U.S.

We are experiencing higher transmission than during 91% of the pandemic. 1 in 36 infectious. >1.3 million daily infections, nearly 10 million weekly infections, >400,000 resulting weekly Long COVID cases.

In a classroom of 25-30 students, there's over a 50% chance someone would be infectious.Current Levels for Aug 12, 2024 % of the Population Infectious 2.8% (1 in 36) New Daily Infections 1,336,000  New Weekly Infections 9,352,000  Resulting Weekly Long COVID Cases 468,000 to 1,870,000  Monthly Forecast Average % of the Population Infectious 2.7% (1 in 37) Average New Daily Infections 1,295,900 New Infections During the Next Month 38,877,000 Resulting Monthly Long COVID Cases 1,944,000 to 7,775,000  Running Totals Infections Nationwide in 2024 164,431,000 Average Number of Infections Per Person All-Time, U.S. 3.27  How Does Risk Increase with More Social Contacts? Number of Peo...
🧵5/7 CDC Covid-19 heat map  Highest transmission: West coast, Utah, Wyoming, Minnesota, Oklahoma, Texas, Louisiana, Arkansas, Florida, Carolinas
🧵6/7
Out West, about 1 in 24 people are infectious with COVID. The South is close behind.

On pgs 11-12 of the report, I walk you through an example of how to make rough estimates. In Louisiana, about 1 in 26 (or 3.9%) are infectious today.

pmc19.com/data/PMC_COVID…
A figure shows the CDC graph of regional variation in transmission.  A supplemental table states: Estimated Percentage Actively Infectious 	National	2.8% (1 in 36) 	Northeast	1.6% (1 in 63) 	Midwest	2.2% (1 in 45) 	South	3.7% (1 in 27) 	West	4.2% (1 in 24)  Some supplemental statistics are provided, best summarized in the linked report.
🧵7/7
Here's the full PMC COVID-19 Dashboard for Aug 12, 2024

Please share and adapt any of the images for use across other platforms and websites.
#MaskUp

pmc19.com/data/
The dashboard has tables and figures spread out across six sections. Each section is summarized in the Alt text in the prior 6 tweets.   Apologies for typos. I see in one place, I noted in Alt text 3.3 million daily infections, and it should be 1.3, as noted repeatedly elsewhere. Try to do better next time!
Thanks to those who have chatted about data and modeling recently - @MoriartyLab @jlerollblues @AnciraBecky @amethystarlight

Thanks @luckytran for your aesthetic Covid graphics during the winter surge. They inspired me trying to make the main graph nicer.

Thanks all!

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Mike Hoerger, PhD MSCR MBA

Mike Hoerger, PhD MSCR MBA Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @michael_hoerger

Jul 2
🌍Want to track COVID transmission accurately worldwide?

This PMC thread walks you through leading dashboards with information more up to date than WHO & EU directories.

🧵 1/ World map, countries with high-quality up-to-date surveillance systems shown in blue.
The Pandemic Mitigation Collaborative (PMC) Dashboard provides weekly COVID updates for the U.S., using wastewater surveillance derived case estimation models and analytic forecasting.

We have added a list of international dashboards:
🧵 2/
pmc19.com/data/index.php…Australia: NSW Health Australia Department of Health, Western Australia Austria: Federal Government of Austria City of Vienna - Regional Coron-A Consortium Austria Czech Republic: State Health Institute Czech Republic Belgium: Belgian Institute for Health Canada: Government of Canada Andrew Young's Canada Visualization C19 Resources Canada * World Health Network (WHN) Canada **
Denmark: Statens Serum Institut Denmark Europe (multi-national): EU Wastewater Observatory Finland: National Institute for Health and Welfare Finland WHN Finland ** France: French Republic Data Airborne Risk Reduction Association (ARRA) France Zan Armstrong's France Visualization COVID Weather App France (Android, IOS, or Web) Thomas Delattre's France Visualization Germany: Infection Radar Germany Bay-VOC Bavaria Region Berlin Waterworks WHN Germany **
Hong Kong: Centre for Health Protection, Hong Kong Hungary: National Center for Public Health and Pharmacy of Hungary India: Pune Knowledge Cluster of India Gujarat Biotechnology Research Centre of India Ireland: Health Protection Surveillance Centre Ireland Japan: Japan Institute for Health Security Latvia: Institute of Food Safety, Animal Health and Environment Latvia Lithuania: National Public Health Centre of Luthuania Luxembourg: Microbs Luxembourg Netherlands: National Institute for Public Health and the Environment Netherlands WHN Netherlands ** New Zealand: New Zealand Institute for...
Slovenia: National Institute of Public Health Slovenia Spain: Government of Catalonia - Regional South Africa: National Institute for Communicable Diseases of South Africa South Korea: Korea Disease Control and Prevention Agency Sweden: Pathogens Sweden Switzerland: Swiss Confederation Federal Office of Public Health ETH Zurich Switzerland WISE Dashboard U.K.: Public Health Scotland Buckinghamshire Disability Service (BuDS) U.K.  *See the numbers 1-21 along the lower left. Click on 3-6 for national and regional data. ** Click on the dropdown menu. May need to try a different web browser.
Our international directory includes official government dashboards & those developed by citizen scientists.

We exclude countries that have stopped reporting in the past 2-12 months even if on EU or WHO lists. We also exclude low-quality data from opt-in testing programs.
🧵 3/
Read 43 tweets
Jun 24
PMC COVlD Dashboard, Jun 23, 2025 (U.S.)

🔥Biggest uptick since Jan
🔥1 in 167 actively infectious
🔥>2 million weekly infections
🔥700-1,200 resulting excess deaths from weekly infections

Track transmission closer to home w/our new state & international resources 👇

🧵1/6
PMC COVlD Dashboard, Jun 23, 2025 (U.S.)

🔹With >90% probability, we have entered the 11th COVlD wave.
🔹In a room of 50 people, there is already a 1 in 4 chance of an exposure.
🔹We expect nearly 15 million infections in the next month, and rising.

🧵2/6Current Levels for Jun 23, 2025	 % of the Population Infectious	 0.6% (1 in 167)	 New Daily Infections	 287000	 New Weekly Infections	 2009000	 Resulting Weekly Long COVID Cases	 100,000 to 402,000	 Resulting Weekly Excess Deaths	 700 to 1,200	 	 Monthly Forecast	 Average % of the Population Infectious	 1.0% (1 in 97)	 Average New Daily Infections	 493300	 New Infections During the Next Month	 14799000	 Resulting Monthly Long COVID Cases	 740,000 to 2,960,000	 Resulting Monthly Excess Deaths	 5,300 to 8,800	 	 Running Totals	 Infections Nationwide in 2025	 74869000	 Average Number of Infect...
PMC COVlD Dashboard, Jun 23, 2025 (U.S.)

We continue to expect transmission to break 500,000 daily infections in the U.S. around July 9th.

This is the same prediction as last week, as the forecast was dead on. Yet, there is considerably uncertainty around this timing.

🧵3/6Longitudinal transmission, past 12 months and 1-month forecast
Read 6 tweets
Jun 17
1) PMC COVlD Dashboard, June 16, 2025 (U.S.)

Current transmission (red line) closely tracks that of summer 2023 (yellow line).

We expect to break 500k daily infections between July 9 and the end of July. Our current forecast...Year over year graph of tranmission
2) PMC COVlD Dashboard, June 16, 2025 (U.S.)

Our current forecast is a bit more aggressive, predicting breaking 500k daily infections by July 9. The 2023 trend suggests end of July.

The 95% confidence interval shows large variation. Note that...Past 12 months of transmission (U.S.) and forecast
3) PMC COVlD Dashboard, June 16, 2025 (U.S.)

Note that CDC and Biobot both had retroactive corrections to last week's data, meaning the relative "lull" will last a little longer than the uncorrected data suggested. No big news on NB.1.8.1.

All good news, but...10 pandemic waves. Soon 11
Read 7 tweets
Jun 16
1) Here's a quick example of how the federal government is censoring the best scientific research. It's not just cuts to ongoing research.

It's new grant submissions too... "No forbidden words found"
2) In January, I re-submitted a promising Covid/cancer grant to a non-federal funder. Hundreds of pages. Hundreds of hours of work. The best proposal I've submitted as a scientist.

Out of curiosity, I used Sean Mullen's Scan Assist tool to see how many banned words it had...
3) The proposal had 1,750 banned words. No big deal -- they're non-federal.

BUT I had planned to submit a smaller version to NIH this month as a "back up." Impossible!

It's not a matter of using a thesaurus or the find/replace command. The grant is on *Covid*... 1,750 banned words found
Read 13 tweets
Jun 9
1) PMC COVlD Dashboard, June 9, 2025

CDC wastewater surveillance data show transmission rising. This is our forecast if transmission growth follows typical patterns.

The high & low estimates could be thought of as optimistic & pessimistic scenarios for NB.1.8.1.Forecast graph: Rising transmission the next month
2) PMC COVlD Dashboard, June 9, 2025

Notice that current transmission (red line, lower left) tracks closely with two years ago (yellow), slightly below the median (gray), and not far below last year (orange).

Consider each of these trajectories realistic scenarios.Image
3) PMC COVlD Dashboard, June 9, 2025

All indications are that we are headed into the start of an 11th national wave in the U.S.

We could percolate near the lull point another couple weeks (fingers crossed), but that scenario is becoming less likely.Graph of 10 C19 waves
Read 7 tweets
Jun 3
1/ PMC COVlD Dashboard, June 2, 2025 (U.S.)

National COVlD transmission recently fell to its lowest levels since the pre-Delta era.

It's go-time for many who have delayed medical appointments. The situation will likely get much worse in Jul/Aug.
2/ PMC COVlD Dashboard, June 2, 2025 (U.S.)

An estimated 1 in 211 are actively infectious. Most states are "low" or "very low" per CDC.

The situation remains serious even in a relative "lull." >1.5 million weekly estimated infections to result in 600-900 excess deaths.C19 heat map
year over year graph, tracking closely with the median and past 2 years
Current Levels for Jun 2, 2025	 % of the Population Infectious	 0.5% (1 in 211)	 New Daily Infections	 227000	 New Weekly Infections	 1589000	 Resulting Weekly Long COVID Cases	 79,000 to 318,000	 Resulting Weekly Excess Deaths	 600 to 900	 	 Monthly Forecast	 Average % of the Population Infectious	 0.7% (1 in 139)	 Average New Daily Infections	 344566.6667	 New Infections During the Next Month	 10337000	 Resulting Monthly Long COVID Cases	 517,000 to 2,067,000	 Resulting Monthly Excess Deaths	 3,700 to 6,200	 	 Running Totals	 Infections Nationwide in 2025	 70132000	 Average Number of Infe...
Graph of the whole pandemic, 11th wave forthcoming
3/ PMC COVlD Dashboard, June 2, 2025 (U.S.)

By the end of the month, we forecast an increase to 450k daily infections. If NB.1.8.1 takes off, closer to 600k. If overhyped, percolating only slightly higher.Past 12 months and forecast
Read 5 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(