My Authors
Read all threads
We’ve noticed something that looks like a potentially important data-entry problem regarding comorbidities in the “big” China CDC report on the Epidemiological Characteristics of COVID19. This report uses data from 72,314 patient records. (1/6)
The China CDC report has the most widely used numbers I’ve seen for age and sex fatality rates. It was reported in brief form in JAMA, and fully in a China CDC weekly (link below, it loads slowly sometimes).
weekly.chinacdc.cn/en/article/id/…

(2/6)
See numbers here.
From the report:
“† The comorbid condition variable, only includes a total of 20,812 patients and 504 deaths and these values were used to calculate percentages in the confirmed cases and deaths columns.”
(3/6)
Summing up the numbers for each of the 5 conditions plus “none” (which I take to mean none of these 5) adds up to exactly 20,812, whereas we should expect double counting.

Add 23,690 reported as missing = 44,502, which is 170 less than the 44,672 total confirmed cases

(4/6)
Does anyone have any insight into this? Or access to the data behind the report? I am worried that this might compromise the per-comorbidity fatality rates. I hope there is some simple fix that we have missed.

(5/6)
And this is the JAMA paper, which is a condensed version of the full China CDC report. It doesn't have the full breakdown of comorbidities, but does report the per-comorbidity fatality rates.

(6/6)

jamanetwork.com/journals/jama/…
@SanjatKanjilal @EpiEllie @EpidByDesign @statsepi @_MiguelHernan @GSCollins @EpidByDesign @jd_wilko

(tagging in case you have an idea or know someone who knows someone with better access to the numbers behind this report)
Just to be clear: the reason the number of patients with each comorbidity and with no comorbidities should *not* add up to the total number of patients is because a large proportion of patients are expected to have 2 or more comorbidities: e.g. CVD+diabetes, diabetes+hypertension
The least concerning solution to this I can think of would be if the reported sum of patient with known comorbid condition was wrongly calculated by summing up the categories for the five conditions + “none”. Then the fatality rates per condition are still correct.
Missing some Tweet in this thread? You can try to force a refresh.

Enjoying this thread?

Keep Current with Uri Shalit

Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

Twitter may remove this content at anytime, convert it as a PDF, save and print for later use!

Try unrolling a thread yourself!

how to unroll video

1) Follow Thread Reader App on Twitter so you can easily mention us!

2) Go to a Twitter thread (series of Tweets by the same owner) and mention us with a keyword "unroll" @threadreaderapp unroll

You can practice here first or read more on our help page!

Follow Us on Twitter!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just three indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3.00/month or $30.00/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!