The geospatial analysis in Worobey2022 relies on a centering model to determine the origin point of COVID in Wuhan Dec 2019.
This model is not valid. doi.org/10.1126/scienc…
The centering model can be stated as follows: the spatial pattern of the home residence of severe cases is centered on the origin point, with spatial density decreasing away from the origin point.
Fig 2A and B of Worobey2022 provide insight into the authors' logic.
“We hypothesized that if the Huanan market were the epicenter of the pandemic, then early cases should fall not just unexpectedly near to it but should also be unexpectedly centered on it”
Worobey et al place considerable emphasis on the claim that the severe-case-residences are centered on the Huanan market, in the article itself, in media, and in tweets posted by the authors, e.g.:
The centering model requires that most cases are infected near the origin, and that their residences are distributed in a circular pattern around the origin.
That is not compatible with the patterns of transmission of COVID or with the patterns of human movement in a city.
For purposes of illustration, consider this simple example of a COVID transmission pattern from Singapore, Jan 29-Feb 24, 2020.
Note how it gets complicated with only a few steps from the local origin.
Another view of the Singapore transmission pattern.
Now consider human movement in Wuhan.
Wuhan is a large urban center with modern transportation infrastructure. Millions of people move daily on complex networks, by car, scooter, bus, metro, rail. map.baidu.com/@12735318.3419…
Spatial pattern of daily population movements in Wuhan (2015).
Red: people entering an area.
Green: people leaving an area.
Yellow: little change. journals.plos.org/plosone/articl…
The centering model is not compatible with the complexities of transmission chains and human movement.
We can test this conclusion.
There is one subset of Dec 2019 severe cases with a known point of infection: those with a link to Huanan market.
Helpfully, the market-linked cases are plotted as orange dots on Worobey2022 Fig 1a.
It is evident by inspection that the orange dots are not close to the market, and do not form a pattern around the market.
The non-validity of the centering model is demonstrable for the subset of 155 severe cases considered in Worobey2022.
But there were many more cases in Wuhan in Dec 2019.
How many non-severe cases in Wuhan, Dec 2019? @MichaelWorobey, in WaPo, wapo.st/3UWfdCv, estimated that “something like six percent end up in the hospital.”
174-260 severe cases➡️2726-4073 non-severe cases➡️3000-4000 in significant figures.
So, in addition to the 174-260 severe cases, it is reasonable to assume that 3000-4000 non-severe cases were circulating through the city of Wuhan during Dec 2019, each with an approximately 10-day window of infectiousness.
These 3000-4000 cases, plus the 174-260 severe cases, would have constituted a complex pattern of transmission. Imagine the Singapore example with thousands of nodes, rather than 30.
Now, imagine that transmission pattern translated into a geographic pattern of residences.
Of course, neither the transmission pattern nor the geographic pattern will ever be known.
But, if we could add those 3000-4000 additional residences, the result would be … all over the map. Many new clusters and individual dots would appear, with their locations determined by random events, e.g., which metro car did an infected person get into?
It is not possible to predict what the unknown geographic pattern would look like, but it is not credible to assume that those 3000-4000 additional locations would simply follow the pattern of severe-case residences shown in Worobey2022 Fig 1A.
Consequently, there is no basis for the claim by @michaelworobey that “we fully expect the cases that we don’t sample to come from exactly the same geographic distribution as the ones that we do sample” (WaPo wapo.st/3UWfdCv)
CONCLUSION
Worobey2022 used an invalid centering model to interpret the geographic pattern of 155 residences of severe COVID cases in Wuhan, Dec 2019, in support of their claim that the Huanan market was the origin point of the pandemic.
• • •
Missing some Tweet in this thread? You can try to
force a refresh
This thread examines two claims in Worobey et al:
Dec-2019 COVID case-residences in Wuhan were not concentrated in (1) areas of high population density or (2) areas with a high proportion of older persons. science.org/doi/10.1126/sc…
The specific claims in Worobey et al:
🌐Dec-2019 cases did not reside in areas with high population density of (1) all age groups or (2) older persons.
🌐Fig 1E, S9 and S10 are enlisted to support the claims.
Fig 1E purports to represent the spatial distribution of COVID cases in Wuhan in Jan-Feb-2020.
It includes no population density data, and therefore cannot be used to support the claims.
This thread addresses the claim in Worobey et al that KDE analysis shows centering of Dec 2019 COVID case-residences on the Huanan Market. science.org/doi/10.1126/sc…
This apparent centering is an artifact due to use of an overly-large bandwidth in the KDE calculation.
Adjusting the bandwidth parameter to more realistic values shifts the center of the KDE pattern away from the Huanan market, to an neighbourhood north of the market where there truly is a significant cluster of case-residences.
Worobey et al base their interpretation on simplified KDE maps in which the influence of each data point is smeared over a large area (oversmoothing).
The pattern is centered on the Huanan Market, but this is simply an artifact of oversmoothing.
Worobey et al. (2022) science.org/doi/10.1126/sc…
Consider the KDE probability contours for the residences of Dec 2019 cases.
Data from zenodo.org/record/6908012…
*Linked* cases in green.
The map of the linked-cases KDE was omitted from the article...
...although the KDEs for all-cases and unlinked-cases were prominently displayed on Fig. 1, and featured in various tweets emitted by the authors.
The map ☝️uses the following from the data files supplied with the article at zenodo.org/record/6908012
▸ maps ▸ geojson
▸ who_cases_dec-2019.linked.KDE.contours.geojson
▸ who_cases_dec-2019.notLinked.KDE.contours.geojson
COVID-in-wastewater for BC Lower Mainland, to 4-June-2022. 🪡
From bccdc.ca/Health-Info-Si…
Red line at zero level added to emphasize that relatively high levels persist in wastewater, discordant with the low level of officially reported cases. (1/n) #covid19bc
The graph is dominated by the Jan-Dec peak, which effectively and conveniently scales down the current persistent high levels. The eye tends not to catch that the levels post-peak are much higher than the levels before. (2/n)
The description in the text seems determined to ignore these persistent high levels, and appears to focus on shorter-term bumps and noise in order to use the word “decreased”… (3/n)
(1/n) I can’t reply directly to this @angie_rasmussen tweet, promoting the famous epicenter preprint, now under peer review.
But I can re-up my critique.
TLDR: preprint gets an F, due to problems with math, cartography, and spatial analysis.
Sorry, nothing about furin cleavage.
(2/n) The main fallacies
▪️Centrality is not causality.
▪️Simplification is not analysis.
▪️The spread of a complex disease in a complex city of 11 million cannot be reduced to a model of diffusion in a homogeneous, isotropic medium. bit.ly/36KRtNX
(3/n) Math error
The centroid (X,Y) of a point cloud in a cartesian coordinate system is (mean of x, mean of y).
Cannot be calculated as (median of x, median of y).
Cannot be calculated from latitude/longitude, which is not cartesian.
(Centroids are a big deal in the preprint.)
So, this is where we are in BC.
▪️COVID still rampant, although we only know that from wastewater measurements @YVRCovidPlots.
▪️16,000 HCWs off sick at least 1 day last week, as per @adriandix.
▪️Doctors and such holding in-person, maskless conferences. @BCPSQC@DoctorsOfBC