1. Sequences with Orf1a:K1795Q usually (>80% of the time) come from persistent infections (excluding Gamma and BS.1).
2. About 4-10% of all long persistent infections acquire Orf1a:K1795Q.
2/
Let's revisit the first assumption with new data.
How many times did Orf1a:K1795Q appear in sequences in the last 6 months?
69
How many of those were ‘older’ lineages?
About 60.
So definitely over 80%. Assumption still stands.
3/
Now the second assumption.
How many times has Orf1a:1795Q appeared in BA.4* lineages since it stopped circulating (which I set at 2/1/23)?
21
How many total BA.4* sequences were reported during that time?
204
21/204 = 10.3%. About the same. Assumption still stands.
4/
So if the ~60 persistent infections with Orf1a:1795Q reported in the last 6 months represent 5-10%, that means the actual number should have been 600-1200 out of 479k total sequences.
= 0.125-0.25% of all sequences. Similar number as before.
5/
This is a bit lower range than what was estimated in the paper. They estimated 0.1-0.5% were more that 60 days, but I suspect what I am measuring is actually a smaller cohort of people with longer infections.
I was really conservative where I drew the line for when lineages stopped circulating, so for my calculation the infection really had to have lasted at least 4 months.
Look at the vasty different distribution of seqs of BA.1* and BA.1*+1795Q.
7/
Interestingly, none of the patients in the Nature paper had actually picked up Orf1a:K1795Q. I think this is because these patients had not ‘incubated’ long enough.
They did identify two mutations that did appear somewhat ‘persistent specific’: Orf1a:1638I and 4311I.
8/
Orf1a:1638I is clearly highly prevalent in persistent infections. Look at the difference between BA.1* and BA.1*+1638I.
Not as selective as 1795Q, but clearly linked to persistent infection.
9/
Orf1a:4311I was not as obvious, but it did still show a small degree of enrichment in persistent infections.
10/
So the summary really hasn’t changed since last summer.
At least 1 in 1000 SARS-CoV-2 sequences appear be derived from very long persistent infections.
11/
Caveats:
-The data isn’t perfect. At least some of the samples had the wrong date entered, and at least some of the samples represent patients that were sampled multiple times.
-We don’t know whether people with persistent infections are more or less likely to be sampled.
12/
Finally, the most important caveat (also pointed out by @LongDesertTrain). The seqs are all from nasal samples.
How many more persistent infections occur with the virus replicating elsewhere in the body? We know this occurs.
IMO, more research is needed on this.
13/
• • •
Missing some Tweet in this thread? You can try to
force a refresh
Biosafety restrictions and Biothreat assessment: a rant.
Regulations are fine, but double standards are frustrating.
1/
Background, my lab studies cryptic lineages: evolutionarily advanced SARS-CoV-2 lineages detected in wastewater from an unknown source (we are now pretty certain cryptics are from long persistent infections).
Background: Cryptic lineages are evolutionarily advanced SARS-CoV-2 lineages detected in wastewater from an unknown source.
We are fairly certain that these are derived from patients with persistent SARS-CoV-2 infections.
1/
If you are curious why we believe that cryptic lineages are from individual patients and not some non-human source, please read this thread. The paper on this should be coming out soon.
The NJ lineage was first detected in mid-2023, but we didn’t get a clean look at its Spike sequence until November. It’s a B.1-derived lineage, so it is probably at least a 3 year infection.
3/
The key question with a new variant is if it has the ability to get through our first major line of defense, our neutralizing antibodies (NA).
The vast majority (>90%) of NAs target the receptor binding domain of Spike, so that is where I look for answers.
2/
There are 3 main classes of NAs that target the RBD, and we know a lot about where each targets. This is a cheat sheet I made about 3 years ago based on the data from @jbloom_lab and @yunlong_cao.
COVID numbers always seem to go down early in the year, but this year is more optimistic than most. 1/
In January 2021 numbers were rapidly declining, but there was uncertainty because the first major variants of concern (Alpha, Beta, Gamma) were moving in.
Those 3 didn’t have much impact, but unforeseen was Delta, which arrived in May. 2/
By February 2022 the first Omicron wave (BA.1) was rapidly declining, but it was already clear that BA.2 was going to be next.
Although BA.1 had no descendants, unforeseen was just how adaptable BA.2 and its descendants (BA.2.12.1/BA.4/BA.5) were going to be.