Marc Johnson Profile picture
Sep 18, 2022 19 tweets 7 min read Read on X
SARS-CoV-2 lineages, Cryptic lineages, and a prediction of what the next dominant VOCs will be.
We sequence SARS-CoV-2 from wastewater using a different approach than most. We sequence smallish fragments of the genome so that we can tell if particular mutations are derived from the same virus. This way we are better able to determine if there is anything new circulating. Image
About a year and a half ago we detected something from a St. Louis sewershed that didn't make sense. SARS-CoV-2 lineages whose sequence did not match anything that had been seen from patients. We called these 'cryptic lineage'. Image
Wanting to know if this kind of thing was common, we started collaborating with @DrJDennehy to see if we could find anything similar in other places. NYC also had cryptic lineages. Not the same lineages, but lineages with similar characteristics.
nature.com/articles/s4146…
By the end of 2021, we had identified 9 sewersheds (out of about 180 that were being routinely surveyed) that contained cryptic lineages, including a lineage from California with @RoseKantor.
medrxiv.org/content/10.110…
My obsession with finding and characterizing these cryptic lineages accelerated in 2022. To date we have screened wastewater from around 700 sewersheds. In total we have found 24 putative lineages. I say putative because some of them we have only seen once.
We still didn't know where these lineages were coming from. I was convinced of an animal reservoir. What changed my mind is when we tracked a lineage in Wisconsin with @dho all the way to a single set of bathrooms. There is no animal contribution. It is from a person. Image
What we think is happening is that cryptic lineages are very long-term infections in people (often >1 year). Perhaps GI infections. The hosts are obviously mounting an immune response, but are not able to clear the infection.
Because there are no genetic bottlenecks from spreading from person-to-person, these viruses basically push the evolutionary fast-forward button.
What can these lineages tell us about circulating viruses? These are the sites in the Spike RBD that are most commonly changed in the cryptic lineages. We had seen lineages with these changes long before they were seen in Omicron. Image
When Omicron arrived (shown in red) it had mutations at many of the same sites. The main exceptions were L452 and N460, which were common in cryptic lineages, but were not in Omicron (BA.1). Image
About six months later we started seeing Omicron lineages with mutations at L452. Highlighted in red are the changes in BA.4/5 (which has L452R), but there were also lineages with L452Q, L452M, and others. These lineages took over, but there were still no changes at N460. Image
Finally, a few months ago N460K appeared in an Omicron lineage (BA.2.75). However, it was not combined with any of the L452 mutations. Nonetheless, BA.2.75 and its derivatives have continued to expand and slowly displace other lineages. Image
A few days ago, a new lineage appeared in a MO wastewater sample. Omicron with L452R+N460K (and K444M). This appears to match a new lineage that was just designated as BU.1 a few days ago. Currently there are only 13 sequences in GISAID with this combination of mutations. Image
However, I recently learned from @CorneliusRoemer that I was behind in my lineages. There is another completely independent lineage that arrived at almost exactly the same combination. This lineage is designated BQ.1.1. This group has L452R+K444T (rather than K444M).
In addition, BQ.1.1 has also picked up R346T (also in BA.4.6). We haven't focused as much on this region of spike, but most cryptic lineages that we have checked also have a mutation at this site. It seems to be pretty critical. BQ.1.1 is new, but it seems to be taking off.
There you have it. BQ and BU. I don't know if they will cause a spike in total cases, but barring something completely new appearing, I predict that they will be the dominant VOCs in the coming months.
Addendum. It seems a few other lineages have hit very similar 'jackpot' combinations very recently. In addition to BU and BQ:

BW.1 (BA.5.6 derivative)
BS.1 (BA.2.3 derivative)
BR.2 (BA.2.75 derivative)
other undesignated

They all have N460K, L452R + other changes.
Real pango experts, please correct me if I botch any of these, or have missed any. @CorneliusRoemer @PeacockFlu @siamosolocani @LongDesertTrain

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Marc Johnson

Marc Johnson Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @SolidEvidence

Feb 11
Brief update on the new cryptic lineage we found from Petersburg City, Virginia.

We went back and screened all of the samples from that sewershed since the beginning of 2024 and learned a few things about it.
1/ Image
First, I think I was wrong about the lineage being JN.1 derived. I thought it was JN.1 because it had 22926C (455S), but it looks like it only acquired that recently.

In samples as recent as December the lineage lacked 455S and 456L.
2/ Image
That would mean the lineage is BA.2.86-derived, which suggests it was acquired probably early 2024.

Caveat, as @LongDesertTrain points out, persist infections hate 455S. It’s possible that the lineage was JN.1, but reverted at 455, but then gained 2 nt creating 455A.
3/ Image
Read 8 tweets
Jan 31
Wastewater variant update. This is the composite data from over 1,000 US samples collected over the last 6 weeks.
1/ Image
You have to extrapolate a little bit because several changes are shared by multiple lineages.

It appears that the new lineage I mentioned last week (MC.10.1 + 445P) is around 4% and is the fastest growing of the lot. It now has a PANGO designation - PA.1
2/
LP.8 is still expanding is is probably about 12% now. Since it is a KP.3.1.1 derivative, KP.3.1.1* might become dominant again.

LF.7 seems to be holding on too at about 4%.
3/
Read 7 tweets
Jan 26
Here's the latest composite US wastewater data.

It's a little bit confusing this week.

1/ Image
Clearly LP.8 is still the main lineage gaining traction. All of its changes are moving in the same direction (up).

LF.7 is much lower, but looking a little bit more alive than last week.
2/ Image
445P is mixed. It is decreasing, but we know that the signal is a mix of LB.1.3.1 and a new lineage which is MC.10.1+445P (which also has A435S).

445P is decreasing, but much of that is likely the drop in LB.1.3.1 when you compare to 183H.

3/
Read 6 tweets
Jan 24
What fraction of patient sequences are derived from persistent SARS-CoV-2 infections? (volume 3)

This is something that we can actually calculate.
1/
The key is the mutation Orf1a:K1795Q, which frequently appears in persistent infections (and even more often in cryptic lineages).

2/
Each time I make this calculation, I check 2 empirical numbers.

1. What fraction of sequences with Orf1a:K1795Q are from persistent infections

2. What fraction of persistent infections acquire Orf1a:K1795Q

3/
Read 17 tweets
Jan 22
There was a nice paper that recently came out about a patient that had COVID for 521 days.

I find it interesting for different reasons that most though.
1/

nature.com/articles/s4152…
We also study persistent infections, except we don't know who the patients are.


2/
Here's what immediately got my attention, the RBD mutations the clone acquired. Every one of the mutations we've seen before in cryptic lineages.
3/ Image
Read 13 tweets
Jan 5
Interesting, it looks like Atlanta, Georgia has a REALLY old cryptic lineage.

I would call it the CDC variant, but it’s the wrong sewershed.

1/ Image
I didn’t even know Georgia was doing wastewater sequencing, but a bunch of sequences appeared in SRA right after Christmas.

15 of the samples from the sewershed with pop. 190k (Cobb county) had the s2m fix.
2/
The ‘s2m fix’ means the variant is derived from BA.1 (original Omicron) or earlier. This lineage lacked 29742T, so it probably isn’t a Delta.

Strangely, I saw another s2m fix from Atlanta about a year ago, but different sewershed and the sequence doesn’t match.
3/
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(