Post

How to get URL link on X (Twitter) App

On the Twitter thread, click on or icon on the bottom
Click again on or Share Via icon
Click on Copy Link to Tweet
Paste it above and click "Unroll Thread"!
More info at Twitter Help

Marc Johnson

@SolidEvidence

Sep 18, 2022 • 19 tweets • 7 min read • Read on X

Scrolly

SARS-CoV-2 lineages, Cryptic lineages, and a prediction of what the next dominant VOCs will be.

We sequence SARS-CoV-2 from wastewater using a different approach than most. We sequence smallish fragments of the genome so that we can tell if particular mutations are derived from the same virus. This way we are better able to determine if there is anything new circulating.

About a year and a half ago we detected something from a St. Louis sewershed that didn't make sense. SARS-CoV-2 lineages whose sequence did not match anything that had been seen from patients. We called these 'cryptic lineage'.

@DrJDennehy

Wanting to know if this kind of thing was common, we started collaborating with @DrJDennehy to see if we could find anything similar in other places. NYC also had cryptic lineages. Not the same lineages, but lineages with similar characteristics.
nature.com/articles/s4146…

@RoseKantor

By the end of 2021, we had identified 9 sewersheds (out of about 180 that were being routinely surveyed) that contained cryptic lineages, including a lineage from California with @RoseKantor.
medrxiv.org/content/10.110…

My obsession with finding and characterizing these cryptic lineages accelerated in 2022. To date we have screened wastewater from around 700 sewersheds. In total we have found 24 putative lineages. I say putative because some of them we have only seen once.

@dho

We still didn't know where these lineages were coming from. I was convinced of an animal reservoir. What changed my mind is when we tracked a lineage in Wisconsin with @dho all the way to a single set of bathrooms. There is no animal contribution. It is from a person.

What we think is happening is that cryptic lineages are very long-term infections in people (often >1 year). Perhaps GI infections. The hosts are obviously mounting an immune response, but are not able to clear the infection.

Because there are no genetic bottlenecks from spreading from person-to-person, these viruses basically push the evolutionary fast-forward button.

What can these lineages tell us about circulating viruses? These are the sites in the Spike RBD that are most commonly changed in the cryptic lineages. We had seen lineages with these changes long before they were seen in Omicron.

When Omicron arrived (shown in red) it had mutations at many of the same sites. The main exceptions were L452 and N460, which were common in cryptic lineages, but were not in Omicron (BA.1).

About six months later we started seeing Omicron lineages with mutations at L452. Highlighted in red are the changes in BA.4/5 (which has L452R), but there were also lineages with L452Q, L452M, and others. These lineages took over, but there were still no changes at N460.

Finally, a few months ago N460K appeared in an Omicron lineage (BA.2.75). However, it was not combined with any of the L452 mutations. Nonetheless, BA.2.75 and its derivatives have continued to expand and slowly displace other lineages.

A few days ago, a new lineage appeared in a MO wastewater sample. Omicron with L452R+N460K (and K444M). This appears to match a new lineage that was just designated as BU.1 a few days ago. Currently there are only 13 sequences in GISAID with this combination of mutations.

@CorneliusRoemer

However, I recently learned from @CorneliusRoemer that I was behind in my lineages. There is another completely independent lineage that arrived at almost exactly the same combination. This lineage is designated BQ.1.1. This group has L452R+K444T (rather than K444M).

In addition, BQ.1.1 has also picked up R346T (also in BA.4.6). We haven't focused as much on this region of spike, but most cryptic lineages that we have checked also have a mutation at this site. It seems to be pretty critical. BQ.1.1 is new, but it seems to be taking off.

There you have it. BQ and BU. I don't know if they will cause a spike in total cases, but barring something completely new appearing, I predict that they will be the dominant VOCs in the coming months.

Addendum. It seems a few other lineages have hit very similar 'jackpot' combinations very recently. In addition to BU and BQ:

BW.1 (BA.5.6 derivative)
BS.1 (BA.2.3 derivative)
BR.2 (BA.2.75 derivative)
other undesignated

They all have N460K, L452R + other changes.

@CorneliusRoemer

Real pango experts, please correct me if I botch any of these, or have missed any. @CorneliusRoemer @PeacockFlu @siamosolocani @LongDesertTrain

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @SolidEvidence

Marc Johnson

@SolidEvidence

Jan 24

We found a new (I think) cryptic lineage this week.
I know I say this all the time, but this is really weird.
Warning, this thread is for nerds only.
1/

Here’s what we do. Every week we download all of the new sequences from SRA and run a bunch of screens to look for anachronistic or cryptic lineages.

This new one popped up in 3 different screens.
2/

A good way to spot anachronistic lineages is to look for sequences that have been deleted in contemporary lineages. The virus can only undo a deletion through recombination. If we find seqs that lack the deletions, they have to be old (or contaminated with something old).
3/

Read 16 tweets

Marc Johnson

@SolidEvidence

Nov 23, 2025

What should we expect this flu season?

Here’s a forecast from a wastewater perspective (because sh*t don’t lie)

1/

Background. The 4 main kinds of influenza circulating among humans (in order of severity) are:
FluA H3N2
FluA H1N1
FluB
FluC (many don’t know this one)

2/

Last season, there was a pretty even split between H1N1 and H3N2, with a little bit of FluB late in the season. At least according to CDC patient data.
3/

Read 13 tweets

Marc Johnson

@SolidEvidence

Nov 21, 2025

https://x.com/SolidEvidence/status/1758255313440907668

This is wild.

Remember the NJ crytic lineage?

I posted 18 months ago that the Spike was too divergent to predict ACE2 binding, and asked if someone else could figure it out.

Some colleagues took me up on it.

Guess what they found?
1/

https://x.com/SolidEvidence/status/1758255313440907668

This preprint just came out. @wchnicholas and team reconstructed and tested the NJ Spike and found that it has the tightest ACE2 binding of any SC2 Spike ever measured.
2/
medrxiv.org/content/10.110…

https://x.com/SolidEvidence/status/1590072665561497601

We first found the NJ variant in 2023 because this sewershed from NJ with 1.5 million people because it regularly had a sequence that was a reversion to the bat sarbeco sequence, which is common in cryptics.
3/

https://x.com/SolidEvidence/status/1590072665561497601

Read 9 tweets

Marc Johnson

@SolidEvidence

Oct 31, 2025

Can you take a quarter cup of composite sewage, simply ask ‘what’s in there?’, and find out all of the pathogens circulating in that community?

That is the question we asked in our latest pre-print.

Turns out you can.
1/
medrxiv.org/content/10.110…

We are not the first group to do unbiased sequencing of wastewater to monitor circulating viruses, but I think we are the first to ever do it at this scale.

Weekly wastewater samples for 18 months, totaling over 85 Billion sequence reads.

2/

Among the ‘known’ viruses, there was a fairly even split between bacteria viruses (phages) and eukaryotic viruses.
This was just raw reads though, if you look at diversity there was considerably more species of phages.
3/

Read 23 tweets

Marc Johnson

@SolidEvidence

Oct 24, 2025

Help me out, I’ve got another wastewater virus mystery.

This one really blows my mind.
1/

Starting in the late 2023, + @securebio have been doing ultra-deep metagenomic sequencing of the virome from Columbia, MO wastewater.

We’ve collected and sequenced sample for over 90 consecutive weeks.
2/Lung.fish

We sequence about a billion reads per sample. That’s generated about 16TB of data from this site so far.

To put this in perspective for people my age, it would take a stack of 3.5 in floppy disks 200 miles high to store this data.
3/

Read 12 tweets

Marc Johnson

@SolidEvidence

Oct 17, 2025

It looks like Coeur d’Alene, ID cryptic is gone for now, but it has still managed to answer a lot of lingering questions for me about SARS-CoV-2 evolution, and what to expect next.

Here's a whole genome summary and interpretation.
1/

For a long time cryptic lineages were all from pre-Omicron lineages.

I started wondering:

Will there be Omicron cryptics?

If so, will they have the same evolutionary trajectories as the pre-Omicron cryptics?

ID shows that the answer to both questions is yes.
2/

We don’t do a lot of whole genome sequencing, so I sent 3 samples to @dho lab, who got fantastic sequences for all 3.
These samples were virtually 100% cryptic, so we have nearly complete coverage of the genome for a change.
3/

Read 12 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Enter URL or ID to Unroll

Marc Johnson

Try unrolling a thread yourself!

More from @SolidEvidence

Marc Johnson

Marc Johnson

Marc Johnson

Marc Johnson

Marc Johnson

Marc Johnson

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!