Marc Johnson Profile picture
Nov 29 12 tweets 4 min read Read on X
GISAID vs SRA/WW
I thought I would do a little comparison to see how wastewater sequencing data compares with patient sequencing data in evaluating viral trends.
1/
cdc.gov/nwss/index.html
For WW I took all of the samples from our most recent SRA download that were collected in the last month (~500 samples). This wasn’t normalized.

For the patient side I used Cov-Spectrum data (because it's public) from the last month (8,302 sequences).
2/
cov-spectrum.org/explore/World/…
There are about 50k patient samples collected for sequencing each month, but there is always a delay before they are all sequenced and uploaded.

In this regard, the WW data is much faster.
3/
Example: Verily (who has CDC/NWSS contract) already has 145 samples available from the last 2 weeks.

By contrast, there are only 13 US patient sequences available that were collected in the last 2 weeks.

Plus, each WW sample represents 10s to 100s of thousands of people.
4/ Image
From the SRA/WW samples there were 80+ changes in Spike that were essentially consensus. These perfectly matched the changes in JN.1.

Also near consensus were F456L (99%) and Q493E (90%).

From the patient sequences F456L was 96% and Q493E was 89%.

Not a bad match.
5/ Image
Next on the WW list was S31- (68%), T22N (20%), and F59S (18.5%).
From the patient sequences S31- was 61%, T22N (30%) and F59S (27%).

T22N/F59S is mostly XEC. The small disconnect is probably because more of the WW data is mostly from the US where XEC isn't as prevalent.
6/ Image
US sequences from the last month were: 31- (69%) , S22N (24%), and F59S (21%).
7/
The 31- includes KP.3.1.1*, but also a bunch of other lineages.

To estimate KP.3.1.1 prevalence I looked outside of spike. 13,121T was 58%. (12,616T had low coverage).

In patients 12,616T and 13,121T were both 56%.

Good agreement, KP.3.1.1 at ~56-58%.
8/ Image
Next from WW/SRA were R346T (17.6%), T572I (8.1%), Q183H (6.9%) and H146Q (6.4%).
Numbers for patients were 10%, 10%, 3%, and 1%, respectively.
Not sure why there is a bit more of a disconnect with these, but I’m guessing it is because the patient data is behind. We’ll see.
10/ Image
Here are the rest of the WW changes at 2% or higher. Despite my prognosticating, F456V (MV.1, not listed) is still only at 0.2%.
11/ Image
It's worth noting that in addition to being fast, wastewater sequencing is pretty cheap compared to patient sequencing.
12/
All in all, I'd say the WW data does quite well at getting a fast, cheap and accurate overview.

The future of wastewater surveillance is not certain, but I hope future administrations recognize that it is an efficient and cost-effective means of monitoring pathogens.
13/13

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Marc Johnson

Marc Johnson Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @SolidEvidence

Nov 28
Maryland variant, retrospective analysis.

I decided to have a more careful look back at the evolution of the Maryland cryptic lineage.
1/ Image
Standard explanations and disclaimers.
Cryptic lineage: unique, evolutionary advanced SARS-CoV-2 lineages detected in wastewater from an unknown source.
Cryptics are not from animals, they are long term infections.
2/
Cryptics generally are not contagious and we think they are probably GI infection.

The virus in wastewater is not infectious.

3/
Read 12 tweets
Nov 22
If anyone wants to follow along with the Maryland variant (or doesn't believe my analysis), have a look for yourself.

1/
Go to
Type in SRR31400336 and start alignment.

This is a sample from the Maryland sewershed collected on November 7 of this year.
2/deeperseq.genomium.org
This is the RBD region of the Maryland sewershed, below is a normal sewershed.

It doesn't take a molecular virologist to see that one doesn't look like the other.

3/ Image
Read 4 tweets
Nov 20
Maryland folks, I need another favor.
There is a person from Anne Arundel county that has been infected with SARS-CoV-2 for about 3 years (Delta infection).

They probably don’t even know they are infected, but they are shedding a ton of viral material in wastewater
1/ Image
I’m trying to find this person without invading their privacy, if they are willing to be found.

Here are a few threads I’ve written about this variant if you want to read up.

x.com/SolidEvidence/…
x.com/SolidEvidence/…
x.com/SolidEvidence/…
2/
We figured out that the signal is from the Patuxent sewershed (78k people). We’ve been detecting the variant there since this Spring.

Interestingly, the COVID WW spikes were dates when the cryptic sequence was highly prevalent.

Those spikes were driven by one person!

3/ Image
Read 8 tweets
Nov 17
Need a little help.

Does anyone know someone that works at the Patuxent Water Reclamation Facility in Crofton, MD that they could put me in touch with?

Here's why.
1/ Image
There is a cryptic lineage (unique, evolutionarily advanced SARS-CoV-2 lineage detected in wastewater) that we have been detecting from a Maryland sewershed all year.

The lineage is derived from Delta, so it's from a person that was first infected about 3 years ago.

2/ Image
Please don't ask me how we know this isn't coming from an animal.

That's what I thought at first too, but at this point we are all but certain that the sequences are coming from individual people. Here's one of many threads I've written about this.
3/
Read 12 tweets
Nov 15
Postmortem.

Thanks for all of the retweets!

Late last night the twitter/x account was suspended and I also can't view the ResearchGate account anymore.

Here's a summary of some of the discussion/findings.

1/
I was going to post the full exchange I had with 'Julia', but I can't view it anymore. It was ~three exchanges and they were very benign. I stopped when I decided she was probably fake, but then she tried to reengage a few days later, which is when I investigated.
2/
For those who think I'm naive.

I have had numerous 'weird' messages from people that found me on this platform that turned out to be EXTREMELY beneficial both personally and professionally.

If something isn't obviously fraud, I take it at face value (with skepticism).
3/
Read 12 tweets
Nov 14
I knew there were a lot of fake accounts on this platform, but they are usually obvious. I had no idea how intricate and complex the ruse could be.

Get a load of this story.
1/
About a week ago I got a DM from an account asking me a benign but specific question about my research.

This happens to me all the time. I’ve met some interesting people this way.

Sometimes people even look up my number and call my office. It happens.
2/ Image
The account looked legitimate. They are a paying account (I’ve never seen a bot that pays).
They’ve been on the platform for over 2 years.
They have 52 followers, some of whom I recognize, and they compulsively repost interesting science posts (including some of mine).
3/ Image
Read 16 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(