Marc Johnson Profile picture
Mar 10 19 tweets 5 min read Read on X
Imagine that you were to experience a thunderstorm for the first time as an adult.

The sky turns dark in the middle of the day and lightning bolts start coming out of the sky.

This couldn’t seem natural.

This is kind of how I see things with COVID origins debates.
1/ Image
I’m from the Midwest, so tornadoes and violent thunderstorm seem normal to me.

I’m also a virologist, so some of the wild stuff that Coronaviruses do seems normal too.

2/
No one is 100% sure where SARS-CoV-2 came from, so I won’t argue that it’s natural.

I’m just going to explain (again) why a natural origin would not be surprising at all to me.

I’m not calling anyone stupid, I’m just explaining my point of view.

3/
The biggest difference between SARS-CoV-2 and the other Sarbecoviruses is the furin-cleavage site (FCS) that was inserted between S1 and S2, which has been argued is proof the virus was engineered.

There are other arguments too, but the FCS is the only point I’m discussing.

4/ Image
I’ll make 4 points.
1. Random insertions in Coronaviruses are common.
2. Insertions most often occur in genetic ‘soft spots’
3. The S1/S2 border is a clear ‘soft spot’.
4. Furin-cleave sites (FCS) are easy to make.

5/
I study cryptic lineages, which appear to be SC2 persistent infections that have lasted a very long time.

We don’t know who they come from, but we detect them in wastewater.


6/doi.org/10.1371/journa…
We’ve been working on a project to curate the whole genome of as many cryptic lineages as possible from wastewater sequences deposited into public databases.

We’ve curated 18 so far and they are pretty diverse.

7/ Image
One of the first things we noticed about these lineages is that insertions are really common. About a third of the cryptic lineages had at least one insertion.

The insertions are not random. There are ‘soft spots’ where insertions are tolerated.
8/
Here is an example.

2 of the 18 cryptic sequences contained an insertion in *precisely* the same site in the M gene.

Both were duplications from Orf1a, but they were completely different insertions.

9/ Image
Other soft spots in the SARS-CoV-2 genome are in the NTD of Spike, and at the S1/S2 border.

We know because we’ve seen lots of insertions occur there.

Here’s an example of an S1/S2 insert in the lineage AT.1, which circulated in 2021.


10/
Here’s another example, an insertion just before the FCS in a BA.1.1 sub-lineage that circulated in early 2022.

In this case we don't know for sure where the insertion sequence came from, probably a human RNA.

11/ Image
So insertions are common, and the place where the furin-cleave site (FCS) in SC2 appeared is a common place to see them.

So what is a furin-cleavage site?

12/
A furin-cleavage site is a loosely defined 4 AA sequence that the cellular protease furin can cut.

We can try to predict what will be a good FCS, but ultimately the protease gets to decide.

13/
We’ve seen very diverse sequences that furin will cut.

They always have at least one arginine (R), but more often two or three.

14/
As it happens, R is one of the most common codons.

6/64 possible codons produce R.

If you typed the sequence to produce 4 codons at random (12 nt), there is a 32.5% chance that at least one of them is going to be an Arg (R).

15/ Image
There are over 17,000 different sequences (~0.1% of possible combinations) that produce the 'canonical' FCS (R-X-(K/R)-R).

There are also many more 'non-canonical' sequences that still work.

16/
It’s obvious that having an FCS at the S1/S2 border can be advantageous to the virus in certain situations because we see it there a lot.

That’s why investigators (DEFUSE) had proposed to study this long before SC2 existed.

17/

sciencedirect.com/science/articl…
People will ask if I’ve seen an FCS acquired in real time, or if I've see one acquired by a Sarbecovirus (which is pretty specific).

This is sort of like asking ‘have you seen a lightning strike that looks exactly like this?’.

18/
The answer is No, I have not seen precisely that, but that doesn’t mean it hasn’t happened, or that it is surprising.

I see much stranger things all the time.

Shit happens.

19/

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Marc Johnson

Marc Johnson Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @SolidEvidence

Mar 9
Back to curating the viruses we find from randomly sequencing the wastewater virome.

I've been a virologist for 30 years, how have I not heard of some of these viruses?

I don't think we knew about smacoviruses when I went to grad school.
Quail influenza? Who has pet quail and is flushing their waste? And please stop.

Could be a duck or chicken host, but still.
We have a program that compares the sequences to a viral database, but you kind of have to BLAST some of them individually to confirm what is real. H9N2 was real.

This would be a fun crowd sourcing activity.
Read 11 tweets
Mar 3
One more round.

We've started a project where we are doing unbiased sequencing of the wastewater 'virome'.

Dec/Jan SARS-CoV-2 was the most common human RESPIRATORY virus detected, but what else was in the top 5?
You all have gotten most of them.
#1 SARS-CoV-2
#2 Adenoviruses
#3 AlphaCoronaviruses (229E & NL63)
#4 other BetaCoronaviruses (HKU1 & OC43)
#5 is an Influenza, but which one?
#6 is RSV (A+B)

I know, the grouping was not consistent.
Answer.
1 - SARS-CoV-2
2 - Adenovirus
3 - AlphaCoronaviruses
4 - other BetaCoronoviruses.
5 - Influenza C (surprised me).
6 - RSV

This has been a mild flu season compared to last, but maybe we were just looking for the wrong one. They don't test for FluC (supposed to be mild).
Read 4 tweets
Mar 2
We found an interesting cryptic lineage from Charlotte, NC recently.

[cryptic lineage = an unique evolutionarily advanced SARS-CoV-2 lineage found in wastewater from an unknown source]

1/ Image
This is one that popped up in a screen for sequences with the s2m ‘fix’.

Explained here.

2/

Image
The sample doesn’t contain B.1.1 or Delta changes in the N gene. This means it is presumably derived from a B.1 lineage that circulated over 3 years ago.

Long infection.

3/

Explained here.

Image
Read 11 tweets
Mar 2
Ohio cryptic lineage update.

Many people have asked what happened with the Ohio cryptic lineage, so I thought I would give an Ohio update.

[cryptic lineage = an unique evolutionarily advanced SARS-CoV-2 lineage found in wastewater from an unknown source]

1/ Image
Background.

Last year we described a cryptic lineage that was regularly detected in wastewater samples from Columbus, Ohio and Washington Court House (WCH), a town about 40 miles away.

2/
The lineage appeared off and on for about a year, but in May 2023 the shedding became extreme.

The COVID wastewater levels in WCH were really high, but based on the sequencing, it was almost entirely the cryptic lineage. It was all from the one person.
3/ Image
Read 13 tweets
Feb 26
Swiss variant update, another record is broken.

1/ Image
Late last year the Swiss deposited a whole bunch of wastewater sequences going back a few years. We screened the data and found a cryptic lineage from Lousanne in 2021.

I don’t think I ever posted about it because it wasn’t that exotic, and it disappeared 2 years ago.

2/ Image
The lineage was present from 5/21-7/21, 5 samples total. It’s one of many lineages that was a B.1 derivative with the s2m ‘bat reversion’ T29758G.

3/ Image
Read 9 tweets
Feb 26
I found (or rediscovered) another cryptic lineage this weekend, and this is the weirdest one yet.

Sometimes it almost feels like someone is playing an extremely elaborate prank on me.

I'm not sure even I believe this one.

1/
The lineage is from a sewershed in Switzerland. The group submitted hundreds of sequences recently and the lineage appeared twice in samples from late last year, both times from the same sewershed.

2/
The lineage is derived from B.1.416*, a lineage that circulated in late 2020 and was most prevalent in Switzerland.

I’m still working on reconstructing the genome, but there were several revealing bits in the parts I’ve analyzed.

3/ Image
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(