Imagine that you were to experience a thunderstorm for the first time as an adult.
The sky turns dark in the middle of the day and lightning bolts start coming out of the sky.
This couldn’t seem natural.
This is kind of how I see things with COVID origins debates. 1/
I’m from the Midwest, so tornadoes and violent thunderstorm seem normal to me.
I’m also a virologist, so some of the wild stuff that Coronaviruses do seems normal too.
2/
No one is 100% sure where SARS-CoV-2 came from, so I won’t argue that it’s natural.
I’m just going to explain (again) why a natural origin would not be surprising at all to me.
I’m not calling anyone stupid, I’m just explaining my point of view.
3/
The biggest difference between SARS-CoV-2 and the other Sarbecoviruses is the furin-cleavage site (FCS) that was inserted between S1 and S2, which has been argued is proof the virus was engineered.
There are other arguments too, but the FCS is the only point I’m discussing.
4/
I’ll make 4 points. 1. Random insertions in Coronaviruses are common. 2. Insertions most often occur in genetic ‘soft spots’ 3. The S1/S2 border is a clear ‘soft spot’. 4. Furin-cleave sites (FCS) are easy to make.
5/
I study cryptic lineages, which appear to be SC2 persistent infections that have lasted a very long time.
We don’t know who they come from, but we detect them in wastewater.
We’ve been working on a project to curate the whole genome of as many cryptic lineages as possible from wastewater sequences deposited into public databases.
We’ve curated 18 so far and they are pretty diverse.
7/
One of the first things we noticed about these lineages is that insertions are really common. About a third of the cryptic lineages had at least one insertion.
The insertions are not random. There are ‘soft spots’ where insertions are tolerated.
8/
Here is an example.
2 of the 18 cryptic sequences contained an insertion in *precisely* the same site in the M gene.
Both were duplications from Orf1a, but they were completely different insertions.
9/
Other soft spots in the SARS-CoV-2 genome are in the NTD of Spike, and at the S1/S2 border.
We know because we’ve seen lots of insertions occur there.
Here’s an example of an S1/S2 insert in the lineage AT.1, which circulated in 2021.
Back to curating the viruses we find from randomly sequencing the wastewater virome.
I've been a virologist for 30 years, how have I not heard of some of these viruses?
I don't think we knew about smacoviruses when I went to grad school.
Quail influenza? Who has pet quail and is flushing their waste? And please stop.
Could be a duck or chicken host, but still.
We have a program that compares the sequences to a viral database, but you kind of have to BLAST some of them individually to confirm what is real. H9N2 was real.
We've started a project where we are doing unbiased sequencing of the wastewater 'virome'.
Dec/Jan SARS-CoV-2 was the most common human RESPIRATORY virus detected, but what else was in the top 5?
You all have gotten most of them.
#1 SARS-CoV-2
#2 Adenoviruses
#3 AlphaCoronaviruses (229E & NL63)
#4 other BetaCoronaviruses (HKU1 & OC43)
#5 is an Influenza, but which one?
#6 is RSV (A+B)
The sample doesn’t contain B.1.1 or Delta changes in the N gene. This means it is presumably derived from a B.1 lineage that circulated over 3 years ago.
Many people have asked what happened with the Ohio cryptic lineage, so I thought I would give an Ohio update.
[cryptic lineage = an unique evolutionarily advanced SARS-CoV-2 lineage found in wastewater from an unknown source]
1/
Background.
Last year we described a cryptic lineage that was regularly detected in wastewater samples from Columbus, Ohio and Washington Court House (WCH), a town about 40 miles away.
The lineage appeared off and on for about a year, but in May 2023 the shedding became extreme.
The COVID wastewater levels in WCH were really high, but based on the sequencing, it was almost entirely the cryptic lineage. It was all from the one person. 3/
Late last year the Swiss deposited a whole bunch of wastewater sequences going back a few years. We screened the data and found a cryptic lineage from Lousanne in 2021.
I don’t think I ever posted about it because it wasn’t that exotic, and it disappeared 2 years ago.
2/
The lineage was present from 5/21-7/21, 5 samples total. It’s one of many lineages that was a B.1 derivative with the s2m ‘bat reversion’ T29758G.
I found (or rediscovered) another cryptic lineage this weekend, and this is the weirdest one yet.
Sometimes it almost feels like someone is playing an extremely elaborate prank on me.
I'm not sure even I believe this one.
1/
The lineage is from a sewershed in Switzerland. The group submitted hundreds of sequences recently and the lineage appeared twice in samples from late last year, both times from the same sewershed.
2/
The lineage is derived from B.1.416*, a lineage that circulated in late 2020 and was most prevalent in Switzerland.
I’m still working on reconstructing the genome, but there were several revealing bits in the parts I’ve analyzed.