What's going on with ORF7b? Something unusual has occurred in ORF7b in two unrelated variants—two of the only non-XBB variants that remain competitive. They took entirely different routes to reach the same end point: a new ORF7b protein.
But first a short primer on ORF7b. 1/
SARS-CoV-2 genes are often divided into three categories: 1. Non-structural proteins (NSPs) 2. Structural proteins (spike, envelope, membrane, nucleocapsid or S, E, M, N) 3. Accessory proteins 2/
The 16 NSPs are in ORF1ab & make up the first ~2/3 of the genome. They have diverse functions, but most are primarily involved in viral replication in one way or another. Here are descriptions of the functions of the 16 NSPs from a few different sources. 3/
Structural proteins (envelope, membrane, spike, nucleocapsid—E, M, S, N) compose the physical structure of the virus. N has also been implicated in innate immune evasion. 4/
Accessory proteins (ORF3a, ORF6, ORF7a, ORF7b, ORF8, ORF9b, & possibly others) are less essential than the first two categories, & seem to be mainly involved in immune evasion. 5/
But it's not always clear what accessory proteins do. ORF8, after having had dozens of powerful functions attributed to it, has been nearly erased, resulting in an apparent increase in fitness. ORF7a has also been destroyed repeatedly. 6/
ORF7b is the smallest protein, at 43 amino acids (AA), & early ORF7b stop codons, which prematurely halt protein construction, have made regular appearances (though not as commonly as in ORF8). At first, the new ORF7b mutations seem similarly destructive of ORF7b. 7/
Each AA is coded by a "codon" made up of 3 nucleotides (nt), so any deletion not divisible by 3 completely changes the composition of the AA appearing after it. This is called a frameshift. Example: a 1-nt deletion in ORF6 in XBB.1.28.1 (spotted by @shay_fleishon). 8/
Frameshifts usually quickly lead to a stop codon, truncating the protein & making it incapable of carrying out its previous function. The 1-nt deletion in ORF6 in XBB.1.28.1, for example, finds a stop codon at AA site #11, hacking off the last 84% of ORF6. 9/
One variant—XBC.1.6, a Deltacron—has a 2-nucleotide deletion in ORF7b codon 13, causing a frameshift. But this frameshift doesn't lead to a stop codon until codon 32-33. 10/
The other variant—FR.1, a BN.1 descendant (which is a BA.2.75 descendant)—has a 1-nucleotide insertion at the same location. This causes a frameshift *identical* to the 2-nuc deletion in XBC.1.6, the only difference being an extra F residue in FR.1. 11/
So the new, frameshifted ORF7b is about ~75% as long as the old version. But the amino acid composition drastically differs, particularly in the number of polar vs nonpolar AA residues. Post-frameshift, 7 of the first 14 AA are polar vs 0/14 in the old ORF7b. 12/
But 10 of the last 11 AA residues of the original ORF7b are polar/hydrophilic, & the new, truncated ORF7b lacks these altogether. I think it's safe to say these are two very different proteins. 13/
But what does it all mean? The fact that two of the most competitive non-XBB variants have both stumbled on the same new protein through different routes suggests it may be beneficial in some way, perhaps throwing a spanner in the works of our immune response. 14/
Or the new ORF7b could be a junk protein w/no function. ORF7b likely isn't important enough to have a major impact. But I think this illustrates the sort of evolutionary surprises that are possible. Even as some are erased (ORF8), entirely new proteins can be created. 15/
Indeed at least one already has! It resulted from a 3-nucleotide mutation in N, creating a new TRS-B & leading to the production of the N* protein, which has been shown to interfere with the immune response and increase viral loads. 16/ nature.com/articles/s4146…
N* is now universal except for in Deltacrons like the XBC lineage. It's was a major development that hasn't seemed to get the attention it merits. But that would be an entirely different thread. Time to end this one. 17/
As always to the labs, lab workers, and researchers doing the groundwork, without which none of this would be possible. As funding is reduced and sequencing decreases, their work is more important than ever. 18/end
• • •
Missing some Tweet in this thread? You can try to
force a refresh
Very proud to be a co-author on this comprehensive preprint on the novel, growing saltation lineage BA.3.2, together with @Tuliodna, Darren Martin, Dikeledi Kekana, and lead author @graemedor. 1/9
I would normally write a summary 🧵 of the BA.3.2 mutational analysis here, but as much of my contribution parallels my previous BA.3.2 threads I'll just link to those here, w/brief descriptions of each.
BA.3.2 emerged in Nov 2024 after ~3 years of intrahost evolution with >50 new spike AA muts, but since then, it's changed very little. Could the drug molnupiravir (MOV) galvanize BA.3.2 into pursuing new evolutionary paths? A new 89-mut MOV BA.3.2 seq suggests it could. 1/11
Background on MOV: It's a mutagenic drug. Its purpose is to cause so many mutations that the virus becomes unviable & is cleared. But we've long known this often does not happen. Instead, the virus persists in highly mutated form & can be transmitted. 2/
I was an author on a paper published in @Nature that conclusively showed not only that MOV has created highly mutated, persistent viruses, but that these viruses have transmitted numerous times. See 🧵 below by lead author @theosanderson. 3/
The most valuable viral research tools—@nextstrain & CovSpectrum—are being destroyed, not only blocked from new data but now forbidden from even sharing info from the PAST. Why?
Because GISAID is run dictatorially by a con man, paranoid egomaniac, & liar named Peter Bogner. 1/
I use CovSpectrum & Nextstrain every day—& I'm not the only one. Every Covid thread I've ever posted here has relied partly on CovSpectrum & Nextstrain for information & visuals. These vital tools have now been stolen from us by a world-class grifter. 2/ thinkglobalhealth.org/article/to-fin…
For years scientists knew something was very, very wrong with GISAID, but the breakout story (from which much of this 🧵is based) came 2 years ago in @ScienceMagazine from @sciencecohen & Martin Enserik. 3/ science.org/content/articl…
3/77 sequences from the latest Netherlands upload are BA.3.2 as well as 4/86 seqs from Queensland, Australia, consistent w/the steady, slow growth we've seen in Germany, the UK, Ireland, & much of Australia. 1/4
One interesting (and possibly coincidental) aspect of the BA.3.2 tree: Two large branches have NSP14 mutations at adjacent AA residues—ORF1b:T1896I and ORF1b:H1897Y. 2/4
I don't have any idea what functional effects either of these mutations would have. They are both C->T mutations, which is the most common type, but they've been relatively uncommon throughout the pandemic, with fewer than 8000 sequences combined. 3/4
The first instance involved a small cluster of sequences that hospitalized several people & resulted in the death of a young child in early 2022. More on this one later. 2/15
The most recent example requires some background. In late 2024, a spectacularly mutated Delta appeared in Spain with 40 new spike mutations and numerous Cryptic markers.
Normally, I would write a thread about such a remarkable sequence, but there were some issues... 3/15
@StuartTurville has pointed out that WA delayed Covid spread longer than elsewhere in Australia. China has a somewhat similar immune history (as do other SE Asian countries). Perhaps BA.3.2 will do well in China once it arrives there? 2/4