Something’s happening here: BA.2.86 and the furin cleavage site (FCS)
The FCS has been highly conserved in all SARS-CoV-2 lineages. Why is it disappearing so much more frequently in BA.2.86/JN.1? 1/16
The FCS, located around S:681-685, is one of the most distinctive features of SARS-CoV-2. Its presence causes the spike to be cleaved within the cell by furin, priming it for membrane fusion & cell entry (which requires a 2nd spike cut by TMPRSS2). 2/16
Early studies showed the FCS to be essential for SARS-CoV-2 infection of lung cells, and a study by @PeacockFlu showed that the FCS was required for transmission (in ferrets) & evaded host immune defenses. 3/16
The FCS is not only highly conserved but has been enhanced by P681H/P681R—which have been universal for nearly 3 years. Omicron added the FCS-adjacent S:N679K, but apart from that, even mutations in the surrounding region have been uncommon. 4/16
But in BA.2.86—mainly JN.1 (BA.2.86 + S:L455S = JN.1) as it’s the dominant form of BA.2.86—FCS-destroying mutations have arisen with remarkable frequency. To be clear, this is a tiny minority of all BA.2.86, but it’s a clear departure from previous trends. 5/16
The most common FCS-destroying mutation has been S:R683W, which has been in ~36 BA.2.86* sequences. These are very scattered, so they represent 20 or more independent acquisitions. The largest R683W cluster seems to be five sequences. 6/16
FCS-destroyer S:R683Q is less common, with 14 sequences, but 11 of those sequences belong to the same branch, which has spread across 5 countries and 3 continents. 7/16
Intriguingly, this branch is part of a larger (400-sequence) JN.1 branch with a 2-nucleotide mutation: ORF1b:V1271T (NSP13_V348T). Two-nucleotide mutations acquired in one leap are extremely rare, yet this one occurred in three different (small) XBB lineages in 2023. 8/16
Prior to 2023, ORF1b:V1271T was in just 25 sequences—one Delta, the rest BA.1/2/5. All acquired both nuc mutations in one jump. Only 278 sequences ever have had 1 of these 2 nuc mutations but not the other, a strong sign each individually is deleterious. 9/16
Curiously, there appears to be a 4-sequence branch just below the S:R683Q one (within the ORF1b:V1271T lineage) that has lost S:H681R through an S:R681P reversion. The furthest sequence on this branch seems to have acquired the extraordinarily rare S:P681S. 10/16
It’s not entirely clear of the four S:R681P sequences on the branch above are real or some sort of artifact, but the sequences look clean, come from 3 different originating labs and 2 different submitting labs, and fit the pattern of unusual FCS mutations in BA.2.86. 11/16
While the FCS-destroying R683W & R683Q are the most extreme examples, several other FCS or FCS-adjacent mutations have sprung up w/BA.2.86. S:K679M, for example, is in 22 BA.2.86, compared to 29 previous sequences ever (of which zero had S:681R). 12/16
More common have been mutations at the FCS-adjacent S:S680 in BA.2.86. S680F is the most frequent, but S680P and S680Y have also appeared numerous times. I do not know what effect this is expected to have on the FCS. 13/16
S:R681H reversions (which brings 681 back to the residue in almost all other Omicron lineages) have also been quite common. But there are so many contaminated sequences recently that deciphering exactly how common this reversion has been would require a lot of work. 14/16
There also seem to be an increase in mutations in various other FCS-region residues, but I don’t have the energy to do an analysis of that right now. I’m guessing lineage arch-guru @siamosolocani can speak to this though. 15/16
What is the meaning of it all? People probably get tired of me saying this, but I don’t know. S:681R is thought by most to increase FCS efficiency. Does this somehow reduce transmissibility in BA.2.86? Are FCS-loss mutations a kind of overcompensation? Thoughts welcome. 16/16
I like the idea of @PriscillaFalzi1—premature shedding of S1 (which makes cell attachment impossible) could explain this. This would also help explain the numerous apparent K679N & R681P reversions. As with NTD deletions, there's a stability tradeoff.
The ∆69-70 pendulum is in the midst of yet another swing.
What makes the alternation from ∆69-70 to S:H69+ S:V70 even more remarkable is that deletions are one-way mutations. They essentially cannot be reversed. There has never been an insertion at S:69-70 in SARS-CoV-2. 1/5
So how, after repeated deletion of S:69-70, have S:H69 + V70 been restored? This cannot happen via stepwise evolution/antigenic drift.
But it can if a new variant derives from an old, vanished lineage that's never had ∆69-70 & has spent months evolving in a single person. 2/5
There has been great work on deletions in the NTD region of spike by @GuptaR_lab, @EnyaQing, @GroveLab, & others, some of which I tried to describe in the thread below.
But a precise explanation for the ∆69-70 pendulum remains elusive. 3/5
Spike mutations to cysteine are rare—esp. in the receptor-binding domain (RBD). When 2 Cs show up at once, it usually means a new disulfide bond has been created. I don't think this has ever been seen in the SARS-CoV-2 RBD. 1/11
Disulfide bonds are covalent bonds (i.e. very strong ones) between two cysteine (C) residues. The spike protein has famous evolved very rapidly over the past 3 years, but the 4 existing RBD disulfides are *absolutely* conserved. 2/11
I'm definitely not an expert on structure, but disulfide bonds are clearly crucial.
I don't know for sure that G447C-G496C form a disulfide, but I'd be shocked if they don't.
Amazing tool made by @Asinickle1 & @OliasDave shows 447 & 496 are very close—about 5 Å apart. 3/11
Minimize the number of times you get Covid. Your body will thank you.
Not only can repeat infections inflict Long Covid, they also, as @zalaly has shown, increase the risk of a multitude of ailments, such as cardiovascular events. 1/5
Anthropomorphized Delta, channeling Mark Twain: "Reports of my death are greatly exaggerated."
That's 66 private mutations for those keeping count. 1/6
Like the legendary Indonesian Delta, it has the extremely rare S:R452L reversion.
Unlike that one—but like most Deltas—it has few RBD mutations, meaning it would be unlikely to survive in the current immune environment, even if it were to transmit. 2/6
I spend a lot of time analyzing and documenting outlier SARS-CoV-2 sequences. Recently, I’ve noticed a fascinating pattern: the repeated appearance of paired mutations in two narrow NSP12 regions >2400 nucleotides distant from each other. 1/45
NSP12 is the RNA-dependent RNA polymerase (RdRp), part of the SARS-CoV-2 replication complex, which also includes NSP7-10 and NSP13-14. I discussed some basics of the RdRp & coronavirus genome replication previously. 2/45
The first two-thirds of the SARS-CoV-2 genome consists of ORF1a & ORF1b. When the full genome enters a ribosome, the cell protein-making machine, only ORF1a is translated ~75% of the time. The ribosome hits a stop codon & cuts loose. No ORF1b. 3/45
GW.5 is my favorite XBB* lineage. It's always pulling crazy tricks, like deleting entire genes or adding 5-7 AA mutations at once. Here's it's latest antic: grabbing a chunk of the nucleocapsid gene (top) and stuffing it into the spike NTD (bottom). 1/4
These two sequences share two other mutations not found in other related sequences and are from different labs on opposite sides of the world, so this is real, not an artifact. 2/4
Incredibly, this exact insertion, in the exact same location, has occurred once before, and in a completely unrelated sequence—an XBC.1.2.1 from Denmark collected in December 2022. (XBC is a "Deltacron" lineage now mainly circulating in Australia.) 3/4