You've likely seen the term "phylum" as well for high level taxononomic groups below the level of kingdom and above the level of class (e.g., chordates, arthropods, molluscs, etc.) -- same origin.
Next, a quick bit about the anatomy of an evolutionary tree (phylogeny).
I'm not going to get into a lot of detail about reading trees here, but I do want to talk about what we mean by "clade", "monophyly", and "paraphyly", and why that is relevant to understanding SARS-CoV-2 variant evolution.
The term "monophyly" refers to a grouping of lineages that includes the common ancestor, all of its descendants (not missing any), and no lineages that arose outside that group.
Monophyletic groups -- and only monophyletic groups -- are called "clades".
Clades are nested inside larger clades as you extend back in time / to deeper nodes in the tree.
If a group includes the ancestor and *some* (but not all) of its descendants, it is not monophyletic and thus not a clade. Rather, it is paraphyletic.
A quick an easy way to know if you have a clade is to imagine cutting the tree at a particular branch. If you make one cut and grab the cut branch, everything you're holding is a clade. If you have to cut two branches to make the group, it's not a clade.
Another way of saying this is that if you have two groups with names, and one is nested within the other phylogenetically, then only the one nested within the other is a clade. The one in which that clade is nested is paraphyletic. (This will be clear in the next tweets).
So, let's think about "birds" and "reptiles".
Birds are all descended from a common ancestor, and the named group "birds" includes all of the descendants and no lineages from outside that group. So "birds" is monophyletic, i.e., a clade.
But "birds" are nested within the group we commonly refer to as "reptiles". Put another way, some "reptiles" (specifically crocodilians) are more closely related to birds than they are to other "reptiles". Here's the figure from Baum and Offner (2008) once again:
To make a group named "reptiles", you have to group all the "reptiles" (lizards, snakes, turtles, crocodiles, etc.) and cut out birds. So one descendant lineage is left out, and "reptiles" is not a clade.
There are lots of familiar groups with names that are paraphyletic: "invertebrates", "fishes", and "monkeys" are all paraphyletic.
In terms of scientific classification, there is generally a view that taxonomic names should only be given to clades. However, in broader discussions, it can be useful to refer to groups by a name even if they're paraphyletic. So don't worry about saying "reptile" or "fish".
Ok, so now on to SARS-CoV-2 variants. These also evolve through a branching process involving nested groupings of ancestors, descendants, and more or less closely related lineages. In general, we refer to clades and give those labels.
With Omicron, we've have several waves this year in various places in the world, each (so far) caused by one variant. BA.1, then BA.2, then BA.5.
BA.1 is no longer circulating. What we've been talking about with new variants are all members of the BA.2 and BA.5 lineages.
The evolution of Omicron lineages is not always simple. For one thing, there are notable examples of hybridization across lineages, most notably XBB which is a recombinant between BJ.1 x BM.1.1.1.
Let's look more closely at the Omicron clade (as a group "Omicron" is monophyletic). First, we see a major split between BA.1 and BA.2 lineages.
(Both Pango [e.g., BA.1] and Nextstrain [e.g., 21K] labels are indicated).
Within BA.2 is where it gets a bit more complicated. It's quite clear that BA.4 and BA.5 are each clades, and that together they form a clade.
However, the BA.4+BA.5 clade is itself descended from BA.2, and is nested within BA.2. We also have the Nextstrain groups 22C and 22D nested within 21L.
It is important to bear in mind that the order of branching (topology) shown closer to the tips (terminal nodes) on the tree can be challenging to resolve. The interpretation above is based on assuming that there is no clear split between BA.4+BA.5 and all the other "BA.2"s.
In addition, BA.2 being paraphyletic is contingent on the ancestor of (BA.2+(BA.4+BA.5)) being called BA.2. This is what makes BA.4+BA.5 nested within BA.2.
If the ancestor of (BA.2+(BA.4+BA.5) had not been BA.2, and there had been clear split between BA.4+BA.5 and BA.2, and it would be two clades, like with BA.1 and BA.2 or BA.4 and BA.5.
In the end, it doesn't really matter too much except that we need to be clear to distinguish the ancestral BA.2 that caused its own wave from the descendant lineages still called BA.2 (including BM, etc.) and the other descendant lineages now called BA.5 (including BQ, etc.).
I did manage to make you read a whole thread on evolutionary trees. So that's a win.
I should have linked to this thread on how the names work. One of the complexities is that BA.2, BA.5, BQ.1.1, etc., are aliases that keep the Pango lineages from getting too long.
What's happening with SARS-CoV-2 variants in Canada? 🧵
As elsewhere, we no longer have a series of distinct waves caused by one variant at a time. Instead, we have multiple variants, none of them close to a frequency of 100%. Here are some graphs by @DGBassani showing this for the whole pandemic, the last year, and the last 6 mo.
We've had 7 distinct waves in Canada since March 2020, two caused by the original (wild type) virus, and then one each resulting from the Alpha, Delta, Omicron BA.1, BA.2, and BA.5 variants. Aside from the BA.5 wave, these have involved clear peaks.
💯. The fitness landscape in which the virus is evolving is not static. What is adaptive will change according to availability of hosts (immunity, our behaviour, etc.). It would be foolish to assume that there is only one local adaptive peak and that the virus is stuck there.
How could the virus get past a low fitness valley? Here's three ways:
1) There is no deep valley because the peak moves and the virus tracks it.
2) Recombination or other major mutational changes mean a large jump to new peaks.
3) A new variant evolves within a host (e.g., someone immunocompromised with a persistent infection). What is fit at that level is very different from what is fit at the host population level.
Some may violate academic freedom, confidentiality commitments with external research partners, etc. Plus, what a great way to say you don't trust or respect your people.
Law faculty may want to review too. Media should dig into what different university administrators have been doing.
I have mentioned several times that deeply divergent lineages -- that is, very distinct new variants that have qualified for Greek letter names -- seem to have evolved within individual hosts (specifically, someone immunocompromised with a persistent infection).
1/
That's different from the rapid diversification of Omicron lineages, which results from evolution at the level of the human host population -- in particular, selection for transmission to new hosts (especially immune escape).
Well...
2/
This paper published today makes an interesting claim that Omicron did not evolve in an immunocompromised human host, but in mice.
This could all be standard stuff, mostly long-standing policy, whatever. Not the point. The point is, because the admin at @uofg are utterly non-transparent and do not value trust or relationships at all, we have heard nothing about where this came from.
Why was this policy created in Oct. 2022? Who wrote it? Who approved it? How much of this is required boilerplate, and how much was optional? Were unions consulted? Why weren't we told this was coming?
If you want to tell people you don't respect or trust them:
"Private investigators may be retained to document employee activity outside of work using video camera technology."
"Sensors attached to key pieces of equipment detect and report on equipment location"
We still don't know what it will mean to have a cloud of immune-escaping variants at the same time. There are several possibilities, but for now it's looking like we're still going to be talking about what individual variants are doing rather than the cloud per se. 🧵
Here's what we mean by a "cloud" of variants. There are multiple lineages, all derived from either BA.2 or BA.5, which have independently evolved to have a similar set mutations that confer immune escape.