On phylogenetics (evolutionary trees), monophyly (clades), and paraphyly. And SARS-CoV-2 variant evolution, of course. 🧵

(All of these ideas are covered in detail in this open access article: evolution-outreach.biomedcentral.com/articles/10.10…).
First, some obligatory reference to etymology and Greek word origins.

Phylogenetics / phylogeny: Greek phûlon ("tribe, clan, race") + genetikós ("origin, source, birth").

Monophyletic / monophyly: Greek moûnos ("one") + phûlon ("tribe, clan, race").
Paraphyletic / paraphyly: Greek pará ("beside, near") + phûlon ("tribe, clan, race").

Clade: Greek kládos (“shoot, branch”).
You've likely seen the term "phylum" as well, for high-level taxonomic groups below the level of kingdom and above the level of class (e.g., chordates, arthropods, molluscs) -- same origin.
Next, a quick bit about the anatomy of an evolutionary tree (phylogeny).

Image: Basic components of an evolutionary tree.
I'm not going to get into a lot of detail about reading trees here, but I do want to talk about what we mean by "clade", "monophyly", and "paraphyly", and why that is relevant to understanding SARS-CoV-2 variant evolution.
The term "monophyly" refers to a grouping of lineages that includes the common ancestor, all of its descendants (not missing any), and no lineages that arose outside that group.
Monophyletic groups -- and only monophyletic groups -- are called "clades".

Clades are nested inside larger clades as you extend back in time / to deeper nodes in the tree.

Image: Evolutionary trees are like family trees.
If a group includes the ancestor and *some* (but not all) of its descendants, it is not monophyletic and thus not a clade. Rather, it is paraphyletic.

Image source: online.ucpress.edu/abt/article-ab… Monophyletic groups (clades) vs. paraphyletic groups (not clades).
A quick and easy way to know if you have a clade is to imagine cutting the tree at a particular branch. If you make one cut and grab the cut branch, everything you're holding is a clade. If you have to cut two branches to make the group, it's not a clade.
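The one-cut rule can be sketched in a few lines of Python. This is only an illustration: the tree and the tip names A to E are made up, and the nested-tuple representation is an assumption chosen for brevity.

```python
# A toy rooted tree written as nested tuples. Tip names are hypothetical.
TREE = (("A", "B"), ("C", ("D", "E")))

def tips(node):
    """Return the set of tip labels at or below a node."""
    if isinstance(node, str):
        return frozenset([node])
    return frozenset().union(*(tips(child) for child in node))

def all_clades(node):
    """Every group you can hold after a single cut (plus the whole tree)."""
    clades = {tips(node)}
    if not isinstance(node, str):
        for child in node:
            clades |= all_clades(child)
    return clades

def is_clade(group, tree=TREE):
    """True if the group is exactly what hangs below one branch."""
    return frozenset(group) in all_clades(tree)

print(is_clade({"D", "E"}))       # one cut is enough
print(is_clade({"C", "D", "E"}))  # one cut, one level deeper
print(is_clade({"A", "B", "C"}))  # would need a second cut to drop D and E
```

The first two groups each hang from a single branch, so they come back as clades. The third does not, because you would have to grab a larger piece and then cut D+E off it.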
Another way of saying this is that if you have two groups with names, and one is nested within the other phylogenetically, then only the one nested within the other is a clade. The one in which that clade is nested is paraphyletic. (This will be clear in the next tweets).
So, let's think about "birds" and "reptiles".

Birds are all descended from a common ancestor, and the named group "birds" includes all of the descendants and no lineages from outside that group. So "birds" is monophyletic, i.e., a clade.
But "birds" are nested within the group we commonly refer to as "reptiles". Put another way, some "reptiles" (specifically crocodilians) are more closely related to birds than they are to other "reptiles". Here's the figure from Baum and Offner (2008) once again:

Image: Figure from Baum and Offner (2008) showing clades and non-clades.
To make a group named "reptiles", you have to group all the "reptiles" (lizards, snakes, turtles, crocodiles, etc.) and cut out birds. So one descendant lineage is left out, and "reptiles" is not a clade.
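Here is a rough Python sketch of the same check for "birds" and "reptiles". The topology is a simplified assumption for illustration, not a published tree, and the function names are invented for this example. The idea is to find the common ancestor of the named group and see which of its descendants the name leaves out.

```python
# Simplified, assumed topology for illustration only.
TREE = (("lizards", "snakes"), ("turtles", ("crocodilians", "birds")))

def tips(node):
    """Return the set of tip labels at or below a node."""
    if isinstance(node, str):
        return {node}
    return set().union(*(tips(child) for child in node))

def mrca(node, group):
    """Smallest subtree that contains every member of the group."""
    if not isinstance(node, str):
        for child in node:
            if group <= tips(child):
                return mrca(child, group)
    return node

def left_out(group, tree=TREE):
    """Descendants of the group's common ancestor that the group omits."""
    group = set(group)
    return tips(mrca(tree, group)) - group

reptiles = {"lizards", "snakes", "turtles", "crocodilians"}
print(left_out({"birds"}))                  # nothing missing: a clade
print(left_out({"crocodilians", "birds"}))  # nothing missing: a clade
print(left_out(reptiles))                   # birds are missing: not a clade
```

An empty result means the group is a clade. For "reptiles" the result is the one descendant lineage that got cut out, which is exactly what makes the group paraphyletic.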
There are lots of familiar named groups that are paraphyletic: "invertebrates", "fishes", and "monkeys", to name a few.

Image: A group named "monkeys" would be paraphyletic.
In terms of scientific classification, there is generally a view that taxonomic names should only be given to clades. However, in broader discussions, it can be useful to refer to groups by a name even if they're paraphyletic. So don't worry about saying "reptile" or "fish".
Ok, so now on to SARS-CoV-2 variants. These also evolve through a branching process involving nested groupings of ancestors, descendants, and more or less closely related lineages. In general, we refer to clades and give those labels.

Image: covariants.org Simplified evolutionary tree of SARS-CoV-2 variants.
Most recently, there has been extensive branching of new lineages within the large Omicron clade.

Images: nextstrain.org and @dfocosi Phylogeny of SARS-CoV-2 variants. Diagram showing relationships and mutations of Omicron variants.
With Omicron, we've had several waves this year in various parts of the world, each (so far) caused by one variant: BA.1, then BA.2, then BA.5.

BA.1 is no longer circulating. The new variants we've been talking about are all members of the BA.2 and BA.5 lineages.
The evolution of Omicron lineages is not always simple. For one thing, there are examples of hybridization (recombination) across lineages, most notably XBB, which is a recombinant of BJ.1 and BM.1.1.1.

Image: Excerpt of Omicron lineage figure showing hybridization.
Let's look more closely at the Omicron clade (as a group, "Omicron" is monophyletic). First, we see a major split between the BA.1 and BA.2 lineages.

(Both Pango [e.g., BA.1] and Nextstrain [e.g., 21K] labels are indicated).

Image: covariants.org Simplified phylogeny of Omicron lineages.
Within BA.2 is where it gets a bit more complicated. It's quite clear that BA.4 and BA.5 are each clades, and that together they form a clade.

Image: The BA.4 and BA.5 clades.
However, the BA.4+BA.5 clade is itself descended from BA.2, and is nested within BA.2. We also have the Nextstrain groups 22C and 22D nested within 21L.

Image: BA.1 and BA.2 lineages.
It is important to bear in mind that the order of branching (topology) shown closer to the tips (terminal nodes) on the tree can be challenging to resolve. The interpretation above is based on assuming that there is no clear split between BA.4+BA.5 and all the other "BA.2"s.
In addition, BA.2 being paraphyletic is contingent on the ancestor of (BA.2+(BA.4+BA.5)) being called BA.2. This is what makes BA.4+BA.5 nested within BA.2.
If the ancestor of (BA.2+(BA.4+BA.5)) had not been called BA.2, and there had been a clear split between BA.4+BA.5 and BA.2, then they would be two separate clades, as with BA.1 and BA.2 or BA.4 and BA.5.
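To make the naming point concrete, here is a small Python sketch. The parent map is a hypothetical, heavily simplified version of the interpretation described above (intermediate lineages are skipped, and the node called "BA.4+BA.5" stands in for their assumed common ancestor). It is not a real tree.

```python
# Hypothetical, simplified parent map (child -> parent). It follows the
# interpretation that BA.4+BA.5 descend from an ancestor called BA.2.
PARENT = {
    "BA.1": "Omicron",
    "BA.2": "Omicron",
    "BM": "BA.2",         # still counted as "BA.2" (intermediate steps omitted)
    "BA.4+BA.5": "BA.2",  # assumed common ancestor of BA.4 and BA.5
    "BA.4": "BA.4+BA.5",
    "BA.5": "BA.4+BA.5",
    "BQ": "BA.5",         # counted as "BA.5" (intermediate steps omitted)
}

def descendants(node, parent=PARENT):
    """All nodes at or below the given node."""
    found = {node}
    changed = True
    while changed:
        changed = False
        for child, par in parent.items():
            if par in found and child not in found:
                found.add(child)
                changed = True
    return found

def missing_from_named_group(ancestor, members):
    """Descendants of the ancestor that the named group leaves out."""
    return descendants(ancestor) - set(members)

called_ba2 = {"BA.2", "BM"}
print(sorted(missing_from_named_group("BA.2", called_ba2)))
print(sorted(missing_from_named_group("BA.4+BA.5", descendants("BA.4+BA.5"))))
```

With the ancestor labelled BA.2, the group "things called BA.2" leaves out BA.4, BA.5, and their descendants, so it is paraphyletic under this assumed topology. The BA.4+BA.5 group leaves nothing out, so it is a clade.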
In the end, it doesn't really matter too much except that we need to be clear to distinguish the ancestral BA.2 that caused its own wave from the descendant lineages still called BA.2 (including BM, etc.) and the other descendant lineages now called BA.5 (including BQ, etc.).
I did manage to make you read a whole thread on evolutionary trees. So that's a win.
I should have linked to this thread on how the names work. One of the complexities is that BA.2, BA.5, BQ.1.1, etc., are aliases that keep the Pango lineage names from getting too long.

Original Omicron = B.1.1.529
BA.1 = B.1.1.529.1
BA.2 = B.1.1.529.2
BQ.1 = B.1.1.529.5.3.1.1.1.1.1
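The alias idea can be sketched in Python like this. The table below holds only the two prefixes implied by the examples above (the BQ entry is just the BQ.1 line with its final ".1" removed), and the function names are made up for this illustration. The real Pango alias list is much longer.

```python
# Alias table limited to the prefixes implied by the examples above.
ALIASES = {
    "BA": "B.1.1.529",
    "BQ": "B.1.1.529.5.3.1.1.1.1",
}

def expand(lineage, aliases=ALIASES):
    """Replace a leading alias with the full lineage name it stands for."""
    prefix, _, rest = lineage.partition(".")
    full_prefix = aliases.get(prefix, prefix)
    return f"{full_prefix}.{rest}" if rest else full_prefix

def is_nested_by_name(lineage, ancestor):
    """True if the full name of `lineage` extends the full name of `ancestor`."""
    full, base = expand(lineage), expand(ancestor)
    return full == base or full.startswith(base + ".")

print(expand("BA.2"))                     # B.1.1.529.2
print(expand("BQ.1"))                     # B.1.1.529.5.3.1.1.1.1.1
print(is_nested_by_name("BQ.1", "BA.5"))  # True
print(is_nested_by_name("BQ.1", "BA.2"))  # False
```

Once the alias is expanded, you can see from the name alone that BQ.1 sits inside BA.5 (B.1.1.529.5) and not inside BA.2.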

Thread by T. Ryan Gregory (@TRyanGregory).
