The Maryland sequence (2021-11) is ancestral to most of the recently detected #monkeypox sequences, suggesting this lineage has likely been circulating in humans for months, if not longer.
It may offer hints about how mutations in this lineage have accumulated.
2/6
The Florida, Texas, & Virginia sequences stem from diversity most recently sampled in Nigeria in 2017, but were sampled after travel to a wide area, ranging from West Africa to East Africa or the Middle East.
Their position & divergence may suggest that #monkeypox has been circulating in humans more globally than realised, over the last few years.
All 4 of these sequences have similar types of mutations to those previously observed - possibly the product of antiviral systems.
4/6
We are grateful for the fast #opendata sharing by @CDCgov & @CrystalGigante et al to @NCBI - & also the many other labs worldwide who have shared sequences openly, in the past & recently.
Without such sharing, none of these analyses would be possible, nor so accessible.🙏🏻
5/6
You can find these 4 recent sequences - alongside all of the other #opendata#monkeypox seqs - on @GenSpectrum & LAPIS API, for fast, easy, automated access.
clades.nextstrain.org can now detect if sequences are potentially contaminated or are recombinants.
This is how it works:
Nextclade places sequences on a reference tree. Mutations relative to the nearest neighbor are identified (called private) and analyzed further 1/
These private mutations are split into three categories:
1) Reversions to reference.
2) Labeled: Mutations to a genotype that is common get labeled with the name of the clade the genotype is common in.
3) Unlabeled: Mutations that are neither reversions nor labeled. 2/
Good quality sequences that are not recombinants will only very rarely have several reversions or labeled mutations. Thus the number of reversions and labeled mutations can be used as a sensitive indicator of sequence quality or when scanning for recombination. 3/
We've just updated Nextstrain clade designations to partition clade 21A (corresponding to the Delta WHO variant) into subclades 21I and 21J following our previously defined rules for defining clades when mutational and frequency thresholds are met. 1/4
Clade 21I is still a Delta variant virus, but possesses additional spike mutation A222V and ORF1a mutations P1640L, A3209V, V3718A and T3750I. 2/4
Clade 21J is still a Delta variant virus, but possesses additional ORF1a mutations A1306S, P2046L, P2287S, V2930L, T3255I and T3646A, ORF7b mutation T40I, as well as N mutation G215C. Clade 21J is now the predominate form of Delta with an estimated ~79% global frequency. 3/4
We have just released a new version of the Nextclade web application clades.nextstrain.org for #SARSCOV2. We now use Nextalign (C++ via wasm) under the hood, which allows for many new exacting features!
A quick tour!
1/7
Analysis now takes a few seconds to start, but overall it has become much faster and analyzing hundreds of sequences should be no problem. You can download the reference alignments, including translations of SARS-CoV-2 genes, for further analyses.
2/7
You can now examine the diversity of the nucleotide alignment and alignments of different proteins. Select a gene from the drop-down or click on a gene in the genome annotation panel below.