Pango lineages (e.g., B.1.1.7 “Alpha”) are the lingua franca of genomic epidemiology and an awesome example of open science. Maintaining Pango is also a ton of work! In our latest preprint, @jdm1771 addresses this with autolin, a method for automated lineage proposals.
Get all of the gory details about autolin from the preprint, and read on for the Twitter brochure. biorxiv.org/content/10.110…
May 20, 2022 • 9 tweets • 6 min read
10,048,466! That’s a lot of #SARSCoV2 genomes in the single largest phylogeny ever, which we update and optimize every single day! Here, I’ll explain how we are doing pandemic-scale phylogenomics.
We start by aggregating all of the new SARS-CoV-2 genomes from @GISAID, @NCBI, and @CovidGenomicsUK. After QC, we add each genome to the ever-growing phylogeny using @yatishturakhia’s amazing tool, UShER: nature.com/articles/s4158…
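To make the aggregate-then-QC step concrete, here is a minimal Python sketch. It is an illustration only, not the actual pipeline: the `Genome` record, the function names, and the QC thresholds (minimum length, maximum fraction of ambiguous bases) are all hypothetical, and the placement step with UShER is not shown.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Genome:
    """Hypothetical record for one submitted genome."""
    name: str
    source: str      # e.g. "GISAID", "NCBI", "COG-UK"
    sequence: str


def passes_qc(genome, min_length=29000, max_ambiguous_fraction=0.05):
    """Illustrative QC: reject short or highly ambiguous sequences.

    Both thresholds are assumptions made for this sketch.
    """
    seq = genome.sequence.upper()
    if not seq or len(seq) < min_length:
        return False
    # Count every base that is not an unambiguous A, C, G, or T.
    ambiguous = sum(1 for base in seq if base not in "ACGT")
    return ambiguous / len(seq) <= max_ambiguous_fraction


def aggregate_new_genomes(batches, already_placed):
    """Merge per-source batches, skipping names already seen.

    `already_placed` holds the names of genomes already in the tree.
    The first occurrence of a name wins; later duplicates are dropped.
    """
    seen = set(already_placed)
    new_genomes = []
    for batch in batches:
        for genome in batch:
            if genome.name in seen:
                continue
            seen.add(genome.name)
            if passes_qc(genome):
                new_genomes.append(genome)
    return new_genomes
```

Only the genomes returned by `aggregate_new_genomes` would then go on to be placed on the existing tree.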