[Thread] In-silico molecular overclocking of RaTG13.
TLDR: 191-nt RdRp segments of SARS-CoV-2, RaTG13 and Ra7896 show an unexpected molecular clock behavior. In-silico synonym mutations is one probable explanation.
Assuming SARS-CoV-2 is an ancestor of Ra7896 (used in RaTG13) does not fully explain it. Implied evolutionary rate would be in the order of 10^-2 substitutions/site/year in a well-conserved part of the genome, far from a normal ~10^-3 for a complete genome
So, it is not only SARS-CoV-2 having its molecular clock frozen, but also RaTG13 molecular clock running more than expected. @nerdhaspower and @quay_dr have already noted strange patterns of synonym mutations along the genome of RaTG13
I always make this assumption: WIV ability of faking sequences is not unrestricted. They would never fake aa seqs, due to the protein folding problem. In a few months or years, they would be discovered technologyreview.com/2020/11/30/101…
So, in silico, WIV would never make:
- non-synonymous mutations
- splicing within genes
But they could make:
- synonymous mutations
- swap genes or the complete genome from other real viruses
Why fabricating RaTG13?: To make it appear more distant than it really is. You probably never heard of that 98.65% identity between SARS-CoV-2 RdRp and Ra4991 RdRp. It was very dangerous!
But, if you want to make one virus appear more distant to another one without being noticed, you cannot just make orthogonal synonymous mutations, because you can get caught with the phylogenetic trees if you do not do it wisely
WIV forgot that they would eventually publish 7896 RdRp that could serve as a close outer group for SARS-CoV-2 and RaTG13.
Imagine Fig. 1 is real/base situation, A is fixed and you want B more distant.
Note: A is SARS-CoV-2, B is RaTG13 and O is the outer group (clade 7896)
If you just add orthogonal synonymous mutations to B, the clock is distorted (Fig 2, A & B not contemporaries). Correct way of faking would have been making a few backward synonymous mutations towards the outgroup (Fig 3), and then a few orthogonal synonym mutations (Fig 4)
It seems that WIV forgot the backward mutations!
I was thinking if it was better to keep this secret until WIV publish their next paper of the clade 7896 to let them commit the error again. But it clearly shows up in any tree. Their problem was not checking it this short segment
New findings support my previous assumptions: aa seqs of RaTG13 are real, although some genes could have been swapped (with other real ones)
🧵 We are revealing previously unpublished bat ACE2 sequences from undisclosed sites in Yunnan (our best guess is Mojiang and Chuxiong). These ACE2 sequences were unearthed from Bingjie Hu’s 2018 MSc thesis, submitted a year before the outbreak.
Comparing sample IDs of bat ACE2 seqs of Hu’s thesis and Guo et al. (2020) we identified 3 unpublished seqs from an unspecified trip to Yunnan (possibly 8th trip to Mojiang) and 2 more seqs (possibly Chuxiong). Similar to how we identified the 7896 clade.
The 3 unpublished samples (possibly from Mojiang) are depicted in Fig 3.2, and sourced from Rhinolophus pusillus (Rpu, as in RpYN06 or BANAL-103)
YunNan_9483_Rpu
YunNan_9487_Rpu
YunNan_9527_Rpu
Note: WIV never reported Rpu’s from Yunnan, although the HKU team did from Mojiang.
🧵Natural and non-natural live isolates (WIV & related)
TLDR:
- A substantial portion of the research remains unpublished.
- Missing live isolate WIV15.
- No samples, isolates, chimeric and MA viruses in 2017-2019?
- What is "Rs4874" really?
https://t.co/9JPusWbl9l
First, it is very important to distinguish between a sequence and an isolated virus. Isolating a virus is very difficult. The ratio is perhaps one in hundreds sequenced. WIV has obtained thousands of virus sequences but only a few live viruses.
Misc:
- Rambaut now more involved (added to "Software" and "Data curation" in the contributions).
- The 155 cases turned now into 156 cases.
- Fig 1E then vs now (they forgot to update Fig 2B too).
- Fig 1A: many cases shifted East hundreds of m.
Judge yourself. They were looking for new SARS viruses different from SARS1 but this one found investigating a COVID-like outbreak in miners was not interesting and they made a mistake saying the miners had SARS antibodies.