Huge congrats to @jdbackman, Manuel Ferreira, @gabecasis, @RegeneronDNA, and co. on @Nature 450k exome @uk_biobank paper 🎉🎉🎉

Especially exciting to see ALL ancestries analyzed (not just EUR like most other UKB exome papers thus far)
nature.com/articles/s4158… Image
Across 454k individuals, they found 12.3M variants (99.6% MAF<1%). This is:
- 1.3x the coding variation in TOPMed & gnomAD combined
- 8x what could be found from TOPMed imputation of UKB (info >0.3) Image
Testing both individual variants and burden tests across 3702 binary and 292 quantitative traits (2.3B tests) using REGENIE, they found 8865 associations (Bonerroni P<2.18e-11) across 564 genes, 492 traits, and 2283 gene-trait pairs. Image
Often forgotten in RV analyses is whether a RV signal is explained by LD with a common variant.

To test for this, they performed a GWAS using imputed data and conditioned the associations on top common variants of which 91% were still significant after conditioning.
Replication is always important but can be difficult with rare variants, partially due to lacking additional large cohorts not used in the initial discovery. With help from a smaller cohort of 133k exomes from @GeisingerHealth, 81% of associations replicated.
Although many different allele frequency thresholds were used for burden tests, they did find 69 (15 novel) associations discovered solely through burden tests of singleton variants (MAF ~1e-6).
Looking for protective associations differs significantly between binary and quantitative traits.
- 131 protective associations in quantitative traits
- 5 in binary traits Image
Unsurprisingly, the majority of traits with a RV signal have more common variants signals. However, 20 traits only had rare variant associations, of which the signals all came from CHIP genes. These associations were indeed correlated with age and had low AB (>35%; i.e. somatic) Image
To help pinpoint likely genes in GWAS loci, they ran GWAS for each of the 492 traits with a RV association finding >107k independent loci. After conditioning on GWAS loci, RV associations were:
- 59x enriched for the nearest gene
- 11.4x for genes within 1MB of GWAS loci Image
So despite what some call an arbitrary decision to assign the nearest gene to a GWAS locus, this really isn't a terrible decision as corresponding RV associations are 59.4x enriched in the nearest gene.
For those attending #ASHG21, the first author, @jdbackman, is giving a #plenary covering the results of this manuscript during plenary session 1 at 1:20pm -1:40pm EST.
Perhaps most importantly, the full associations will be publicly available via @GWASCatalog (they have already been deposited and are currently undergoing quality checks).

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Jack Kosmicki

Jack Kosmicki Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @jakphd

20 Oct
Last in this #ASHG21 late breaking plenary session is Bailey Martin-Giacalone presenting on germline variants in cancer predisposition genes predict survival for children with rhabdomyosarcoma
#ASHG21 Martin-Giacalone: Want to look at germline (not somatic) variants associated with rhabdomyosarcoma (RMS ).

Exome-sequenced 615 RMS cases and 9963 adult controls.
#ASHG21 Martin-Giacalone: Examined 63 cancer predisposition genes. Found 7.3% RMS cases had variants (not sure what type??) compared to 1.5% of controls. TP53, NF1, HRAS had the largest excess.
Read 8 tweets
20 Oct
Next up in the #ASHG21 late breaking plenary session is Elisa De Franco (@Elisa_EDF) presenting loss of primate-specific gene ZNF808.
#ASHG21 @Elisa_EDF: studying mice can provide insights into human biology, but there are differences. Mice have 2 genes for insulin (Ins1, Ins2), humans have 1 (INS).
#ASHG21 @Elisa_EDF: looked at 2877 neonatal diabetes patients from 111 countries and want Identify genes with pancreatic genesis.
Read 9 tweets
20 Oct
Next up in the #ASHG21 plenary session is Jonathan Sebat (@sebatlab) covering WGS of #Autism combining common and rare variants.
#ASHG21 @sebatlab: Found more de novo variants in cases than controls, rare inherited variants overtransmitted to cases, polygenic scores overtransmitted to cases. As such, all 3 categories are associated with #Autism risk.
#ASHG21 @sebatlab: created rare variant and common variant risk scores and both were associated with #autism status.
Read 8 tweets
20 Oct
#ASHG21 first up in the late breaking plenary session is Wenhan Lu looking at pleiotropy in the UKB exomes available at genebass.org
#ASHG21 Lu: observe large-scale pleiotropy across individual variants and genes.
24 genes have >10 independent phenotypic associations
#ASHG21 Lu: Group 491 ICD diseases into 41 domains.
Example: LDLR and ADH1B each have multiple gene associations across different domains
Read 7 tweets
19 Oct
Really impressive talk on noncoding #constraint in #gnomAD genomes from Siwei Chen (@konradjk's lab).

#ASHG21
Chen: Calculated constraint on 1kb windows using Z scores.

How do known non-coding elements (like enhancers) look when viewed through the lens on constraint?

#ASHG21
Chen: When looking at largest constraint Z-scores (top Z-score was 4 on the figures)
- Super enhancers ~3x enriched
- ENCODE cCRE enhancers ~2.25x enriched
- FANTOM enhancers ~1.75x enriched

#ASHG21
Read 8 tweets
19 Oct
Next up is my @RegeneronDNA colleague, Julie Horowitz, presenting "Common and rare variant analysis of 21K psoriasis cases and 623K controls identifies novel, protective associations in several genes in the type 1 interferon #ASHG21
Horowitz: Previous GWAS of psoriasis have identified >60 loci, but no large scale sequencing of psoriasis has been performed to identify 1) very rare variants and 2) burden tests.

#ASHG21
Horowitz: Leveraging data from >5 cohorts and 4 ancestries (EUR, AFR, SAS, AMR), they performed a trans-ancestry meta-analysis across a total of 21k cases and 623k controls.

#ASHG21
Read 7 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(