2/ GWAS have linked 1000s of genetic variants to disease, but datasets to resolve association signals to coding genes & higher impact rare/low frequency alleles are emerging only now, e.g. through sequencing @uk_biobank or leveraging bottlenecked populations like @FinnGen_FI
3/ We harmonized disease endpoints captured in both biobanks & conducted coding-wide association (CWAS) meta-analyses. This identified 975 replicating signals, most from missense alleles. 1/3 were novel, 145 had MAF<2%. This included "benign" variants in monogenic disease genes.
4/ Notably, not only sample size but also variant population enrichment substantially boosted association power - the more the rarer a variant. This supports more studies in bottlenecked populations like @FinnGen_FI & other ethnicities.
4/ Notably, not only sample size but also variant population enrichment substantially boosted association power - the more the rarer a variant. This supports more studies in bottlenecked populations like @FinnGen_FI & other ethnicities.
5/ We further tested our CWAS sentinel variants for association with 117 quantitative @uk_biobank biomarkers. This identified pleiotropic disease loci & in several cases generated hypotheses on the mechanisms linking variants to disease & underlying drug safety/efficacy signals.
6/ We are super excited to see the gap between Mendelian & complex disease narrowing. ~1/3 of sentinel variants were in genes linked to matching diseases in #ClinVar, yet only few annotated as pathogenic. We also report extended phenotypes for medically actionable ACMG3.0 genes.
7/ Our manuscript showcases numerous vignettes, with a special emphasis on Atrial Fibrillation. For instance, we use Clustered MR to relate AF variants to pulse rate & describe a FIN-enriched missense variant in PITX2 that likely enhances expression of disease-relevant AF genes.
8/ We are grateful for the wonderful @FinnGen_FI participants & collaboration partners, particularly @mitja_kurki@APalotie & @dalygene. Summary statistics from this amazing project are released bi-annually & can be explored in a public results browser r5.finngen.fi
Our joint @FinnGen_FI+@uk_biobank coding variant study is now out @Nature. We bridge rare & common disease genetics by linking protein-coding variants from >650k individuals to >700 disease endpoints, resolve GWAS loci & explain mechanisms 👏@ben_b_sun 🧵rdcu.be/cHCMs
We tweeted earlier on
• power of cross-biobank study to find rare/low-frequency signals
• @FinnGen_FI bottlenecked population benefits
• pleiotropy linking biomarkers & disease
• ClinVar variant disease associations threadreaderapp.com/thread/1450067…
This new🧵is on a few vignettes
For instance, we refined insights on variants in F5 & F10 genes in the coagulation cascade. E.g., ~1/4 Europeans carry the F5 missense variant His865Arg that we found associated with protection from pulmonary embolism