While it is true that the gene closest to a GWAS peak is not always the causal gene, it is also true that it usually is.
In fact, we can quantify how often we should expect the causal gene to be the closest gene, and that number is about 70%.
3 papers from 2021 help pin this down:
Activity-by-contact (ABC-Max) predicts a causal gene for a GWAS SNP using a combination of cell-type-specific chromatin accessibility, epigenome marks and chromatin conformation, the last of which can also be estimated from SNP-TSS distance: pubmed.ncbi.nlm.nih.gov/33828297/
There were several large pQTL studies published in 2021. I've been referencing this one by @pietzner et al. When protein abundance is the trait, the hypothesis is that the cognate gene (the one encoding the protein) is the causal gene:
pubmed.ncbi.nlm.nih.gov/34648354/
Regeneron's flagship paper on the exome sequencing effort in the UK Biobank included this now famous figure:
pubmed.ncbi.nlm.nih.gov/34662886/
These papers provide 3 independent approaches to quantifying the distribution of ordinal rank for the causal gene from a lead GWAS SNP.
Here I'm defining distance as the distance to the "gene body" (TSS-TES).
At least in ABC-Max, the lead variant has been fine-mapped.
The closest gene is the causal gene 70%-76% of the time.
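To make the ordinal-rank idea concrete, here is a minimal Python sketch. It is not the code behind any of the three papers: the `Gene` class, the function names and the toy coordinates are all hypothetical, and it assumes every candidate gene is on the same chromosome as the SNP and that the distance is 0 whenever the SNP falls inside the gene body.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Gene:
    symbol: str
    start: int  # leftmost coordinate of the gene body (TSS or TES, depending on strand)
    end: int    # rightmost coordinate of the gene body


def gene_body_distance(pos: int, gene: Gene) -> int:
    """Distance from a SNP position to the gene body; 0 if the SNP is inside it."""
    if gene.start <= pos <= gene.end:
        return 0
    return min(abs(pos - gene.start), abs(pos - gene.end))


def causal_gene_rank(pos: int, genes: Sequence[Gene], causal_symbol: str) -> Optional[int]:
    """Ordinal rank (1 = closest) of the causal gene among the candidates.

    Ties keep the input order because sorted() is stable.
    Returns None if the causal gene is not among the candidates.
    """
    ranked = sorted(genes, key=lambda g: gene_body_distance(pos, g))
    for rank, gene in enumerate(ranked, start=1):
        if gene.symbol == causal_symbol:
            return rank
    return None


# Toy locus with made-up genes and coordinates (illustration only).
genes = [
    Gene("GENE_A", 100, 200),
    Gene("GENE_B", 300, 400),
    Gene("GENE_C", 1000, 1100),
]
lead_snp = 280

print(causal_gene_rank(lead_snp, genes, "GENE_B"))  # 1: closest gene
print(causal_gene_rank(lead_snp, genes, "GENE_A"))  # 2: second closest
```

Tallying this rank over many lead SNPs with a presumed causal gene would give the kind of rank distribution discussed here; the fraction with rank 1 is the "closest gene" number.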
Here's another visualization of this amazing convergence, plotted on a log scale to capture some of the finer structure at higher ordinal rank.
And another link to the 3 papers:
pubmed.ncbi.nlm.nih.gov/33828297/
pubmed.ncbi.nlm.nih.gov/34648354/
pubmed.ncbi.nlm.nih.gov/34662886/

More from @Eric_Fauman

16 Oct 21
Having this enormous collection of pQTLs allows us to answer the question (again):

Which is more relevant:

Distance of a GWAS SNP to the TSS (transcription start site) or to the gene body of a candidate gene?
pubmed.ncbi.nlm.nih.gov/34648354/
Usually you get the same closest gene measuring to TSS or to gene body.

But in the top case a pQTL for ACAA1 sits inside an irrelevant gene but is closer to the TSS for ACAA1.

But a pQTL for DNAJC17 sits closer to a TSS for a random gene despite sitting within DNAJC17.
Turns out that if TSS_closest_gene and gene_body_closest_gene disagree, the gene body metric is right twice as often.

This is especially true if the SNP sits within a gene, even if it is not a missense variant.

(Again though, TSS and gene_body usually agree on the closest gene: 77% of the time.)
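As a rough illustration of how the two metrics can disagree, here is a small Python sketch. Everything in it is hypothetical (the `Gene` class, the helper names, the gene symbols and coordinates); it assumes a single chromosome and takes the TSS to be the start of the gene on the + strand and the end on the - strand.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Gene:
    symbol: str
    start: int   # leftmost coordinate of the gene body
    end: int     # rightmost coordinate of the gene body
    strand: str  # "+" or "-"

    @property
    def tss(self) -> int:
        # Transcription starts at the left end on "+" and the right end on "-".
        return self.start if self.strand == "+" else self.end


def tss_distance(pos: int, gene: Gene) -> int:
    return abs(pos - gene.tss)


def gene_body_distance(pos: int, gene: Gene) -> int:
    if gene.start <= pos <= gene.end:
        return 0  # SNP sits inside the gene
    return min(abs(pos - gene.start), abs(pos - gene.end))


def closest_by_tss(pos: int, genes: Sequence[Gene]) -> str:
    return min(genes, key=lambda g: tss_distance(pos, g)).symbol


def closest_by_gene_body(pos: int, genes: Sequence[Gene]) -> str:
    return min(genes, key=lambda g: gene_body_distance(pos, g)).symbol


# Made-up locus: the SNP sits near the 3' end of a long gene (GENE_X),
# just upstream of the TSS of a neighbouring gene (GENE_Y).
genes = [
    Gene("GENE_X", 1_000, 50_000, "+"),
    Gene("GENE_Y", 52_000, 60_000, "+"),
]
snp = 49_000

print(closest_by_gene_body(snp, genes))  # GENE_X (SNP is inside its gene body)
print(closest_by_tss(snp, genes))        # GENE_Y (its TSS is 3 kb away)
```

This toy locus mirrors the second case above, where the SNP sits within one gene but is closer to another gene's TSS.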
15 Oct 21
When protein abundance is the trait, the simplest assumption is that the gene encoding the protein is the causal gene.
This catalog of 10,674 pQTLs from @pietznerm et al provides a rare unbiased look at GWAS SNP->causal gene genomic properties.
I took a quick look at the SNP-gene distances for all cases where the lead SNP had an rsID and the trait had a unique HGNC gene symbol. In 3,475 cases the SNP and cognate gene are on the same chromosome, and in 2,985 of those they are within 500kb, with a very strong distance dependence.
For this study the authors provided the VEP consequence for each pQTL, so we can look at how often the cognate gene is the closest gene as a function of that consequence.
Even ignoring missense variants, a variant falling within a gene is a strong predictor.
15 Oct 21
In mapping SNPs to genes we clearly can do better than taking the closest gene, but that should be the baseline against which we compare other methods.
@cr_farber et al, I hope you'll consider this before submitting this for publication.
In this preprint the authors started with 1,097 lead SNPs for bone mineral density from pubmed.ncbi.nlm.nih.gov/30598549/ and applied TWAS and eQTL colocalization to identify "potentially causal genes".
To validate the approach the authors constructed a list of 1,399 "known bone" genes and noted enrichment of their TWAS/eQTL selected genes.

But the enrichment for "known bone" genes is much greater for genes closest to the lead SNPs.
27 Mar 21
A well-behaved GWAS yields strong signals for the kinds of genes that contribute to the phenotypic variation.
This provides strong priors for discerning likely causal genes hidden at other loci.

With this in mind, let's revisit the telomere GWAS

medrxiv.org/content/10.110…
Just going by closest gene, many telomere biology related themes emerge
I colored in this figure from the paper according to closest approach of each gene to a telomere GWAS signal.

Clearly this GWAS is telling us to look at telomere biology.
31 Jan 21
Today's GWAS of urolithiasis, kidney stones and other stones of the urinary tract, provides a wonderful window into calcium, phosphate and vitamin D metabolism.
One nice thing about putting my GWAS interpretations here in Twitter is I can always quickly find what I may have written about a gene or a trait before.

Here's my write-up on urolithiasis from 2 years ago in a completely different cohort, Biobank Japan.

Do GWAS demonstrate good reproducibility?

On the left are the top hits from @finngen; on the right, the top hits from Biobank Japan.
5 of 6 loci from FinnGen are also found in BBJ.
Note some of the lead SNPs may differ, but the causal genes line up.

Meta-analysis, anyone? (@masakanai?)
