, 13 tweets, 4 min read
My Authors
Read all threads
Biological and statistical interactions \thread part 2 (for part 1 see quoted tweet)

If you know something about biology you can hypothesize about the form of the statistical interaction you expect to see—but (a) you need to know a lot already to do this, and (b) not seeing the expected pattern does not automatically mean your hypothesized mechanism is wrong.
You could also use what you (think you) know about biological mechanism to improve estimation precision when your sample size is small—e.g. if you are interested in genotype-specific treatment effects.
.@TimothyLash gave a nice example of this involving tamoxifen pharmacogenetics in a recent @HarvardEpi seminar. Keep an eye out for that.
Note that all of the examples I’ve given so far require knowing something about the specific changes specific alleles make to gene products and their likely consequences.
“Maybe this intergenic SNP regulates this nearby gene that might do something related to this thing related to exposure” won’t cut it.
So what about going the other way? Can we make inferences about underlying biology from the shape of the relationship between G and E and outcome Y, l(G,E) Not without additional assumptions. There are too many possible paths from G,E to Y.
This is particularly relevant for genome-wide scams for interactions. Say you run a GWAS and you find a statistical interaction between an exposure and a variant G (function unknown). As much as you might like, you can’t really infer much about the function of the variant.
Even where you know the exposure acts thru process X, you can’t automatically conclude that G affects X. Best case scenario: you come up with falsifiable hypotheses about what G is doing (which you then have to try to falsify).
Maybe testing falsifiable hypotheses coming from the few statistical GxE interactions identified via GWAS to date will yield more biological insights than testing hypotheses from the 1,000s of GWAS-identified marginal genetic associations—or maybe not.
To be clear—there are good reasons to study the joint effects of genes and environment on traits in epidemiologic studies. But inferring underlying biology is the heaviest lift.

In general, “statistical interaction” per se does not necessarily directly address the scientific goals of the study.
The form of the analysis and its interpretation should be driven by the scientific question. Horses for courses. \fin
Missing some Tweet in this thread? You can try to force a refresh.

Enjoying this thread?

Keep Current with Peter Kraft

Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

Twitter may remove this content at anytime, convert it as a PDF, save and print for later use!

Try unrolling a thread yourself!

how to unroll video

1) Follow Thread Reader App on Twitter so you can easily mention us!

2) Go to a Twitter thread (series of Tweets by the same owner) and mention us with a keyword "unroll" @threadreaderapp unroll

You can practice here first or read more on our help page!

Follow Us on Twitter!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just three indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3.00/month or $30.00/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!