Sasha Gusev Profile picture
Jan 26, 2024 18 tweets 10 min read Read on X
I've written up a "crash course" on population genetics parameters useful for thinking about recent selection, heritability, and group differences (as part of a longer write-up on these concepts).



I'll summarize the key points here 🧵: gusevlab.org/projects/hsq/#…
Image
A preface: if you're generally interested in population genetics it's better to learn from first principles, and I've linked some useful resources to that end (many free). In particular (spoiler) recent evolution excludes some of the more interesting concepts and personalities. Image
But one downside of the general approach is that it can be hard to get a feel for real time (for example when populations are modeled in terms of 4Ne\mu). Here we'll fix three parameters based on data: time (t=65k years), population size (Ne=10k), and selection (s=~10^-4).
Image
Image
We can start by modeling how genetic variants move under neutral drift: very slowly! In 120k years a 5% allele is expected to accumulate just ~1% of drift variance. We can also think in terms of allele "age", and common variants are VERY old (mostly pre-migration).


Image
Image
Image
Image
Now let's add selection. Under the weak coefficients we see in real data, selection acts very slowly. Most common variants under negative selection will stay common. And new variants under positive selection will stay rare. It would take ~300k years for a 95% allele to go to 1%.

Image
Image
Image
These shifts are even slower under stabilizing selection, where traits move towards a fitness optimum instead of directionally up/down. This is likely the way populations have adapted to changing environments (we'll come back to this later).
Image
Image
Now that we have a model for selection and drift, we can test for whether variants are under selection. It turns out this test is very powerful when selection is strong, even 100 samples is enough. Whereas in the "nearly neutral" range it is effectively undetectable. Image
We can quantify genetic variance using F_ST, a fundamental measure of within versus between population correlations and often misunderstood. Part of the confusion is there are two derivations - Nei's and Hudson's - and they can be meaningfully different.


Image
Image
Image
Image
Under strong assumptions, F_ST can be related to population size and migration, but it is compatible with many different population dynamics in a way that can be non-linear and unintuitive.

Image
Image
Image
Moreover, F_ST is highly dependent on *which variants* are used to estimate it, and this can lead to highly unintuitive results. For example the apparently higher F_ST within chimps than between chimps/humans -- an artifact of how sub-populations are tested.

Image
Image
Image
A useful derivation is that Hudson's F_ST bounds the difference in trait mean between populations under neutral drift. We can confirm this in simulations. For a typical ~10% heritable trait, the (African/European) population difference is at most 1.5% (in either direction). Image
That's under neutrality, but under stabilizing selection, things are constrained even further but in complicated ways. After a shift in the fitness optimum, genetic variation is first rapidly selected on, and then gradually (and mostly arbitrarily) purified out of the population. Image
Between populations with the same fitness optimum, the mean trait value will be more constrained than under neutrality. But, it will also look like genetic variation has changed MORE substantially (e.g. F_ST). Interpretation is even more complicated with environmental shifts.
Image
Image
Finally, this brings us to the Breeder's Equation, which connects heritability and the response to selection under a fixed environment. In controlled breeding experiments (e.g. maize) response can be stable for many generations (consistent with polygenicity and new mutations).

Image
Image
Image
But in natural populations, the response often appears static or even negative (aka the "stasis paradox")! I highlight some examples compiled by Walsh & Lynch: bias in heritability estimates, indirect/environmental confounding, or shifts in the environment are all at play.

Image
Image
Image
This "missing response" echoes the debate around "missing heritability", where molecular methods often produce lower estimates and identify environmental confounders. I wonder ... are humans more like maize under controlled breeding or like natural evolving populations? /fin
By the way, all the figures and simulations are pretty simple but I've put the code here in case it's useful: . Let me know if you spot an error.github.com/gusevlab/hsq_a…
@threadreaderapp unroll

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Sasha Gusev

Sasha Gusev Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @SashaGusevPosts

Jun 24
Oof. Polygenic scores for IQ lose 75% of their explained variance when adding family controls, even worse than the attenuation for Educational Attainment. These are the scores Silicon Valley is using to select embryos 😬.

A few thoughts on this study ...
The TEDS cohort used here is a very large study with high-quality cognitive assessments collected over multiple time points. It is probably the most impressive twin study of IQ to date. That means very little room for data quality / measurement error issues.
It is important to highlight surprising null results. Just last week we were hypothesizing that large IQ score attenuation could be a study bias or an artifact of the Wilson Effect. Now we see it replicate in an independent study with adults.

Read 12 tweets
Jun 11
Racism twitter has taken to arguing that observed racial differences must be "in part" explained by genetic differences, though they demure on how much. Not only is this claim aggressively misleading, it is completely unsupported by data. A 🧵: Image
Image
Image
Genetic differences between any two populations can go in *either* direction, matching the phenotypic differences we observe or going against them. Genes also interact with the environment, which makes the whole notion of "explaining" differences intractable. Image
The mere fact that a trait is heritable within populations tells us nothing about the explanatory factors between populations. See: Lewontin's thought experiment; Freddie de Boer's analogy to a "jumping contest"; or actual derivations (). pubmed.ncbi.nlm.nih.gov/38470926/Image
Image
Image
Read 13 tweets
Jun 6
James Lee and @DamienMorris have an interesting perspective paper out describing "some far-reaching conclusions" about the genetics of intelligence. This type of "where are we now" paper is very fun and more people should write them! So, where are we now? 🧵 Image
It's a short paper and it surveys three core findings from the past decade of intelligence genetics. These sections follow a structure that I would cheekily call ... "make a bold claim in the title, then walk it back in the text".
First up, they address the concern that associations with intelligence may actually be mediated by functionally irrelevant traits like physical appearance or pigment. The argument is that IQ GWAS has demonstrated enrichments for CNS/brain structure gene sets. This is true! Image
Image
Read 19 tweets
May 20
The SAT/meritocracy debate has always been a bit odd to me when the test makers themselves have studies showing self-reported high-school GPA is a consistently better predictor of college GPA and always adds on top of SATs. Image
Clearly SATs are neither the only nor even the best measure we have of college success and "holistic" admissions can be "meritocratic". It's up for debate whether the additional <10% predictive variance SATs give you are worth the high-school testing industrial complex.
A challenge with all of these analyses is they are measured after selection on the predictor variables themselves, which can induce biased estimates through range restriction. The raw correlations are even lower, and it is hard to know whether correcting is appropriate. Image
Read 5 tweets
May 11
Hanania advocated passionately against "race mixing" for years, so he knows what he's talking about here. But it's worth adding that race-IQ obsessives also tend to make very poor predictions about the future. Let's review ...
The Bell Curve, published at the peak of the 80-90's crime wave, predicted a coming dystopian urban hellscape with a "cognitive underclass" living in state-managed facilities. Not only did all this fail to materialize, but crime rates collapsed.

Image
Charles Murray has nevertheless spent the following 30 years predicting vindication for his claims was just around the corner ... each time pointing to a new corner.

Read 11 tweets
Apr 27
Nice! Here we have an interesting paper using genetic ancestry to classify race/ethnicity in modern data and algorithms. Let's take a look at what this paper found: 🧵
First, I don't want to get too hung up on language, but TCB's tweet starts talking about "ethnicity", then shifts to "continental ancestries", and then entirely omits the largest ethnic group in the US: Hispanics. These terms have distinct definitions (). nap.nationalacademies.org/catalog/26902/…Image
Image
Anyway, how well can this paper actually impute ethnicity from genetic ancestry in a large cancer population ()? ~17% of the time it gets Hispanic classification completely wrong or a no-call! worldscientific.com/doi/10.1142/97…Image
Read 18 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(