Apologies to the morning folks in the education session (5am landing and an hour or so walking the streets of Florence required a bit of sleep...)
But I'm only chuffing here! Enjoying listening to @KKuchenbaecker explaining the vanishing boundaries of ancestry #WCPG2022
@KKuchenbaecker KK: Defining specific ancestry groups (e.g. through PCA) can be valuable, but comes with risks of abuse by bad actors. Definitions are not always straightforward and easy to get wrong - relatives have an influence on this (and this also --> misestimates of relatives) #WCPG2022
Roadmap for sustainable diverse gneomic studies: pubmed.ncbi.nlm.nih.gov/35145307/. Need to engage groups being studied as active partners and leads of the studies, and have strong focus on capacity building. Highlights efforts in Pakistan #WCPG2022
Next up is @egatkinson to talk about developing better methods to engage with ancestral diversity in GWAS. Starts with a point KK made earlier - EUR dominates GWAS and this isn't changing. Risks exacerbating existing health disparities #WCPG2022
@egatkinson Studying understudied populations is morally, clinically and scientifically justified. Particular among these are admixed groups, including African Americans and Latinos (1/3 of US population). Data often exists, but is not being used #WCPG2022
Why? Fear population stratification will bias GWAS results. Pop strat - coincidence of ubiquitous ancestral geographic differences in variant frequency with genographic differences in phenotype. Usually addressed with PCA - but crude, limited, tends to exclude admixed #WCPG2022
Ancestry essentially dealt with by avoiding it - discarding all but the bulk of collected samples. Enter TRACTOR - split individuals into their ancestral segments, and analyse all data by focussing on segments, not individuals #WCPG2022
How does TRACTOR work? Uses ubiquitous patchwork chromosomes of all organisms (segments of correlated DNA inherited from ancestors) --> segments of local ancestry #WCPG2022
Assess local ancestry (through chromosome painting), and then analyse by pieces of same local ancestry. Chromosome painting uses phased genotype data and compares variants on the basis of ancestry allele frequencies #WCPG2022
Local ancestry inference approach is very accurate - >97% correct calls from simulated data, and produces useful outputs (e.g. local ancestry specific weights for polygenic risk scoring). Resulting TRACTOR GWAS boosts power of GWAS compared to traditional methods #WCPG2022
TRACTOR doesn't have major power loss when local ancestry irrelevant, and has good control of false positivity. Simulated effect sizes are recapitulated accurately by TRACTOR across multiple models. #WCPG2022
TRACTOR is also valuable for fine-mapping - reduces number of potentially causal variants, prioritises different variants to traditional analysis (because it incorporates information from disruptive LD blocks from different local ancestries) #WCPG2022
TRACTOR also improves phasing, effectively recovers some long range tracts of ancestry. Read the paper for more: nature.com/articles/s4158…#WCPG2022
Next, Ying Wang presents on incorporating diverse ancestry data in polygenic risk scores. Begins with a definition of polygenic risk scores - GWAS-weighted sum of risk alleles (ID'd from GWAS) in new genotype data independent of GWAS #WCPG2022
Polygenicity means different patterns of risk alleles across at-risk individuals are expected, so PRS summarises [covers up?] between-individual diversity #WCPG2022
E.g. risk stratification and stratified screening in breast cancer - high risk (indexed by PRS) --> early screening, low risk --> later/no screening #WCPG2022
PRS analysis: do a GWAS. GWAS has errors e.g. winner's curse (overestimation of effect sizes due to preferentially selecting variants with true effect + positive noise). Numerous methods to deal with this, esp. machine learning methods but also simple LD-clumping #WCPG2022
Metrics used to assess PRS effectiveness - R2, AUC. Separation of PRS distribution in cases and controls. Stratification of sample into relevant strata and OR between strata. #WCPG2022
PRS accuracy is theoretically limited by (broad-sense) SNP-based heritability. Also practical limits of measurable heritability and GWAS, target sample sizes #WCPG2022
Transferability of PRS is limited across ancestries, esp. others --> AFR. Transferability variable across studies and phenotypes. Partly due to differences in allele frequency and LD (variable by phenotype due to different causal variants) #WCPG2022
Methods being developed to address this e.g. PRSCSx. But methods development not enough, need major efforts to prioritise diversity (H3Africa, AllofUS) etc. But major efforts are still EUR dominated. Need more from global south #WCPG2022
Examples from the Global Biobank Meta-analysis Initiative. 2.2M individuals. PRS accuracy heterogeneous across ancestries and biobanks. e.g. very different performance of EUR asthma PRS-->EAS in different EAS biobanks #WCPG2022
Leave-one-biobank out GWAS --> much more powerful PRS than biggest asthma GWAS #WCPG2022
Genetics are not enough to counter health disparities. Need to measure and model environments effectively. Systematic modelling of environments through identifying latent factors #WCPG2022
PGC 1000+ scientists from 50+ countries. Primary funding from NIH, but many others as well. 8 sites, 16 PIs in USA, UK, and Ireland. Coordinating committee including regional representatives. #WCPG2022
I am in the #WCPG2022 IDEA plenary, which discusses the need and approaches to decolonising psychiatric genetics. Panel members are Olivia Matshabane, Paola Giusti-Rodriguez @paolagiustirodriguez, @hailianghuang, and @MeeraPurushott1. Chairs are Lea Davis and John Nurnberger.
[I will tweet as much as I can here, but a discussion is always more challenging to tweet]
First question highlights the impact of colonialism in research. This includes lack of resources and support for ECRs, the need for engaging with and working alongside studied groups throughout the research process, including subsequent data storage, access and usage. #WCPG2022
Next is Omar Shanta who will talk about a GWAS of CNVs in 500k individuals (127k cases, mostly EUR, some AFR). Needed consistent CNV calling across entire cohort. Difficult platforms, which makes things challenging. #WCPG2022
Aims: what is the contribution of rare CNVs, which are those CNVs per disorder, how do rare CNVs compare across disorders? #WCPG2022
Assess multiple metrics of CNV burden, comparing tier 1 (pli > 0.5) and tier 2 (pLI < 0.5). #WCPG2022
I'm back (after a slightly longer lunch) - @juandelahozco is presenting on longitudinal trajectories in the EHR from the Clínica San Juan de Dios, Manizales, Colombia. #WCPG2022
Diagnoses of severe mental illness from ICD10, validated against structured interview and chart review. Some BIP-MDD mismatches (see tweets on @loldeloo talk earlier), but agreement is generally very good, especially for SCZ #WCPG2022
Want to extract presentation information from notes - developed NLP algorithm, required named entity recognition and negation detection. Extracted features align with ICD10 codes #WCPG2022
I am in the PUMAS session at #WCPG2022, listening to @b_gelaye talking about the NeuroGAP study
NeuroGAP seeks to build collaboration across Africa, particularly across early career researchers. Phenotyping working group has members across Kenya, Uganda, Ethiopia and South Africa, as well as from the Broad. Lots of clinicians, valuable for building phenotypes #WCPG2022
8 phenotyping manuscripts accepted, 8 more to come, describing the details of psychiatric phenotyping within and across different African countries #WCPG2022
It's day 3, and @sebatlab is outlining the spectrum of genetic influences that affect autism. Clear that although there are strong rare variant effects, these are not monogenic disorders #WCPG2022
Combining together results from common, rare, and structural variants allows a broader picture of genetic risk. See over-transmission of genetic risk at all levels from parents to affected children #WCPG2022
Can combine common and rare variants together into scores that are more predictive than either score alone. Inverse correlation - individuals with autism with high rare score tend to have lower common score [conditional on having autism - crosses liability threshold] #WCPG2022