I am in the PUMAS session at #WCPG2022, listening to @b_gelaye talking about the NeuroGAP study
NeuroGAP seeks to build collaboration across Africa, particularly across early career researchers. Phenotyping working group has members across Kenya, Uganda, Ethiopia and South Africa, as well as from the Broad. Lots of clinicians, valuable for building phenotypes #WCPG2022
8 phenotyping manuscripts accepted, 8 more to come, describing the details of psychiatric phenotyping within and across different African countries #WCPG2022
See different loadings and importance of items in scales across different countries. Underlying cultural nuances in phenotyping that need to be captured in instruments #WCPG2022
After a brief hiatus due to phone issues, I'm back - @loldeloo is now speaking about the genetics of severe mental illness in South America, particularly Colombia #WCPG2022
The population history of the Americas has created a complex admixture. Here, focussing on admixed populations in Brazil and Colombia. #WCPG2022
Brazil has broad three way admixture, driven by Portuguese male and Amerindian female mating. Also large African population following enforced displacement (i.e. slave trade). Create one of the genetically heterogeneous countries in the world #WCPG2022
PUMAS Brazil aims to recruit 20k participants of Black/Pardos (mixed) ancestry. 10k cases with structured interview and 10K controls. #WCPG2022
PUMAS Colombia recruiting 1k patients from Cartagena, which also has three way admixture #WCPG2022
Paisa Colombia Biobank - Misión Origen. 9M individuals in mountain region of Colombia. Admixed populations from Spanish and Native American populations, multiple bottlenecks from mountain migrations. Results in extensive homozygosity and IBD segments (> Finnish) #WCPG2022
Biobank focusses on 50k participants with severe mental illness and 50k controls (from local healthcare provider). EHR phenotypes, available on 400k patients. First EHR linked biobank in South America #WCPG2022
Extracting data from EHR. Compared diagnoses to SCID interview - good concordance, kappa 0.7-0.9. A lot of non-concordance is BIP-MDD mismatches, many of which are intermediate (i.e. diagnostically challenging individuals, categorisation is difficult) #WCPG2022
Geospatial analysis shows distinct diagnosis specific hotspots. Also see disease trajectories: medrxiv.org/content/10.110… (more from first author later!) #WCPG2022
Recruitment across studies ongoing - lifting off after pandemic, hopeful for successful recruitment on time #WCPG2022
Phenotype harmonisation is a priority across the studies. Cross cultural training aiming for uniform administration of scales. Item-level harmonisation as well #WCPG2022
Now @danhowrigan will talk about Blended Genome-Exome. To capture genome diversity, could do deep WGS but this is expensive. Alternatively can do deep WES and GWAS array imputation. But most arrays and imputation are EUR biased. Logistical challenges as well #WCPG2022
Enter the Blended Genome Exome. Unbiased common variant capture. 1x WGS + WES. $150 per sample, less at scale. 60 samples per lane of sequencing. Lots of lessons learned from mass sequencing during COVID. #WCPG2022
Low pass imputation using GLIMPSE software. Low pass seq - 0-2 reads in a many places. Select haplotypes and impute likelihood through hidden Markov chain. #WCPG2022
How good is BGE? Compare to 30x as truth, and GSA chip imputation as competition. Needed to calibrate - maximum return for effort. #WCPG2022
BGE has more (11.7 Vs 6.9M) and more accurate SNP results for AFR than GSA. Need to do some mild QC (posterior > 0.8) to ensure high concordance with 30x #WCPG2022
1KG imputation better than HRC - 3M more SNPs. Capture best at high freq MAF > 5%, but still get >90% concordance even for singletons #WCPG2022
Imputing rare coding variants - capturing about 61% of coding singletons. Capture more from the deep WES. #WCPG2022
Imputing across ancestry subsets is generally high quality as well. Capturing a lot of SNPs. Runtime of imputation is robust to sample size - time per sample is stable as sample sizes increase #WCPG2022
• • •
Missing some Tweet in this thread? You can try to
force a refresh
PGC 1000+ scientists from 50+ countries. Primary funding from NIH, but many others as well. 8 sites, 16 PIs in USA, UK, and Ireland. Coordinating committee including regional representatives. #WCPG2022
I am in the #WCPG2022 IDEA plenary, which discusses the need and approaches to decolonising psychiatric genetics. Panel members are Olivia Matshabane, Paola Giusti-Rodriguez @paolagiustirodriguez, @hailianghuang, and @MeeraPurushott1. Chairs are Lea Davis and John Nurnberger.
[I will tweet as much as I can here, but a discussion is always more challenging to tweet]
First question highlights the impact of colonialism in research. This includes lack of resources and support for ECRs, the need for engaging with and working alongside studied groups throughout the research process, including subsequent data storage, access and usage. #WCPG2022
Next is Omar Shanta who will talk about a GWAS of CNVs in 500k individuals (127k cases, mostly EUR, some AFR). Needed consistent CNV calling across entire cohort. Difficult platforms, which makes things challenging. #WCPG2022
Aims: what is the contribution of rare CNVs, which are those CNVs per disorder, how do rare CNVs compare across disorders? #WCPG2022
Assess multiple metrics of CNV burden, comparing tier 1 (pli > 0.5) and tier 2 (pLI < 0.5). #WCPG2022
I'm back (after a slightly longer lunch) - @juandelahozco is presenting on longitudinal trajectories in the EHR from the Clínica San Juan de Dios, Manizales, Colombia. #WCPG2022
Diagnoses of severe mental illness from ICD10, validated against structured interview and chart review. Some BIP-MDD mismatches (see tweets on @loldeloo talk earlier), but agreement is generally very good, especially for SCZ #WCPG2022
Want to extract presentation information from notes - developed NLP algorithm, required named entity recognition and negation detection. Extracted features align with ICD10 codes #WCPG2022
It's day 3, and @sebatlab is outlining the spectrum of genetic influences that affect autism. Clear that although there are strong rare variant effects, these are not monogenic disorders #WCPG2022
Combining together results from common, rare, and structural variants allows a broader picture of genetic risk. See over-transmission of genetic risk at all levels from parents to affected children #WCPG2022
Can combine common and rare variants together into scores that are more predictive than either score alone. Inverse correlation - individuals with autism with high rare score tend to have lower common score [conditional on having autism - crosses liability threshold] #WCPG2022