Graham Erwin (@grahamserwin) #ASHG22: Half of the human genome is repetitive, most are transposons but about 3% is made up of tandem repeats. Tandem repeats cause >40 human diseases.
Graham Erwin (@grahamserwin) #ASHG22: Friedreich's ataxia is caused by a triplet expansion in the first intron of FXN gene. Patients have 70++ repeats, which leads to lower expression/protein levels.
Graham Erwin (@grahamserwin) #ASHG22: Polyamides bind to DNA specifically and are very small (cell permeable). Prior example to tie one of these to a factor that would lead to higher transcription. pubmed.ncbi.nlm.nih.gov/29192133/
Now want to target other repeats.
Graham Erwin (@grahamserwin) #ASHG22: Most of the tandem repeats are >150 repeats. But with short read sequencing, it is hard to map these repetitive regions.
Graham Erwin (@grahamserwin) #ASHG22: ExpansionHunter De Novo detects repeat expansions from short read data. Modified the code a bit to find 160 recurrent repeat expansions identified across 7 cancer types.
Graham Erwin (@grahamserwin) #ASHG22: Does the rate of microsatellite instability correlate with recurrent repeat expansions?
Nope.
Graham Erwin (@grahamserwin) #ASHG22: Have several examples of recurrent repeat expansions in the cancer types that they want to dig into in the future.
• • •
Missing some Tweet in this thread? You can try to
force a refresh
Maureen Pittman #ASHG22: Oligogenic inheritance describes situations were there are some loci (more than one, less than thousands) that contribute to a disease.
Congenital heart disease (CHD) is potentially one of these situations.
Maureen Pittman #ASHG22: Example of a family where GATA6 was passed on from an unaffected parent to a child with CHD. Common variants don't seem to contribute much -- could it be due to other rare variants?
Maureen Pittman #ASHG22: Oligogenic genes are presumed to have related functions and could potentially compensate for each other or could interact, etc.
But finding oligogenic gene examples has been hard due to all of the combinations you have to test.
Scott Younger #ASHG22: Genomic Answers for Kids (GA4K). Currently have >10k subjects enrolled and have returned ~1k diagnoses.
Focus here is large scale assays, specifically establishing a protocol to do patient-derived iPSC reprogramming at scale. Have generated 250 iPSC lines
Scott Younger #ASHG22: Using antisense oligonucleotides (ASOs) to modulate gene expression. Proof of their use in TNNT2 gene.
Scott Younger #ASHG22: Robust restoration of dystrophin expression in DMD-patient organoids when applying a DMD-targeting ASO. Phenotypes restored as well (beating cells in dish).
Yi Ding (@yi_ding_) #ASHG22: Polygenic scoring estimates genetic value by aggregating signal across many loci. Potential for polygenic scores has already been well established -- but so have the challenges, especially the low accuracy in non-European populations.
Yi Ding (@yi_ding_) #ASHG22: Genetic ancestries are continuous and cluster-free.
- By forcing clusters on ancestry, miss inter-individual variability.
- The boundaries between these groups is also fairly arbitrary.
- Clustering also typically removes any unclassified individuals
Yi Ding (@yi_ding_) #ASHG22: Goal is to evaluate PGS accuracy at an individual level.
Using genetic distance to characterize and individual's unique position (Euclidiean distance) in genetic ancestry continuum. Also evaluating individual PGS accuracy.
Now on in the plenaries -- Catherine Robertson #ASHG22: Beta cells in the islets of the pancreas produce insulin. Type 1 diabetes is defined by loss of insulin production due to cell death or dysfunction. Disease risk is both genetic and environmental.
Catherine Robertson #ASHG22: Many common variants tied to T1D, but causal variants are hard to find -- some seem to impact cis-regulatory processes.
Now doing single cell/nuclei RNA-seq, ATAC-seq, and 5' RNA-seq.
Catherine Robertson #ASHG22: Nice overlap of all three data types for the PDX1 locus. Can describe regulatory processes across various cell types or stimuli +/-.
Hypothesis: T1D variants alter regulatory processes in islets.
Jazlyn Mooney (@Jazlyn_Mooney) #ASHG22: Want to trace the genealogical ancestors in African American individuals. 1870 census was the first collected to have more detailed information on African Americans in the US -- but still not a lot. Many do not know their roots.
Jazlyn Mooney (@Jazlyn_Mooney) #ASHG22 draws a distinction: genealogical and genetic ancestors are not the same.
Can use ancestry proportions from the present to estimate the number of ancestors from each source population in past. Implementing a mechanistic model of admixture
Jazlyn Mooney (@Jazlyn_Mooney) #ASHG22: Model estimates the number of genealogical ancestors. Applying to African American ancestry individuals. Slave trade moved many individuals to the Americas. Shows time epochs on the model (slave trade, slavery, segregation era).
Alex Diaz-Papkovich #ASHG22: Genetic data has structure due to geography, non-random mating, culture, etc. Visualization helps with this, but how do we delineate these clusters?
Alex Diaz-Papkovich #ASHG22: Genetic data is very noisy and we frequently may not know how many clusters truly exist. Clustering is also open to misinterpretation -- clusters are useful but are abstractions!
Alex Diaz-Papkovich #ASHG22: Notes that UMAP isn't a clustering algorithm. HDBSCAN is the algorithm used to cluster in this talk -- unsupervised and works well with UMAP data.