Jason Sheltzer Profile picture
Jun 2, 2021 23 tweets 9 min read Read on X
New paper from @joans and me! A pan-cancer, cross-platform analysis identifies >100,000 genomic biomarkers for cancer outcomes. Plus, a website to explore the data (survival.cshl.edu) and a (controversial?) discussion of “cause” vs. “correlation” in cancer genome analysis.
We used every type of data collected by TCGA (RNASeq, CNAs, methylation, mutation, protein expression, and miRNASeq) to generate survival models for each individual gene across 10,884 cancer patients. In total, we produced more than 3,000,000 Cox models for 33 cancer types.
Within each cancer type, we identified thousands of biomarkers for favorable and dismal patient outcomes. The most common adverse biomarkers included overexpression of the mitotic kinase PLK1, methylation of the transcription factor HOXD12, and mutations in TP53.
GO term analysis revealed common gene groups among adverse and favorable biomarkers, including cell cycle genes (upregulated in deadly cancers) and developmental transcription factors (methylated in deadly cancers).
We could use these biomarkers to stratify patient outcomes in clinically-ambiguous situations, including Stage 1a breast cancer and Gleason 7 prostate cancer. In general, gene expression and DNA methylation biomarkers provided the most prognostic information.
So now here’s where it gets weird: aside from mutations in TP53, we didn’t see many cancer driver genes score as strong biomarkers in our prognostic analysis. KRAS, EGFR, RB1, PIK3CA, RB1, NF1… mutation, methylation, or altered expression of these genes wasn’t really prognostic.
In the literature, if some gene is associated with worse cancer outcomes, then that is typically presented as evidence that that gene is an important cancer driver. But clearly KRAS and PIK3CA are important cancer drivers and they didn’t score in our analysis… so what gives?
To investigate this, we analyzed lists of cancer driver genes, and then we compared their prognostic significance to randomly-permuted gene sets. Surprisingly, verified oncogenes were no more likely to be prognostic than any randomly chosen gene in the genome!
For instance - KRAS mutations clearly drive lung cancer. But KRAS mutations in lung cancer are *not* associated with worse patient outcomes. In some cases, mutations in specific oncogenes are associated with *better* outcomes, not worse outcomes.
If you infer the importance of a gene from survival analysis (which is exceptionally common in the literature, and is something I’ve previously done myself) - you could accidentally conclude that CENPA is a more important driver of prostate cancer progression than MYC:
In general, our analysis provides genome-wide evidence that inferring *causation* (gene A is a driver of cancer progression) from *correlation* (gene A is overexpressed in deadly cancers) is not appropriate for patient outcome analysis, even if it’s commonly done.
Next, we looked at cancer drug targets. Again, it is routine to see the fact that a gene is associated with deadly cancers presented as evidence that that gene is a good drug target. But is this link justified by the data?
We looked at the targets of all FDA-approved cancer drugs, and we found that these drug targets were no more likely to be prognostic than any randomly-selected gene in the genome!
Consider PD1 as a drug target. High levels of PD1 (PDCD1) are associated with patient survival. So you might think that PD1 inhibitors would kill people! But cancers don’t work like that - survival correlation is not causation - and PD1 inhibitors in fact prolong survival.
(You could imagine that this is a type of post-hoc fallacy - maybe these genes are non-prognostic because of the existing therapies. But we did a sub-analysis on drugs approved after 2017 [post-TCGA], and we observed the same pattern).
Then we asked - what happens if you target the worst adverse features in the genome? Maybe those are still the best drug targets? Among the top 50 prognostic factors in the genome, we found that 16 have been targeted in clinical trials, and 15 of them have failed.
We believe this is because the most prognostic factors are not selective oncogenes. They’re housekeeping cell cycle genes that are ubiquitously expressed, and they’re essential across cell types. No cell type-selectivity = systemic toxicity and trial failure.
Successful cancer drug targets may be adverse biomarkers, favorable biomarkers, or they may have no survival correlation whatsoever. Our data demonstrates that this type of prognostic analysis should be uncoupled from therapeutic target development.
To put this in perspective - imagine a KM plot of 10,000 senior citizens: “people receiving dialysis” vs “people not receiving dialysis”. Individuals receiving kidney dialysis are more likely to die than individuals who are not receiving dialysis...
Based strictly on this correlative observation, one could assume that kidney dialysis kills people! Yet, we know that people receiving dialysis are likely to be older and have several medical comorbidities, and dialysis saves their lives. Same thing in cancer genomics!
Inferring functional relationships and prioritizing drug targets based on correlative outcomes analysis may be inappropriate, as these relationships can be fraught with confounding variables and spurious associations.
So, let me know what you think, and take a look at our website - survival.cshl.edu. 3 million Kaplan-Meier plots to explore and lots more exciting findings to uncover. Feedback welcome!
I should add - I was playing around with some of the ideas in the paper in the thread linked below. It goes a little deeper into the drug target analysis and the misinterpretation of what survival curves mean:

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Jason Sheltzer

Jason Sheltzer Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @JSheltzer

Dec 1
New from my lab on bioRxiv - we found an existing drug that appears to be safe in humans that selectively kills chemotherapy-resistant cancer cells.
The drug is called PAC-1. It was initially developed to target cancer cells by activating the executioner caspases. But, we generated CASP3/6/7 triple-knockouts and it still eliminated cancer cells, demonstrating that it must have some other target. Image
Image
We used the PRISM dataset from @corsellos and found that PAC-1 was behaving like an iron-chelation agent. We confirmed that PAC-1 depleted cellular iron, upregulated an iron-starvation transcriptional response, and PAC-1 lethality could be reversed with iron supplementation. Image
Image
Image
Read 11 tweets
Sep 30, 2024
One week from today, the Nobel Prize in Medicine/Physiology will be announced.

Here are the 79 most likely awardees, each of whom has won two or more pre-Nobel “predictor” prizes: Image
My top picks: Horwich/Hartl for their work on chaperone-mediated protein folding. Their discoveries changed how we think about protein structure and has had significant ramifications for our understanding of neurodegenerative diseases.
Klenerman/Balasubramanian for the development of next-gen sequencing (this could be a Chem prize too). NGS has revolutionized multiple areas of medicine and medical research, and the committee likes recognizing tool/technique development (PCR, CRISPR, monoclonal antibodies, etc).
Read 10 tweets
Jul 24, 2024
The new class of HHMI investigators average 3.9 papers as corresponding author in Cell, Nature, or Science. 26 out of 26 members of this group previously trained with a PI who is in the National Academy of Sciences or who was an HHMI investigator themselves. Image
To back up, I have a longstanding interest in understanding the trajectories of academic careers and uncovering “hidden” factors that influence success. Some of my published work on this topic:

pubmed.ncbi.nlm.nih.gov/24982167/
nature.com/articles/nbt.4…
Recently, there has been a push for funding bodies to look more closely at preprints and put less emphasis on journal names. However, if you look at data that I collected from HHMI’s competition in 2018, you can see that the results are pretty similar:

Read 9 tweets
Jul 6, 2023
Check out our new study in @ScienceMagazine, where we take on a 100-year-old debate: what’s the role of aneuploidy in cancer?

We discovered that genetically removing extra chromosomes blocks cancer growth - a phenomenon we call “aneuploidy addiction”. science.org/doi/10.1126/sc…
In the 19th century, pathologists observing cancer cells under a microscope noticed that they frequently underwent weird mitoses. The chromosome bodies visible in these cells were not equally divided between daughter nuclei - in other words, they were aneuploid. Image
Early pathologists like Theodor Boveri proposed that it was this aneuploidy that actually caused cancer. But, there was no way to test it. Eventually, this theory fell out of favor - researchers discovered oncogenes and showed the impact that point mutations could have in cancer.
Read 16 tweets
Jan 10, 2023
Very excited to share a new paper from my lab: using a set chromosome-engineering tools, we show that cancers are “addicted” to aneuploidy. If you genetically eliminate single aneuploid chromosomes, cancer cells totally lose their malignant potential!
biorxiv.org/content/10.110…
To back up, for many years researchers have used the standard tools of molecular genetics to learn about the function of individual oncogenes and tumor suppressors. We can easily over-express, mutate, or knockout genes like KRAS and TP53 to study their biology.
Chromosome gain events are exceptionally common in cancer, but the genetic tools that allow us to manipulate individual genes don’t work for these chromosome-scale copy number changes. You can’t package a whole chromosome in a lentivirus to over-express it.
Read 13 tweets
Nov 28, 2022
If you choose to transfer a manuscript between Nature-family journals, you can consult a web page that lists the acceptance rates for 124 journals published by the Springer Nature Group.

I haven’t seen this data circulated before, so I copied it to share here:
According to this data, "Nature" is not actually the most selective journal. Nature Med, Cancer, and Human Behavior all have lower acceptance rates.
This could be Simpson’s paradox. Maybe a cancer paper has a 2% acceptance rate at Nature and a 4% acceptance rate at Nature Cancer, but Nature also loves to accept ML papers, which increases the overall acceptance rate?
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(