Genomics, LLMs, spatial cancer evo | Group leader @DanaFarber @Harvard | Head of Data Science at Hale Center for Pancreatic Cancer | PhD @eth | 🇷🇴🇸🇪🇨🇭🇺🇸
2 subscribers
Dec 31, 2023 • 41 tweets • 11 min read
To end 2023, I’ll share one of the most insightful & well-written papers I read in 2023.
This study @Nature links *spatial* tumor organization to immunotherapy response in breast cancer.
Immunotherapy is our strongest weapon against cancer. We need to understand it better.
🧵🧵
Long thread ahead, going deep into the molecular workings of breast cancer immunotherapy.
New: the monthly roller-coaster through October’s coolest life science papers is here 🚀🧬
3-sentence summaries of papers on evolution, single cell methodologies, genetic screens & more.
And, only for October, an educational video on fighting cancer🤺 as a bonus.
Enjoy 3x10! 1. Assembly theory (Sharma et al., Nature)
The most (in)famous paper I read this month proposes a new framework (assembly theory, that is) to explain basically everything… or, more specifically, “to unify descriptions of evolutionary selection across physics and biology” 1/3
Sep 22, 2023 • 41 tweets • 11 min read
The human genome is gradually unravelling its secrets 🎁
AlphaMissense model @ScienceMagazine: one more path lit up by deep learning in exploring the code of life 🧬
We now know with high confidence if 89% of ALL missense variants are benign or pathogenic
Key contributions🧵🧵
First things first:
Missense variants = genetic variants (i.e DNA bases) that change the amino acid sequence (i.e groups of 3 bases, building blocks of proteins) in proteins.
Missense variants are more important than non-missense ones, as more likely to have functional impact.
Jul 21, 2023 • 19 tweets • 7 min read
3 amazing papers just out @Nature, the kind worth giving up sleep for🦉
Cancer is a terrible disease, and also one that we all know too well.
It is not a new problem, rather one that exists since thousands of years & is studied in unimaginable detail.
Then why do people still die of cancer?
Let's start understanding this by taking a step back.
It’s 1938, and Public Health Services are advising people that detecting and treating cancers early will save their lives.
May 5, 2023 • 37 tweets • 10 min read
New preprint on scGPT: first foundation large language model for single cell biology, pretrained on 10 million cells.
Just as text is made of words, cells are characterized by genes
Some thoughts on how cool this is & why it challenges the status quo of single cell analysis🧵🧵
The self-attention transformer (chatGPT) is so successful in natural language processing (NLP)
But maybe single cell biology is not that different from NLP, as genes & cells correspond to words & sentences?
Only a matter of time before a paper formalized this exercise:
Automated #scRNAseq cell type annotation with GPT4, evaluated across five datasets, 100s of tissues & cell types, human and mouse.
A🧵below with my thoughts on how such tools will change how #Bioinformatics is done.
I'll start with a quick summary of the paper, such that we're all on the same page.
(The paper is also a super quick read, literally only 3 pages of text, among which 1 is GPT prompts).
The paper I am sharing today is a thoughtful philosophical perspective from @sdomcke & @JShendure proposing a new organizational framework for single cell data, as an alternative to e.g Human Cell Atlas
Compelling read for both lovers❤️ & skeptics🤔 of single cell genomics
🧵🧵
This thread is organized as follows:
1️⃣ The need to organize Biology
2️⃣ How to organize cell types?
3️⃣ A consensus ontology
4️⃣ Structure & representation of the cell reference tree
5️⃣ Resolution of tree labels
6️⃣ Example tree
7️⃣ Human tree
8️⃣ Thoughts
Mar 24, 2023 • 25 tweets • 10 min read
Do you need to analyze Spatial Transcriptomics data, but are lost in the endless sea of methods?
Here's an explainer of the new @NatureComms paper benchmarking 18 spatial cellular deconvolution methods🧵🧵
We use single cell protein quantification & single cell FISH to map #spatial interactions in genetic mosaicism & tumor microenvironment in #Glioblastoma!
Glioblastoma (GBM) is one of the deadliest, most aggressive cancers that exist, with a median survival of only 15 months.
In GBM, 'single cell heterogeneity' are not simply buzzwords.
Rather, this immense heterogeneity is a main reason of treatment failure
Feb 27, 2023 • 17 tweets • 6 min read
I need to raise awareness about an important point in #scRNAseq data analysis, which, in my opinion, is not acknowledged enough:
‼️In practice, most cell type assignment methods will fail on totally novel cell types. Biological/expert curation is necessary!
Here's one example👇
Last year, together with @LabPolyak@harvardmed, we published a study in which we did something totally awesome: we experimentally showed how a TGFBR1 inhibitor drug 💊 prevents breast tumor initiation in two different rat models!
Generating brand new functional proteins from scratch with large language models (e.g. #chatGPT)
Let’s understand this Transformers model used for protein design, how well it works & why this is important🧵👇
The very nice paper discussed in this thread comes from a team led by @nikhil_ai at Salesforce @SFResearch 👏
Inspiring Symposium on Cancer Prevention @EACRnews
95% of cancer drugs fail. 94% do not improve life quality.
An ounce of prevention is worth a pound of cure. (B. Franklin)
Cancer prevention is tremendously difficult. But it is also necessary.
We need to shift our focus.
How to move from developing cancer treatments to cancer prevention? @cohen_cyrille
How to change the single gene/ single mutation paradigm for holistic approaches considering multi-omics, lifestyle, exposure and cells as a whole? @AzraRazaMD
Jan 26, 2023 • 14 tweets • 5 min read
Graph Neural Networks (#GNNs) & their applications to life sciences are an exciting #DeepLearning area to discover!
But, to develop or apply GNN methods, we first need to understand the maths behind.
So, back to basics!
Here's a plain language summary of what's behind GNNs👇
This summary is based on @PetarV_93’s recent paper with introductory theoretical notions on Graph Neural Networks.
Division frenzy 🤩: T cells can divide indefinitely & long outlive their host organism!
One of 2023's most exciting papers so far!
A paper that challenges scientific paradigms & brings strong experimental evidence against long-held scientific beliefs.
Let's break it down🧵
Friends, this small 5-page @Nature paper is the perfect example of the ideal science:
1. Pick a very relevant topic (T cell adaptive immunity) 2. Ask a very relevant question related to this topic (how often can CD8+ T cells divide?)
👇 nature.com/articles/s4158…
Jan 17, 2023 • 28 tweets • 10 min read
The science of #immunotherapy can cure a patient's otherwise incurable cancer.
But sometimes immunotherapy fails completely
Shockingly, we hardly know why.
A meta-analysis of #Genomics & #Transcriptomics in >1,000 immunotherapy-treated patients aims to better understand why🧵
This 2021 @CellCellPress paper is one of the best #DataScience#Bioinformatics resources out there for understanding the genetic determinants of response to immune checkpoint inhibitors (ICIs).