Simona Cristea Profile picture
Oct 28 22 tweets 10 min read
Amazing week for #DeepLearning in #spatial #singlecell biology, with 2🔥new Graph Neural Networks methods!

1.STELLAR🇺🇸 @jure: a cell type annotation & discovery atlas-type framework
2.NCEM🇪🇺 @fabian_theis: an approach to infer cellular communication patterns

Deep dive below🧵 Image
But first, some background.

Spatial molecular biology has actually been around since the 70s. @lpachter's wonderful book-like article "Museum of Spatial Transcriptomics" comprehensively discusses history, tech & methodology advances in the past 50 years.
nature.com/articles/s4159…
Nevertheless, recent advances in single cell molecular technologies (brought by e.g. @10xGenomics & @AkoyaBio) have facilitated the high-throughout profiling of (groups of) single cells in their tissue context across embryogenesis, normal tissue development & disease progression.
The most popular high-throughput profiled data types are #SpatialTranscriptomics & #SpatialProteomics.

But the repertoire (pun!) is quickly diversifying, e.g. a very recent paper measures the transcriptome & TCR sequence of cells in situ🤯(Slide-TCR-seq)
cell.com/immunity/fullt…
To note is that most technologies do not in fact achieve single-cell resolution. Rather, they record the molecular signal from a spatial region spanning few tens of cells.

This signal can be either
1.approximated as corresponding to a single cell
2.computationally deconvoluted
Now, back to this week's first highlight, STELLAR, coming from the @Stanford labs of @jure, @GarryPNolan & @SnyderShot.

STELLAR proposes a reference-based automated annotation & discovery atlas-style framework of cell types in unseen settings.

nature.com/articles/s4159…
STELLAR combines 2 sources of information to automatically infer & discover cell type labels, by the logic that:
1.Molecular similarity means functional similarity (assumption underlying entire #scRNAseq field)
2.Spatial physical proximity means functional similarity (histology)
Graph Neural Networks (GNNs) intrinsically capture the combination of these two streams of information: molecular similarity & histology, such that cells that are spatially close together and that have similar levels of gene or protein expression are embedded close to each other.
Two main features differentiate STELLAR from existing space-aware clustering methods:

1. It also requires a spatially annotated reference, with cells labeled according to their cell types (needs to be pre-existing)
2. It does automatic annotation, without relying on human input
In other words, human curation is replaced by the reference. With more & more spatial data to be generated, one can easily see how such an approach is optimal, as the human cost is fixed. In contrast, for manual annotation, the human cost increases with number of new datasets.
Using the given reference, STELLAR learns spatial & molecular signatures that define cell types. Then, it transfers this learning to an unseen & unannotated dataset,which can even belong to a different donor or tissue!

STELLAR also identifies new cell types, unseen in reference!
In the paper, STELLAR is used to annotate CODEX multiplexed fluorescent microscopy & MERFISH spatial transcriptomics data.

It annotates 2.6 million cells labeled with 54 protein markers from the human intestine of 8 different donors, saving hundreds of hours of manual annotation Image
Let's move on to this week's most recent highlight: NCEM, coming from the lab of @fabian_theis in Munich 🇩🇪 & led by @AnnaCSchaar & @davidsebfischer.

NCEM is a graph-based statistical model that infers spatial cell-cell communication patterns.

nature.com/articles/s4158…
NCEM goes beyond the state of the art for most scRNAseq cell-cell communication inference methods nowadays, which is assessing ligand and receptor molecular signatures across cell types and inferring interactions whenever signals from both types of signatures are identified.
NCEMs are graph neural networks that predict a cell’s observed gene expression vector from its cell type label and its spatial neighborhood (niche). The model explicitly considers both statistical dependencies of gene expression, as well as cell-cell communication events.
3 types of such graphs with increasing complexity in modeling cell communication are proposed: linear, nonlinear and conditional variational autoencoder (CVAE), with various data-dependent predictive performances.

In some instances (but not all!), linear models do well enough.
To address the constraints of limited experimental capture of ligand–receptor pairs, a really nice graph kernel extends the model of receptor activity in cells conditioned on observed ligand expression. Image
NCEM works on data generated w. e.g. CODEX, MERFISH,MIBI-TOF.

It can also be applied to Visium,following deconvolution (w. e.g. cell2location @OliverStegle). The variation of expression within cell types across spots depends on their inferred composition
nature.com/articles/s4158…
For ex, applied on a deconvoluted VISIUM lymph node data, NCEM identified a bidirectional dependency of B cells and follicular dendritic cells (FDC) that indicates positive feedback between both cell types in germinal center organization, as well as B cell-mast cells interactions Image
You can start exploring these 2 brand new tools on your own spatial dataset, they both are publicly available & well documented

STELLAR: snap.stanford.edu/stellar/
NCEM: ncem.readthedocs.io/en/latest/
Lastly:

Biology is naturally graph-like & graphs modeling has been around in biology for as long as anybody remembers.

Building on this & on the promise of #DeepLearning in biology, biological applications of Graph Neural Networks (GNNs) will become more & more popular.
If these two papers made you curious to understand GNNs better, here's two fun, easy, yet engaging & complex, online resources I recommend getting started with (@distillpub):

distill.pub/2021/gnn-intro/
distill.pub/2021/understan…

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Simona Cristea

Simona Cristea Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @simocristea

Oct 25
This is an important tip⚠️

When interpreting #Bioinformatics results,don't cherry-pick your gene/pathway results!

Don't only discuss/analyze the specific hits that support your hypothesis

Let the data speak for what it is.

Here's hands-on advice on how to NOT overinterpret🧵
1. Filter & sort your list:

Most people use cutoffs for adjusted p-value, but few do so for effect size. While the 2 are correlated, some genes/pathways do show significant, but minuscule, changes among conditions.

‼️Don't overinterpret such changes!Nature♥️large effect sizes
2. Sort your genes/paths by effect size.

3. Do NOT cherry-pick your favorite gene, but with random rank in the list,to center your story on.

Think of it as the Olympics:you are mostly interested in top candidates. Never heard of them? Obscure hits?

This IS V. INFORMATIVE💡
Read 13 tweets
Oct 21
Today's amazing science dives deep into the 2 strongest #cancer modulators: evolution & immune defense.

First-ever detailed temporal evolutionary trajectories for 600,000 B cell lymphoma immune cells #scRNAseq & #scTCRSeq of 32 patients during immunotherapy with 2 CAR-T drugs 🧵 Image
First, what is chimeric antigen receptor (CAR) T cell therapy?

It is an immunotherapy in which the patient's own immune cells are genetically engineered ex-vivo to recognize, attack & kill tumor cells. Then they are infused back into the patient, ready to fight the enemy!🤺 2/13
Immunotherapies have revolutionized cancer treatment & are among the most promising future approaches.

However: response rates, even if varying across cancers, remain limited, with e.g. 50% response in lymphomas.

Why such therapies fail for the other half remains a mystery 3/13
Read 13 tweets
Oct 12
Valuable #immunology cell atlas: #scRNAseq + paired B & T cell receptor seq for 330,000 tissue-resident immune cells across 16 human tissues.

CellTypist: new & robust immune cells annotation algorithm, finding 101 immune cell types in 1,000,000 cells‼️

Why this is important👇🧵
Assessing cell types in healthy human tissues is hard. That's why most human immune studies so far profiled immune cells circulating in the blood.

But we know that the multitude of immune cells residing in tissues play distinct roles in development & disease than blood cells 2/7
This study is the first to characterize in depth the single cell expression landscape of the immune system as an integrated cross-tissue machinery.

The resulting cell-level complexity is disentangled with CellTypist, a newly-introduced logistic regression-based framework. 3/7
Read 9 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(