Imran S. Haque Profile picture
Apr 26 10 tweets 8 min read Twitter logo Read on Twitter
More tweetorial! Let’s dig into the “proximity bias” we found @RecursionPharma confounding #CRISPR screens, what it means, and where it comes from.

You can always read along in the preprint here: biorxiv.org/content/10.110… Image
If you missed the first tweetorial in the series, click here to understand what the red-and-blue heatmaps here mean, and how we use them to map the functions of genes at a genome-wide level:
To recap: if you knock out each gene in the genome, plot all their pairwise similarities, and sort by genomic position, a curious pattern emerges in which #CRISPR knockouts look more similar to KOs on the same chromosome arm than to KOs on other arms.
In fact, the image shows TWO maps: the @RecursionPharma #RxRx3 map in HUVEC (above the diagonal) and the cpg0016 map in U2OS made by the JUMP-CP consortium led by @DrAnneCarpenter and @shantanuXsingh (below): the effect reproduces across labs, protocols, cell types, etc. Image
We call this “proximity bias,” as KO sims reflect genomic proximity, not just gene function.

Cool tidbit: this bias even reflects non-canonical genome struc. There’s a known fusion in U2OS between chr5q and chr19q & we see that patch of proximity bias in U2OS but not in HUVEC. Image
We also noticed that the strength of proximity bias fell off going from centromere-to-telomere, suggesting a model in which this bias was caused by chromosomal truncations: lose more genes in common, get stronger similarity. ImageImage
Searching an internal database of 25k RNA-seq samples, jackpot: strong evidence for specific losses from cut-site to telomere in a number of samples!

(Yeah, we do a lot of sequencing @RecursionPharma, too.) Image
Bulk RNA-seq doesn’t tell us whether this is a weak effect in many cells, or a strong effect in a few cells. So we searched #CRISPR datasets in @sandercbio awesome scperturb.org collection, and sure enough: clear evidence of truncations in rare subpopulations of cells!
The first image shows data from nature.com/articles/s4158… by @epic_genetix in the @satijalab; the second from nature.com/articles/s4158… by @FrangiehChris working with @BizarMd. Across labs and cell types, the conclusion holds: #CRISPR editing creates truncations. ImageImage
In tomorrow’s thread, I’ll continue to explain how proximity bias affects a broad range of #CRISPR functional genomics datasets and confounds the community’s efforts to decode #biology, by looking closely at the @CancerDepMap.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Imran S. Haque

Imran S. Haque Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @ImranSHaque

Apr 25
Tweetorial time! We @RecursionPharma mapped consequences of #CRISPR screening of >17K human genes, found a systematic bias confounding all CRISPR screens, traced its molecular cause, and propose a debiasing algorithm. Image
“But Imran,” you say, “I’d rather read your thrilling 41-page manuscript than read tweet threads!”

I can’t blame you, it’s great! (I may be a biased source.) Here ya go: biorxiv.org/content/10.110…
In this first tweetorial, I’ll share some of the foundations of the similarity-based “maps” we build @RecursionPharma as background for what we found out about CRISPR by building a map over the whole genome.
Read 10 tweets
Aug 10, 2020
Happy Monday! In today's #tweetorial on our recent preprint describing @RecursionPharma's platform (biorxiv.org/content/10.110…), I'll explain the unusual 2-D drug response plots we use there and in our COVID-19 screen data at covid19.rxrx.ai..

in terms of jumping cat gifs.
A primer: the @RecursionPharma platform takes images of cells under different conditions (disease agent, disease+drug, control, etc.), and feeds the images through a custom deep network to derive a high-dimensional (128-1024D) "embedding".
Instead of measuring say, two parameters like viral titer and cell count, we measure 100s-1000s of parameters describing the morphology of cells in a plate. This information captures a lot of biology, as @i_draw_hexagons described in his tweetorial:
Read 20 tweets
Apr 26, 2020
Very proud to be part of the team @RecursionPharma working on #Covid_19 and of our preprint today: biorxiv.org/content/10.110…. Brief #tweetorial :

We developed a human cell model of SARS-CoV-2 infection, compared it to the field-standard monkey cell model, and screened ~1700 drugs.
Also: the entire cellular image dataset (~450GB of 5-channel microscopy) is available at rxrx.ai/rxrx19. 305,520 5-ch pics @ 1Mpx licensed CC-BY. Want some big image data for ML to help with the pandemic? Here it is. We've also released the DL image embeddings.
Now on to the paper. If you missed @zavaindar's explainer from Friday, it's a great one to start with. I'll provide my own insights into the work here.

Read 23 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(