There’s a lot of magicking and buzzwording around #AI and #MachineLearning in #drugdiscovery

I thought I’d do a thread on a concrete example of using ML to discover a drug candidate about to enter Phase 2a for a stroke disorder:

@RecursionPharma’s $RXRX REC-994
It’s in this Circulation paper from 2015 and illustrates the seed that became $RXRX “phenomics” platform with Chris Gibson (CEO, co-founder) and his former academic mentor Dean Li (board & co-founder) as first/last authors

ahajournals.org/doi/10.1161/ci…
The disease:

CCM (cerebral cavernous malformation) is a monogenic loss-of-function stroke disease affecting 1 in 200-500 in the US. 20% are due to the familial form usually due to a loss-of-function mutation in 1 of 3 genes, including the CCM2 gene
Unstable and leaky vascular malformations in the central nervous system strongly increase the risk of a bleeding (haemorrhagic) stroke (picture).

Other than neurosurgery, there is no treatment for CCM
Previous hypothesis-driven drug discovery approaches had failed. So the researchers tried an ML-based hypothesis-free, unbiased drug repurposing strategy to see if any of a large number of known drugs/molecules might be suitable/repurposed to treat CCM
They plated human blood vessel (microvasc endothelial) cells and used siRNA to knock down the disease-causing CCM2 gene or control siRNA.

So there are “diseased” cells that mimic CCM pathology (right) w a distinct structure and functional phenotype, and “healthy” cells (left)
All cells were stained with a few standard dyes for nucleus, membrane, actin &c and an automated microscope was used to take a large number of fluorescent microcopy pictures.

So there are many images of diseased cells, and many of healthy cells
They used the open-source software CellProfiler (by @DrAnneCarpenter et al at the Broad) to automatically identify the borders of cells and create a database of a large number of mathematical descriptors for every cell in every image
Such descriptors could be anything derivable from the stained cells images.

For example, the radial distribution of a stained adhesion molecular called VE cadherin (right), or the number of neighbours for each cell, or % cell borders touching other cells (left)
The researchers then used an ML tool called CellProfiler Analyst to “learn” a set of rules to distinguish diseased (CCM2 knockdown) from healthy cells. It’s a type of *supervised* ML, where the algo knows the outcome (healthy/diseased) for each cell in the training set.
Examples of rules to distinguish healthy and diseased cells learned by ML include the following
The researchers then exposed diseased (ie CCM2 knockdown) cells to one of 2,100 known drugs and bioactive compounds and acquired high-throughput fluorescent microcopy images for each
The images where then fed to the ML algo asking “do they look like healthy cells” to identify compounds which had “rescued” the disease phenotpye. ML did that by applying the rules it previously learned classifying each image by the % of cells scored as CCM2 knockdown or control
They selected the top 38 compounds identified by ML as rescuing the disease phenotype

(They also had 2 humans eyeball images and choose compounds, but they didn’t do so well at picking successful candidates)
They then tested the 38 compounds using a more traditional, low-throughput transcellular resistance assay (CCM makes cells “leaky”) that narrowed the top candidate list down to 7
Next, they injected the 7 compounds as transdermal wheals into endothelium-specific Ccm2 knockout mice and found that 2 reduced peri-injection site microvascular leakiness
Finally, they tested these 2 compounds in a more elaborate chronic animal model of CCM using the same type of Ccm2 knockout mice.

Both top compounds (tempol/REC-994 and Vitamin D3) reduced cerebrovascular lesions on MRI
REC-994 (tempol) is a superoxide scavenger which $RXRX are about (1Q22e) to take into phase 2 clinicaltrials.gov/ct2/show/NCT05…

This ~10yo example demonstrates how a high-throughput, simple, cheap ML-based primary screen of microscopy images can be used to comb through
a huge number of compounds that could never reasonably be tested using traditional, low-throughput assays.

It also shows how a hypothesis-free approach can identify unexpected targets – tempol/REC-994 would never have been chosen based on its previously known biology
The foundation of $RXRX current approach to AI/ML drug discovery is still the automated acquisition of a huge number of microcopy images from cells w CRISPR/Cas9-knocked genes, treated w compounds &c – supplemented w -omics data and done at enormous scale

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Christoph Nowak

Christoph Nowak Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @Chris___Nowak

Oct 4, 2021
3. Q4: Possibly communication of PK modelling results – best outcome would be straightforward PK comparability (AUC[0-inf] 80-120%) – otherwise question is if FDA can be convinced by PD data showing similarity between DPs regarding efficacy-linked immunological markers
4. Possibly (Q1/2022?) communication around another Type A meeting prior to re-submitting BLA to double-check all issues are addressed
5. Re-submission of the BLA (Q1/2022?) and outcome within 6 months (PRVB think may come earlier as CRL did not mention clinical/safety questions).
Read 9 tweets
Oct 4, 2021
Update #ProventionBio #Provention $PRVB #teplizumab. For a summary of AdCom etc. see thread below. I am not involved with the company but have other conflicts of interest. All is based on public information and personal opinion.
Recap: Company has Breakthrough Therapy Designation and Priority Review, an AdCom vote 10-7 in favour for “delay of Type 1 diabetes #T1D”, and got a CRL stating CMC problems and failure to demonstrate PK/PD comparability between the studied Lilly-produced
and to-be commercialized ACG-produced teplizumab drug product (DP). A PK/PD substudy in the ongoing phase 3 PROTECT trial of 2 x 12 days of IV teplizumab 6 months apart in new-onset T1D has been completed (AGC Biologics DP ca. 30 patients, Lilly DP ca. 130 patients).
Read 11 tweets
Aug 14, 2021
Second part: on #ProventionBio #Provention $PRVB #anti-CD3 monoclonal antibody #teplizumab under #FDA consideration for the “delay of clinical #Type1Diabetes #T1D in at-risk individuals”.

Upcoming questions cont’d:
(4) If risk/benefit is acceptable, which target population is deemed suitable by the FDA? There were proponents of 3 options in EMDAC: (a) Stage 1 + 2, (b) Stage 2, or (c) Stage 2 + family Hx of T1D (= inclusion criteria of TN-10).
(a) is unlikely, I think, my vote is with (b). With regards to age, I think >8 years (as in TN-10) will be a requirement, given the SEs associated with teplizumab.
Read 26 tweets
Aug 14, 2021
Thread on #ProventionBio #Provention $PRVB and their flagship #antiCD3 monoclonal antibody #teplizumab under #FDA consideration for the “delay of clinical #Type1Diabetes #T1D in at-risk individuals”. I have no connection to the company – but I do have COIs. No recommendations
or claims of accuracy, just a summary of the status quo and personal opinion. Assume I’m biased. All public information. Feel free to correct me. For references & links, contact me. Some pictures are copies from public PRVB presentations/FDA briefing docs.
*Summary*

PRVB are seeking approval of 14-day IV teplizumab for the delay of T1D. Teplizumab is a humanised Fc-mutated anti-CD3 monocl Ab that, simply put, turns autoreactive T-cells (regardless of antigen) into exhausted T-cells

(PRVB corporate presentation slide) Image
Read 27 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

:(