Noel O'Boyle @baoilleach
#11thICCS Peter Pogany - Fast molecular searching tools and their extension at GSK
Search tools at GSK: Uses MadFast from Chemaxon; SmallWorld from NextMove; FFSS from GSK; Fraggle from GSK
Reduced graphs represented as SMILES where particular features are represented using unusual elements, e.g. [Sc] for aromatic.
The reduced graph fp is useful for pharmacophoric search. Attachment points are largely ignored, for example.
Describes use of edit distance to measure chemical similarity, with SmallWorld from @nmsoftware. Then Fraggle search via fragmentation and then Tversky search - also requires postprocessing.
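As a rough illustration of the Tversky step mentioned above, here is a minimal sketch of the Tversky index on fingerprints stored as sets of on-bit indices. The function name, the set representation, and the default alpha/beta weights are assumptions made for illustration, not the settings used in Fraggle or at GSK.

```python
from __future__ import annotations


def tversky(query: set[int], target: set[int],
            alpha: float = 0.9, beta: float = 0.1) -> float:
    """Tversky index between two fingerprints given as sets of on-bit indices.

    alpha weights bits found only in the query, beta weights bits found only
    in the target. The defaults are arbitrary illustrative values.
    """
    common = len(query & target)
    only_query = len(query - target)
    only_target = len(target - query)
    denom = common + alpha * only_query + beta * only_target
    # Two empty fingerprints share nothing; report zero similarity.
    return common / denom if denom else 0.0


# Hypothetical toy fingerprints.
fragment = {1, 2}
molecule = {1, 2, 3, 4}
score = tversky(fragment, molecule, alpha=1.0, beta=0.0)
```

With alpha = beta = 1 this reduces to Tanimoto. With alpha = 1 and beta = 0 a query whose bits are all present in the target scores 1.0, which is why an asymmetric weighting suits fragment-in-molecule style searches.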
Why performance matters. We cluster the whole screening collection (>2M) every weekend. Highly parallelisable. Similarity calculation and clustering are the rate-limiting steps in cmpd acquisition. Real-time search is necessary.
Dbs are getting bigger quickly. ZINC and Enamine REAL. Need to know if dev cmpds are in these databases as can't patent them if so.
MadFast stores fps in memory and so is faster. Do an all against all similarity calc every weekend for sphere exclusion clustering.
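For readers unfamiliar with sphere exclusion clustering, here is a minimal sketch of the simplest greedy variant on in-memory fingerprints. It assumes fingerprints are frozensets of on-bit indices, that centres are picked in input order, and an arbitrary Tanimoto threshold; none of this is taken from MadFast or from the talk.

```python
from __future__ import annotations


def tanimoto(a: frozenset[int], b: frozenset[int]) -> float:
    """Tanimoto similarity between two sets of on-bit indices."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def sphere_exclusion(fps: list[frozenset[int]],
                     threshold: float = 0.6) -> dict[int, list[int]]:
    """Greedy sphere exclusion clustering.

    The first unassigned fingerprint becomes a cluster centre and claims every
    unassigned fingerprint whose similarity to it is >= threshold. Repeat
    until nothing is left. Returns {centre index: member indices}.
    """
    unassigned = list(range(len(fps)))
    clusters: dict[int, list[int]] = {}
    while unassigned:
        centre = unassigned[0]
        # The centre always belongs to its own cluster, so the loop terminates.
        members = [centre] + [
            i for i in unassigned[1:]
            if tanimoto(fps[centre], fps[i]) >= threshold
        ]
        clusters[centre] = members
        claimed = set(members)
        unassigned = [i for i in unassigned if i not in claimed]
    return clusters


# Hypothetical toy fingerprints.
toy_fps = [
    frozenset({1, 2, 3, 4}),
    frozenset({1, 2, 3, 5}),
    frozenset({10, 11}),
    frozenset({10, 11, 12}),
]
toy_clusters = sphere_exclusion(toy_fps, threshold=0.5)
```

Each pass compares one centre against everything still unassigned, so the comparisons within a pass are independent and the whole job is dominated by similarity calculations, consistent with the remark that keeping fingerprints in memory is what makes this fast.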
Overlap calculations use InChIKeys. This caused problems due to hash collisions, and they are moving to SMILES (?)
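To make the collision problem concrete, here is a minimal sketch of a key-based overlap calculation together with a check for keys that map to more than one structure. The function names and the placeholder keys ("K1", "K2", ...) are hypothetical, not real InChIKeys, and the sketch assumes each structure string is already canonical so that string equality means the same compound.

```python
from __future__ import annotations

from collections import defaultdict
from typing import Iterable


def overlap(keys_a: Iterable[str], keys_b: Iterable[str]) -> set[str]:
    """Keys present in both collections."""
    return set(keys_a) & set(keys_b)


def find_collisions(records: Iterable[tuple[str, str]]) -> dict[str, set[str]]:
    """Return the keys that are shared by more than one distinct structure.

    records is an iterable of (hashed key, canonical structure string) pairs.
    """
    by_key: dict[str, set[str]] = defaultdict(set)
    for key, structure in records:
        by_key[key].add(structure)
    return {key: structs for key, structs in by_key.items() if len(structs) > 1}


# Placeholder data: "K2" is deliberately assigned to two different structures.
toy_records = [
    ("K1", "CCO"),
    ("K1", "CCO"),
    ("K2", "c1ccccc1"),
    ("K2", "CCN"),
]
shared = overlap(["K1", "K2"], ["K2", "K3"])
collisions = find_collisions(toy_records)
```

A hashed key makes set intersection cheap, but any key returned by `find_collisions` would be counted as an overlap even though the underlying compounds differ. Comparing a canonical structure string directly avoids that, at the cost of longer keys.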
Get data from SureChEMBL. Data available within days of a patent becoming available.
Q: DB sizes are not increasing linearly, but faster than that. What will you do? A: We're okay for now (up to 3/4 billion) but it might be a problem later.