Found an example from @Alexis_Verger that OmegaFold fails on (and has low confidence). But if you hack OmegaFold to use an MSA input, it gets it right (and has high confidence)!
We've (including @thesteinegger@milot_mirdita ) updated ColabFold to use the latest optimized AlphaFold implementation that reduces compile time from ~4.5mins to ~30 seconds! We've confirmed the results are identical for both monomer and multimer predictions. Try it out! (1/2)
Here is the Pseudo-code of the changes we made and the effects on compile time. (2/2)
Rough runtime differences on mono/multi-mer predictions (on the cluster, with other jobs running) from @milot_mirdita . For a mulit-hour run (large proteins or batch-mode that avoids recompiling between preds), a few mins saved isn't much, but for interactive colab use, it is! 😅
For my latest attempt at introducing proteins to students, I made a Google Colab Notebook that predicts proteins from a single sequence. I asked the students to tweak the sequence to get a helix or two helices or... (1/5) colab.research.google.com/github/sokrypt…
I gave them the following cheat sheet: 😅 (2/5)
To make this practical, I had to make various tweaks to AlphaFold to make it compile as fast as possible (~10 seconds) and run as fast as possible (<1 second) and to avoid recompiling if the sequence length changes or the number of recycles changes. (3/5)