Parmita Mishra Profile picture
Apr 17, 2024 15 tweets 5 min read Read on X
Protein folding is so important. In 2023, DeepMind won the $250,000 Lasker award for their solution to the problem. A lot of people have asked me to explain protein folding in simple, understandable terms.

Here is my attempt at explaining just the problem.

🧵OPEN THE THREAD🧵Image
Understanding how a protein's amino acid sequence dictates its 3D shape—known as the "protein folding problem"—is a fundamental question in biology. Proteins are the workhorses of cells, and their functions depend on their shapes (structure).Image
Problem: Predicting this 3D shape from just the amino acid sequence.

This is tricky because proteins can fold in an astronomical number of ways, but only a few are biologically relevant.

Knowing the shape helps us understand function and design better treatments.Image
Before WWII, it was thought that protein properties were defined merely by their amino acid composition. However, post-1949, Frederick Sanger’s methods revealed that the sequence of these amino acids plays a crucial role. Image
Cyrus Levinthal noted in the 1960s that it would take an astronomical amount of time for a protein to randomly try each possible fold before finding the correct structure. Yet, proteins fold correctly and quickly, usually within milliseconds. Image
Protein folding is guided by an energy landscape shaped like a funnel. While there are many possible folded states, natural selection has optimized proteins to fold into a minimum-energy structure rapidly. This funnel guides the protein to its native state. Image
A popular hypothesis, Anfinsen’s dogma, essentially states 'the amino acid sequence of a protein contained all of the information needed for the protein to reach the native conformation.'

This is the 'thermodynamic hypothesis of protein folding.' AlphaFold uses this dogma.Image
From the perspective of performance, AlphaFold2 (and this dogma) have cracked the likely structure of various proteins. However, it is well-accepted that this dogma may not hold true for all proteins.

The low-hanging fruit was picked. Some problems remain.Image
Image
Folding doesn’t happen in empty space but in the bustling environment of a cell, where other molecules can influence the folding pathway. This cellular context adds another layer of complexity. As an example, let's consider molecular chaperones.
Not all proteins fold spontaneously; molecular chaperones assist in the folding of many proteins. These chaperones prevent misfolding and aggregation that can lead to complex diseases. Image
Solutions like AlphaFold model direct physicochemical interactions between amino acids to determine the most likely 3D structure of a protein but do not account for the cellular processes, like the action of chaperones, that can affect protein folding in vivo.Image
There is a lot to this problem (to be covered in other 🧵's), and the problem of protein-molecule, protein-protein, and protein-drug interactions, making the usage of AlphaFold2 in real-life scenarios difficult. The functional problem extends beyond a static training database.
The problem of predicting likely structures, assuming they are static and isolated, is solved. However, it is fair to say that the functional 'protein folding problem' is now solving protein complexes, based on interactions.
DeepMind's AlphaFold-Multimer, their protein complex solution, was not half as successful as their protein structure solution.

Protein complex prediction, in my opinion, bridges computational biochemistry and systems biology in unthought-of ways.
When it comes to solutions for drug discovery, understanding protein-drug interactions is a prerequisite. Essentially, here is an example of how solving a problem on paper is never enough in biology, and blackboxes might not necessarily work. This is also an example of technical constraints in data collection.

Observing protein folding in real time challenges even the most advanced scientific instruments, demanding ultra-fast and precise techniques to catch these fleeting processes; this, on top of in-vivo measurements within a cell being a grand, traditional challenge.

I will cover this topic in more detail over time, but tl;dr - protein structures? somewhat solved. protein complexes? we are so early.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Parmita Mishra

Parmita Mishra Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @parmita

Apr 1
As someone who actively works in drug discovery, I want to dispel a myth.

Stimulants were not “designed for ADHD.” They were discovered by accident in the 1930s because they calmed hyperactive boys.

Also the origin story of the most prescribed psychiatric drugs in history.
🧵 Image
The entire research pipeline - the clinical trials, the diagnostic criteria, the dosing models - was built around one phenotype: hyperactive boys who couldn’t sit still in class.

Most stimulant studies were conducted on white males. The DSM criteria? Based on young boys. Image
I have ADHD. I’ve had it diagnosed since I was very young. And my meds genuinely helped me. I’m not anti-medication. Stimulants changed my life in real ways. But they fixed my hyperactivity. They did NOT fix my inattentiveness.

That’s not a coincidence.
Read 21 tweets
Apr 1
🚨 There is a new COVID variant: Cicada.

It has spread around 25 US states. <1% prevalence in US but up to 30% of cases in Europe are Cicada.

* carries 70–75 genetic mutations in its spike protein
^ double the amount found in other recent dominant strains like JN.1.
🧵
Why is it called Cicada?

It’s “hibernation” pattern.

In November 2024, it was first detected.

September 2025: waste water detections increase

January 2026: first (human) patient.
Cicada has not yet dominated other strains.

If it does? We are screwed. It could absolutely drive a summer surge.

Crucially, it is more likely to hit older people and the immunocompromised.

Stay cautious!
Read 16 tweets
Mar 21
I just asked myself the most important question I’ve ever asked.

What if, god forbid, I had cancer right now? How would I save my life and would I be able to do it without Precigenetics?

The answer made me cry.

Here’s EXACTLY how I would save my own life TODAY. 🧵
Let me show you both paths. What happens today, without this platform

and what I’d actually do if I had one. Assume I had the permits to use my cells, and I could do what I want.

Then you tell me which world you want to live in.
the current reality for every cancer patient on earth:

You get a biopsy. Your tissue is fixed, stained, and sent to a pathology lab. It’s dead. The cells you need answers from are killed in the process of examining them.

You wait.
Read 32 tweets
Oct 15, 2025
I keep saying “drug discovery” but most of my audience does not understand what this means.

Here’s a thread I worked on over the past week trying to distil down drug discovery - and why it matters in the age of AI.

OPEN THE THREAD 🧵 Image
Drug discovery is how molecules become medicine.
A $180 billion guessing game where up to 97% of candidates fail in trials.

We can now predict protein folds (thanks to AlphaFold), but still cannot simulate what a drug actually does to a living cell in real time.
At its core, drug discovery asks one question:

=> What happens inside a cell when we “perturb” it? (drug it)

Traditional biology answers by destroying the cell to measure it.

Every assay is a snapshot. Every snapshot costs reagents, time, and lives. Image
Read 23 tweets
Sep 26, 2025
how do we know that people in the past had cancer and when did we even know what cancer was?

a word for cancer existed long before microscopes or pathology.

the history of cancer is far more exciting than we realize.

🧵
the idea is older than modern medicine. Hippocrates (400 BCE) used the word karkinos (crab) for tumors with “claw-like” spread. Galen (200 CE) expanded it. the word cancer is a translation of this lineage.
this existed across civilizations

in India, texts like Sushruta Samhita circa 600 BCE described “arbuda”: hard, immobile, enlarging masses that ulcerated and killed slowly. not called “cancer,” but the descriptions line up with malignancies.
Read 16 tweets
Jun 8, 2025
Biotech is a 1.55 trillion dollar industry, projected to grow to 3.88 trillion by 2030.

How is biotech incentivized to do all this up front investment to make new breakthroughs that lead to life saving treatments?

You finally get the answers today.
Biotech industry: thread. Image
Tech investors see quick iterations:

Build. Test. Pivot. Scale.

Biotech demands patience:

Experiment=>Fail=>Refine, then repeat: often over years.

Your challenge: Balancing investor impatience with biological patience.
Unlike software, biotech innovation directly translates to high-margin products (drugs).

Pharma margins (gross profit; 70%) dwarf general industry (gross profit: 40%%). Why?

Each approved drug solves life-or-death problems: high value, premium prices.Image
Read 29 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(