A fourth-year CS PhD student at @ZJU_China and @Westlake_Uni || Supervised by: Stan Z. Li.
Jun 27 • 6 tweets • 2 min read
1/ AlphaFold is a revolutionary leap in biology, and it has given us the AlphaFold Database (AFDB). But what happens when we use this data to train other models? We found a crucial catch. 🧵
#ProteinDesign #Bioinformatics #AlphaFold
arxiv.org/abs/2506.08365
2/ 🔬 The Challenge: Systematic Bias
The problem isn't AlphaFold's accuracy, which is phenomenal. The issue is that the AFDB has a systematic bias: its structures are "too perfect" and don't capture the full, messy diversity of experimentally determined structures from the PDB.