Cheng Tan Profile picture
A fourth-year CS PhD. student at @ZJU_China and @Westlake_Uni || Supervised by: Stan Z. Li.
Jun 27 6 tweets 2 min read
1/ AlphaFold is a revolutionary leap in biology. This has gifted us the AlphaFold Database (AFDB). But what happens when we use this data to train other models? We found a crucial catch. 🧵


#ProteinDesign #Bioinformatics #AlphaFoldarxiv.org/abs/2506.08365 2/🔬 The Challenge: Systematic Bias The problem isn't AlphaFold's accuracy—it's phenomenal. The issue is that the AFDB has a systematic bias. The structures are "too perfect" and don't capture the full, messy diversity of experimentally-determined structures from the PDB.