Marian Andrecki Profile picture
silicon and carbon-based decision-making
Apr 3, 2018 6 tweets 2 min read
AI safety from @MIRIBerkeley: Categorizing Variants of Goodhart's Law
arxiv.org/abs/1803.04585

Essential for understanding a class of failures for machine learning systems

Goodhart's law: "when a measure becomes a target, it ceases to be a good measure." 2/ Example: A lazy lecturer notices that students' attendance correlates with their exam performance. He decides to assign grades based on this proxy to save time on marking.