Adam Pearce Profile picture
@anthropicai, previously: google brain, @nytgraphics and @bbgvisualdata
Aug 7, 2023 8 tweets 3 min read
Do Machine Learning Models Memorize or Generalize?



An interactive introduction to grokking and mechanistic interpretability w/ @ghandeharioun, @nadamused_, @Nithum, @wattenberg and @iislucas https://t.co/ig9dp9GJBepair.withgoogle.com/explorables/gr…
@ghandeharioun @nadamused_ @Nithum @wattenberg @iislucas We first look at task where we know the generalizing solution — sparse parity. You can see the model generalizing as weight decay prunes spurious connections.