Latest Twitter Threads by @adamrpearce on Thread Reader App

Aug 7, 2023 • 8 tweets • 3 min read

Do Machine Learning Models Memorize or Generalize?

An interactive introduction to grokking and mechanistic interpretability w/ @ghandeharioun, @nadamused_, @Nithum, @wattenberg and @iislucas https://t.co/ig9dp9GJBepair.withgoogle.com/explorables/gr…

@ghandeharioun @nadamused_ @Nithum @wattenberg @iislucas We first look at task where we know the generalizing solution — sparse parity. You can see the model generalizing as weight decay prunes spurious connections.

Share this page!

Enter URL or ID to Unroll