How can we evaluate *ahead of time* whether a model's performance will generalize from training to deployment? 1/
This question matters both to model developers and to third-party auditors evaluating safety. For example, the FDA regulates ML medical devices and requires evidence of model validity, both internal and external. 2/