Adarsh Subbaswamy Profile picture
CS PhD Student at Johns Hopkins. ML ∩ Causality ∩ Reliability
Nov 20, 2020 12 tweets 4 min read
New preprint w/ (co-first author) @royjamesadams and @suchisaria: "Evaluating Models Robustness Under Dataset Shift" arxiv.org/abs/2010.15100

How can we evaluate *ahead of time* whether or not a model's performance will generalize from training to deployment? 1/ This is an important question for both model developers and 3rd party auditors evaluating safety. For example, the FDA regulates ML medical devices and requires evidence of model validity (internal and external). 2/