Adarsh Subbaswamy Profile picture
Computer scientist working on improving the reliability and safety of AI/ML in healthcare | PhD from @JohnsHopkins
Nov 20, 2020 12 tweets 4 min read
New preprint w/ (co-first author) @royjamesadams and @suchisaria: "Evaluating Models Robustness Under Dataset Shift" arxiv.org/abs/2010.15100

How can we evaluate *ahead of time* whether or not a model's performance will generalize from training to deployment? 1/ This is an important question for both model developers and 3rd party auditors evaluating safety. For example, the FDA regulates ML medical devices and requires evidence of model validity (internal and external). 2/