Can You Trust This Prediction? Auditing Pointwise Reliability After   Learning

Peter Schulam; Suchi Saria

arXiv:1901.00403·stat.ML·March 4, 2019·37 cites

Can You Trust This Prediction? Auditing Pointwise Reliability After Learning

Peter Schulam, Suchi Saria

PDF

Open Access

TL;DR

This paper introduces Resampling Uncertainty Estimation (RUE), a post-training auditing method that assesses pointwise prediction reliability by estimating how predictions would vary with different training data, improving trust in high-stakes applications.

Contribution

RUE provides a novel post-training auditing algorithm that effectively detects unreliable predictions without requiring changes to the training process.

Findings

01

RUE outperforms existing reliability auditing tools in detecting inaccuracies.

02

RUE produces competitive predictive distributions compared to state-of-the-art methods.

03

RUE does not depend on specific training algorithms, making it broadly applicable.

Abstract

To use machine learning in high stakes applications (e.g. medicine), we need tools for building confidence in the system and evaluating whether it is reliable. Methods to improve model reliability often require new learning algorithms (e.g. using Bayesian inference to obtain uncertainty estimates). An alternative is to audit a model after it is trained. In this paper, we describe resampling uncertainty estimation (RUE), an algorithm to audit the pointwise reliability of predictions. Intuitively, RUE estimates the amount that a prediction would change if the model had been fit on different training data. The algorithm uses the gradient and Hessian of the model's loss function to create an ensemble of predictions. Experimentally, we show that RUE more effectively detects inaccurate predictions than existing tools for auditing reliability subsequent to training. We also show that RUE can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification