Accurate Uncertainties for Deep Learning Using Calibrated Regression

Volodymyr Kuleshov; Nathan Fenner; Stefano Ermon

arXiv:1807.00263·cs.LG·July 3, 2018·274 cites

Accurate Uncertainties for Deep Learning Using Calibrated Regression

Volodymyr Kuleshov, Nathan Fenner, Stefano Ermon

PDF

Open Access 1 Repo

TL;DR

This paper introduces a calibration method for regression models, including Bayesian neural networks, to produce reliable uncertainty estimates, improving their performance in forecasting and reinforcement learning tasks.

Contribution

It presents a simple calibration procedure applicable to any regression model, ensuring well-calibrated uncertainty estimates with sufficient data, inspired by Platt scaling.

Findings

01

Calibrated models produce credible intervals that match true outcomes.

02

Improved performance in time series forecasting.

03

Enhanced reliability in model-based reinforcement learning.

Abstract

Methods for reasoning under uncertainty are a key building block of accurate and reliable machine learning systems. Bayesian methods provide a general framework to quantify uncertainty. However, because of model misspecification and the use of approximate inference, Bayesian uncertainty estimates are often inaccurate -- for example, a 90% credible interval may not contain the true outcome 90% of the time. Here, we propose a simple procedure for calibrating any regression algorithm; when applied to Bayesian and probabilistic models, it is guaranteed to produce calibrated uncertainty estimates given enough data. Our procedure is inspired by Platt scaling and extends previous work on classification. We evaluate this approach on Bayesian linear regression, feedforward, and recurrent neural networks, and find that it consistently outputs well-calibrated credible intervals while improving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ulissigroup/uncertainty_benchmarking
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Gaussian Processes and Bayesian Inference · Adversarial Robustness in Machine Learning