Faithful Heteroscedastic Regression with Neural Networks
Andrew Stirn, Hans-Hermann Wessels, Megan Schertzer, Laura Pereira,, Neville E. Sanjana, David A. Knowles

TL;DR
This paper introduces a simple optimization modification for neural heteroscedastic regression that ensures accurate mean estimates and well-calibrated variance, improving over existing methods without complex surrogates or Bayesian approaches.
Contribution
It proposes a novel, theoretically justified optimization technique for heteroscedastic neural regression that guarantees mean accuracy and enhances variance calibration.
Findings
Mean estimates match those of homoscedastic models.
Variance calibration is significantly improved.
Method recovers underlying heteroscedastic noise variance.
Abstract
Heteroscedastic regression models a Gaussian variable's mean and variance as a function of covariates. Parametric methods that employ neural networks for these parameter maps can capture complex relationships in the data. Yet, optimizing network parameters via log likelihood gradients can yield suboptimal mean and uncalibrated variance estimates. Current solutions side-step this optimization problem with surrogate objectives or Bayesian treatments. Instead, we make two simple modifications to optimization. Notably, their combination produces a heteroscedastic model with mean estimates that are provably as accurate as those from its homoscedastic counterpart (i.e.~fitting the mean under squared error loss). For a wide variety of network and task complexities, we find that mean estimates from existing heteroscedastic solutions can be significantly less accurate than those from an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning and Data Classification · Neural Networks and Applications
