Marginalised Normal Regression: Unbiased curve fitting in the presence of x-errors
Deaglan Bartlett, Harry Desmond

TL;DR
This paper introduces Marginalised Normal Regression (MNR), a new unbiased curve fitting method for data with errors in both variables, validated through analytic and numerical tests, and extended to nonlinear models with a publicly available implementation.
Contribution
The paper develops MNR, a robust Bayesian regression approach using Gaussian mixture priors, and provides a practical Python implementation called ROXY for efficient sampling.
Findings
MNR reliably produces unbiased results with Gaussian mixture priors.
Single Gaussian prior is often sufficient for unbiased regression.
The method outperforms alternative approaches in cosmological linear relation fitting.
Abstract
The history of the seemingly simple problem of straight line fitting in the presence of both and errors has been fraught with misadventure, with statistically ad hoc and poorly tested methods abounding in the literature. The problem stems from the emergence of latent variables describing the "true" values of the independent variables, the priors on which have a significant impact on the regression result. By analytic calculation of maximum a posteriori values and biases, and comprehensive numerical mock tests, we assess the quality of possible priors. In the presence of intrinsic scatter, the only prior that we find to give reliably unbiased results in general is a mixture of one or more Gaussians with means and variances determined as part of the inference. We find that a single Gaussian is typically sufficient and dub this model Marginalised Normal Regression (MNR). We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical and numerical algorithms · Advanced Statistical Methods and Models · Geochemistry and Geologic Mapping
