Provably Auditing Ordinary Least Squares in Low Dimensions
Ankur Moitra, Dhruv Rohatgi

TL;DR
This paper develops efficient algorithms to accurately estimate and certify the stability of Ordinary Least Squares regression conclusions in low-dimensional settings, improving upon existing heuristics.
Contribution
It introduces provably efficient algorithms for estimating a global stability metric of OLS in low dimensions, with theoretical guarantees and practical improvements.
Findings
Algorithms provide tighter stability bounds than heuristics.
Able to certify stability even after removing a majority of samples.
Applied to Boston Housing dataset with successful stability estimation.
Abstract
Measuring the stability of conclusions derived from Ordinary Least Squares linear regression is critically important, but most metrics either only measure local stability (i.e. against infinitesimal changes in the data), or are only interpretable under statistical assumptions. Recent work proposes a simple, global, finite-sample stability metric: the minimum number of samples that need to be removed so that rerunning the analysis overturns the conclusion, specifically meaning that the sign of a particular coefficient of the estimated regressor changes. However, besides the trivial exponential-time algorithm, the only approach for computing this metric is a greedy heuristic that lacks provable guarantees under reasonable, verifiable assumptions; the heuristic provides a loose upper bound on the stability and also cannot certify lower bounds on it. We show that in the low-dimensional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStatistical Methods and Inference · Statistical and numerical algorithms · Advanced Statistical Methods and Models
MethodsLinear Regression
