Confidence Intervals for Linear Models with Arbitrary Noise Contamination

Dong Xie; Chao Gao; John Lafferty

arXiv:2511.07605·math.ST·April 3, 2026

Confidence Intervals for Linear Models with Arbitrary Noise Contamination

Dong Xie, Chao Gao, John Lafferty

PDF

TL;DR

This paper introduces a new method for constructing confidence intervals for linear regression coefficients under arbitrary noise contamination, achieving optimal length without knowing the contamination level.

Contribution

It develops a novel Z-estimation based algorithm that provides valid, adaptive confidence intervals in contaminated linear models without prior contamination knowledge.

Findings

01

Confidence intervals have valid coverage under all contamination distributions.

02

Achieves an optimal length of order O(1/√(n(1-ε)^2)), matching known contamination scenarios.

03

Contrasts with Gaussian models by avoiding adaptation costs in robust interval estimation.

Abstract

We study confidence interval construction for linear regression under Huber's contamination model, where an unknown fraction of noise variables is arbitrarily corrupted. While robust point estimation in this setting is well understood, statistical inference remains challenging, especially because the contamination proportion is not identifiable from the data. We develop a new algorithm that constructs confidence intervals for individual regression coefficients without any prior knowledge of the contamination level. Our method is based on a Z-estimation framework using a smooth estimating function. The method directly quantifies the uncertainty of the estimating equation after a preprocessing step that decorrelates covariates associated with the nuisance parameters. We show that the resulting confidence interval has valid coverage uniformly over all contamination distributions and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.