Design-based finite-sample analysis for regression adjustment
Dogyoon Song

TL;DR
This paper develops a finite-sample, design-based framework for regression adjustment in randomized experiments, providing valid confidence intervals even in high-dimensional settings where covariates outnumber observations.
Contribution
It introduces a non-asymptotic analysis that yields explicit, instance-adaptive confidence intervals for the regression-adjusted ATE estimator in high-dimensional regimes.
Findings
Finite-sample confidence intervals are valid even when p > n.
Covariate geometry influences the concentration and bias of the estimator.
The approach uses variance-adaptive martingales and Stein's method for analysis.
Abstract
In randomized experiments, regression adjustment can improve the precision of average treatment effect (ATE) estimation using covariates without requiring a correctly specified outcome model. Although well studied in low-dimensional settings, its behavior in high-dimensional regimes, where the number of covariates may exceed the number of observations , remains underexplored. Moreover, existing analyses are largely asymptotic, providing limited guidance for finite-sample inference. We develop a design-based, non-asymptotic framework for analyzing the regression-adjusted ATE estimator under complete randomization. This yields finite-sample-valid confidence intervals with explicit, instance-adaptive widths, even when . While these intervals rely on oracle (population-level) quantities, we also outline data-driven envelopes computable from observed data. Our approach hinges…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
