Off-policy estimation of linear functionals: Non-asymptotic theory for semi-parametric efficiency
Wenlong Mou, Martin J. Wainwright, Peter L. Bartlett

TL;DR
This paper develops non-asymptotic bounds for estimating linear functionals in observational studies, highlighting the importance of weighted norm error minimization and proposing an optimal two-stage regression procedure.
Contribution
It introduces a non-asymptotic analysis framework for off-policy linear functional estimation and proposes an instance-dependent optimal two-stage method based on weighted norm regression.
Findings
Non-asymptotic upper bounds on mean-squared error are derived.
Optimal procedures depend on weighted norm error minimization.
The proposed method achieves finite-sample optimality via matching lower bounds.
Abstract
The problem of estimating a linear functional based on observational data is canonical in both the causal inference and bandit literatures. We analyze a broad class of two-stage procedures that first estimate the treatment effect function, and then use this quantity to estimate the linear functional. We prove non-asymptotic upper bounds on the mean-squared error of such procedures: these bounds reveal that in order to obtain non-asymptotically optimal procedures, the error in estimating the treatment effect should be minimized in a certain weighted -norm. We analyze a two-stage procedure based on constrained regression in this weighted norm, and establish its instance-dependent optimality in finite samples via matching non-asymptotic local minimax lower bounds. These results show that the optimal non-asymptotic risk, in addition to depending on the asymptotically efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Advanced Bandit Algorithms Research
