Off-policy estimation of linear functionals: Non-asymptotic theory for   semi-parametric efficiency

Wenlong Mou; Martin J. Wainwright; Peter L. Bartlett

arXiv:2209.13075·math.ST·September 28, 2022

Off-policy estimation of linear functionals: Non-asymptotic theory for semi-parametric efficiency

Wenlong Mou, Martin J. Wainwright, Peter L. Bartlett

PDF

Open Access

TL;DR

This paper develops non-asymptotic bounds for estimating linear functionals in observational studies, highlighting the importance of weighted norm error minimization and proposing an optimal two-stage regression procedure.

Contribution

It introduces a non-asymptotic analysis framework for off-policy linear functional estimation and proposes an instance-dependent optimal two-stage method based on weighted norm regression.

Findings

01

Non-asymptotic upper bounds on mean-squared error are derived.

02

Optimal procedures depend on weighted norm error minimization.

03

The proposed method achieves finite-sample optimality via matching lower bounds.

Abstract

The problem of estimating a linear functional based on observational data is canonical in both the causal inference and bandit literatures. We analyze a broad class of two-stage procedures that first estimate the treatment effect function, and then use this quantity to estimate the linear functional. We prove non-asymptotic upper bounds on the mean-squared error of such procedures: these bounds reveal that in order to obtain non-asymptotically optimal procedures, the error in estimating the treatment effect should be minimized in a certain weighted $L^{2}$ -norm. We analyze a two-stage procedure based on constrained regression in this weighted norm, and establish its instance-dependent optimality in finite samples via matching non-asymptotic local minimax lower bounds. These results show that the optimal non-asymptotic risk, in addition to depending on the asymptotically efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Advanced Bandit Algorithms Research