Efficient estimation and data fusion under general semiparametric restrictions on outcome mean functions
Harrison H. Li

TL;DR
This paper develops a unified framework for efficient estimation in supervised learning with semiparametric restrictions on outcome mean functions, enabling improved data fusion and causal inference from combined experimental and observational datasets.
Contribution
It introduces a novel characterization of semiparametric efficiency that unifies various assumptions on bias, leading to the construction of optimal estimators for data fusion and causal effect estimation.
Findings
Efficient estimators outperform existing methods in biased observational data scenarios.
The framework applies to a wide range of semiparametric restrictions.
Numerical studies demonstrate superior performance of the proposed estimators.
Abstract
We provide a novel characterization of semiparametric efficiency in a generic supervised learning setting where the outcome mean function -- defined as the conditional expectation of the outcome of interest given the other observed variables -- is restricted to lie in some known semiparametric function class. The primary motivation is causal inference where a researcher running a randomized controlled trial often has access to an auxiliary observational dataset that is confounded or otherwise biased for estimating causal effects. Prior work has imposed various bespoke assumptions on this bias in an attempt to improve precision via data fusion. We show how many of these assumptions can be formulated as restrictions on the outcome mean function in the concatenation of the experimental and observational datasets. Then our theory provides a unified framework to maximally leverage such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference
