The Identification Power of Combining Experimental and Observational Data for Distributional Treatment Effect Parameters
Shosei Sakaguchi

TL;DR
This paper explores how combining experimental and observational data enhances the identification of distributional treatment effect parameters, especially under self-selection, by deriving sharp bounds and proposing computational methods.
Contribution
It provides nonparametric bounds for DTE parameters using combined data, clarifies the role of self-selection, and introduces a linear programming approach for sharp bounds with structural restrictions.
Findings
Combining data sources tightens the identified set for DTE parameters.
Self-selection significantly enhances identification power.
The approach is demonstrated with an empirical application on campaign ads.
Abstract
This study investigates the identification power gained by combining experimental data, in which treatment is randomized, with observational data, in which treatment is self-selected, for distributional treatment effect (DTE) parameters. While experimental data identify average treatment effects, many DTE parameters, such as the distribution of individual treatment effects, are only partially identified. We examine whether and how combining these two data sources tightens the identified set for such parameters. For broad classes of DTE parameters, we derive nonparametric sharp bounds under the combined data and clarify the mechanism through which data combination improves identification relative to using experimental data alone. Our analysis highlights that self-selection in observational data is a key source of identification power. We establish necessary and sufficient conditions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
