On Causal Inference with Model-Based Outcomes
Dmitry Arkhangelsky, Kazuharu Yanagimoto, Tom Zohar

TL;DR
This paper addresses the challenge of estimating causal effects on group-level parameters from microdata, highlighting biases in standard methods and proposing a two-step Minimum Distance framework for more accurate and transparent policy evaluation.
Contribution
It introduces a two-step MD approach that corrects endogenous weighting bias and improves causal inference in group-level analyses, especially with small sample sizes.
Findings
Standard methods are biased due to endogenous weighting.
The two-step MD approach reduces bias and improves estimation accuracy.
Application to Dutch childcare reform shows significant differences from conventional estimates.
Abstract
We study the estimation of causal effects on group-level parameters identified from microdata (e.g., child penalties). We demonstrate that standard one-step methods (such as pooled OLS and IV regressions) are generally inconsistent due to an endogenous weighting bias, where the policy affects the implicit weights (e.g., altering fertility rates). In contrast, we advocate for a two-step Minimum Distance (MD) framework that explicitly separates parameter identification from policy evaluation. This approach eliminates the endogenous weighting bias and requires explicitly confronting sample selection when groups are small, thereby improving transparency. We show that the MD estimator performs well when parameters can be estimated for most groups, and propose a robust alternative that uses auxiliary information in settings with limited data. To illustrate the importance of this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEarly Childhood Education and Development
