Double Machine Learning for Static Panel Data with Instrumental Variables: New Method and Applications
Anna Baiardi, Paul S. Clarke, Andrea A. Naghi, Annalivia Polselli

TL;DR
This paper introduces a novel Double Machine Learning estimator for static panel data with endogenous treatments, enhancing causal inference accuracy and reliability, especially with high-dimensional covariates and weak instruments.
Contribution
It develops a new panel IV DML method, provides diagnostics for weak identification, and applies it to migration studies, demonstrating improved estimation and inference over traditional methods.
Findings
Panel IV DML strengthens instrument predictive power.
Flexible adjustment can weaken instruments, affecting causal inference.
Monte Carlo simulations confirm improved accuracy and reliability.
Abstract
Panel data methods are widely used in empirical analysis to address unobserved heterogeneity, but causal inference remains challenging when treatments are endogenous and confounding variables high-dimensional and potentially nonlinear. Standard instrumental variables (IV) estimators, such as two-stage least squares (2SLS), become unreliable when instrument validity requires flexibly conditioning on many covariates with potentially non-linear effects. This paper develops a Double Machine Learning estimator for static panel models with endogenous treatments (panel IV DML), and introduces weak-identification diagnostics for it. We revisit three influential migration studies that use shift-share instruments. In these settings, instrument validity depends on a rich covariate adjustment. In one application, panel IV DML strengthens the predictive power of the instrument and broadly confirms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Spatial and Panel Data Analysis · Psychometric Methodologies and Testing
