Structural DID with ML: Theory, Simulation, and a Roadmap for Applied Research
Yile Yu, Anzhi Xu, Yi Wang

TL;DR
This paper introduces S-DIDML, a novel framework combining structural causal inference with machine learning to improve high-dimensional panel data analysis in social sciences.
Contribution
It develops an integrated architecture that retains DID structure while addressing high-dimensional confounders using orthogonalization and causal forests.
Findings
Enables precise identification of policy-sensitive groups.
Provides a standardized implementation process in Stata.
Enhances causal inference in complex intervention scenarios.
Abstract
Causal inference in observational panel data has become a central concern in economics,policy analysis,and the broader social sciences.To address the core contradiction where traditional difference-in-differences (DID) struggles with high-dimensional confounding variables in observational panel data,while machine learning (ML) lacks causal structure interpretability,this paper proposes an innovative framework called S-DIDML that integrates structural identification with high-dimensional estimation.Building upon the structure of traditional DID methods,S-DIDML employs structured residual orthogonalization techniques (Neyman orthogonality+cross-fitting) to retain the group-time treatment effect (ATT) identification structure while resolving high-dimensional covariate interference issues.It designs a dynamic heterogeneity estimation module combining causal forests and semi-parametric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications · Impact of AI and Big Data on Business and Society
