Rethinking Distributional IVs: KAN-Powered D-IV-LATE & Model Choice
Charles Shaw

TL;DR
This paper shows that the choice of machine learning models for nuisance functions in causal inference significantly affects results, comparing standard models with a novel KAN-based estimator in a distributional IV context.
Contribution
It introduces a new KAN-powered D-IV-LATE estimator and demonstrates its asymptotic properties and impact on substantive conclusions.
Findings
KAN-based estimator reveals more complex treatment heterogeneity
Model choice can alter causal inference outcomes
Empirical analysis highlights importance of model selection
Abstract
The double/debiased machine learning (DML) framework has become a cornerstone of modern causal inference, allowing researchers to utilise flexible machine learning models for the estimation of nuisance functions without introducing first-order bias into the final parameter estimate. However, the choice of machine learning model for the nuisance functions is often treated as a minor implementation detail. In this paper, we argue that this choice can have a profound impact on the substantive conclusions of the analysis. We demonstrate this by presenting and comparing two distinct Distributional Instrumental Variable Local Average Treatment Effect (D-IV-LATE) estimators. The first estimator leverages standard machine learning models like Random Forests for nuisance function estimation, while the second is a novel estimator employing Kolmogorov-Arnold Networks (KANs). We establish the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications
