Can predictive models be used for causal inference?
Maximilian Pichler, Florian Hartig

TL;DR
This paper demonstrates that with causal feature selection, machine learning and deep learning models can be used for more accurate causal inference and better generalization, challenging the idea that prediction and explanation are fundamentally at odds.
Contribution
It shows that constraining ML and DL models with causal feature selection allows near-unbiased effect estimation and improves generalization, bridging the gap between prediction and causal inference.
Findings
Causal feature selection reduces bias in effect estimates.
Hyperparameter tuning differs for prediction versus inference.
Causally constrained models generalize better to new data.
Abstract
Supervised machine learning (ML) and deep learning (DL) algorithms excel at predictive tasks, but it is commonly assumed that they often do so by exploiting non-causal correlations, which may limit both interpretability and generalizability. Here, we show that this trade-off between explanation and prediction is not as deep and fundamental as expected. Whereas ML and DL algorithms will indeed tend to use non-causal features for prediction when fed indiscriminately with all data, it is possible to constrain the learning process of any ML and DL algorithm by selecting features according to Pearl's backdoor adjustment criterion. In such a situation, some algorithms, in particular deep neural networks, can provide near unbiased effect estimates under feature collinearity. Remaining biases are explained by the specific algorithmic structures as well as hyperparameter choice. Consequently,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Bayesian Modeling and Causal Inference
