Optimizing Feature Selection in Causal Inference: A Three-Stage Computational Framework for Unbiased Estimation
Tianyu Yang, Md. Noor-E-Alam

TL;DR
This paper introduces a three-stage feature selection framework for causal inference that improves the accuracy and robustness of causal estimates by effectively selecting relevant variables, demonstrated through synthetic and real-world healthcare data.
Contribution
The paper proposes a novel three-stage feature selection framework that outperforms existing methods in reducing bias and variance in causal inference, especially for large-scale datasets.
Findings
Superior performance in synthetic data experiments
Lower bias and variance in causal estimates
Scalable to large datasets with feasible computation time
Abstract
Feature selection is an important but challenging task in causal inference for obtaining unbiased estimates of causal quantities. Properly selected features in causal inference not only significantly reduce the time required to implement a matching algorithm but, more importantly, can also reduce the bias and variance when estimating causal quantities. When feature selection techniques are applied in causal inference, the crucial criterion is to select variables that, when used for matching, can achieve an unbiased and robust estimation of causal quantities. Recent research suggests that balancing only on treatment-associated variables introduces bias while balancing on spurious variables increases variance. To address this issue, we propose an enhanced three-stage framework that shows a significant improvement in selecting the desired subset of variables compared to the existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Fault Detection and Control Systems · Machine Learning and Data Classification
MethodsFeature Selection · Causal inference
