Predicting treatment effects from observational studies using machine learning methods: A simulation study
Bevan I. Smith, Charles Chimedza

TL;DR
This study evaluates machine learning methods for estimating treatment effects from observational data through simulations, revealing they perform well with linear relationships but poorly with non-linear data, especially under confounding.
Contribution
It provides a systematic simulation-based assessment of machine learning counterfactual prediction methods, highlighting their limitations with non-linear data and confounding.
Findings
Machine learning models perform well on linear data.
Performance drops significantly with non-linear relationships.
Confounding does not significantly affect linear data performance.
Abstract
Measuring treatment effects in observational studies is challenging because of confounding bias. Confounding occurs when a variable affects both the treatment and the outcome. Traditional methods such as propensity score matching estimate treatment effects by conditioning on the confounders. Recent literature has presented new methods that use machine learning to predict the counterfactuals in observational studies which then allow for estimating treatment effects. These studies however, have been applied to real world data where the true treatment effects have not been known. This study aimed to study the effectiveness of this counterfactual prediction method by simulating two main scenarios: with and without confounding. Each type also included linear and non-linear relationships between input and output data. The key item in the simulations was that we generated known true causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Health Systems, Economic Evaluations, Quality of Life · Statistical Methods and Bayesian Inference
MethodsCounterfactuals Explanations
