An evaluation framework for comparing causal inference models
Niki Kiriakidou, Christos Diou

TL;DR
This paper introduces a comprehensive evaluation framework for causal inference models that incorporates statistical testing and performance profiling to provide more robust comparisons beyond traditional error metrics.
Contribution
It proposes a novel evaluation methodology using statistical evidence and performance profiles to better compare causal inference models, reducing the influence of outliers.
Findings
The framework effectively distinguishes model performance differences.
It reduces the impact of outliers on model comparison.
The methodology provides more reliable evaluation results.
Abstract
Estimation of causal effects is the core objective of many scientific disciplines. However, it remains a challenging task, especially when the effects are estimated from observational data. Recently, several promising machine learning models have been proposed for causal effect estimation. The evaluation of these models has been based on the mean values of the error of the Average Treatment Effect (ATE) as well as of the Precision in Estimation of Heterogeneous Effect (PEHE). In this paper, we propose to complement the evaluation of causal inference models using concrete statistical evidence, including the performance profiles of Dolan and Mor{\'e}, as well as non-parametric and post-hoc statistical tests. The main motivation behind this approach is the elimination of the influence of a small number of instances or simulation on the benchmarking process, which in some cases dominate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques
