Evaluation Methods and Measures for Causal Learning Algorithms
Lu Cheng, Ruocheng Guo, Raha Moraffah, Paras Sheth, K. Selcuk Candan,, Huan Liu

TL;DR
This survey reviews current evaluation methods, datasets, and challenges in causal learning algorithms, emphasizing the need for standardized benchmarks to advance research in causal machine learning with big data.
Contribution
It provides a comprehensive overview of evaluation procedures, tools, and limitations in causal learning, highlighting the necessity for standardized benchmarks and collaborative efforts.
Findings
Current evaluation methods have significant limitations.
There is a lack of publicly available benchmarks for causal learning.
Developing standardized evaluation protocols is urgent for progress.
Abstract
The convenient access to copious multi-faceted data has encouraged machine learning researchers to reconsider correlation-based learning and embrace the opportunity of causality-based learning, i.e., causal machine learning (causal learning). Recent years have therefore witnessed great effort in developing causal learning algorithms aiming to help AI achieve human-level intelligence. Due to the lack-of ground-truth data, one of the biggest challenges in current causal learning research is algorithm evaluations. This largely impedes the cross-pollination of AI and causal inference, and hinders the two fields to benefit from the advances of the other. To bridge from conventional causal inference (i.e., based on statistical methods) to causal learning with big data (i.e., the intersection of causal inference and machine learning), in this survey, we review commonly-used datasets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference
