Progressive Generalization Risk Reduction for Data-Efficient Causal Effect Estimation
Hechuan Wen, Tong Chen, Guanhua Ye, Li Kheng Chai, Shazia Sadiq, and, Hongzhi Yin

TL;DR
This paper introduces a novel active learning framework for causal effect estimation in high-stakes domains with limited labeled data, progressively reducing generalization risk through strategic sample acquisition.
Contribution
It presents the MACAL algorithm that actively selects samples to improve causal effect estimation under limited labeling budgets, supported by theoretical risk analysis.
Findings
Theoretical analysis of generalization risk reduction via progressive data acquisition.
Development of the MACAL algorithm for batch-wise label selection.
Empirical results demonstrating improved causal effect estimation with limited labels.
Abstract
Causal effect estimation (CEE) provides a crucial tool for predicting the unobserved counterfactual outcome for an entity. As CEE relaxes the requirement for ``perfect'' counterfactual samples (e.g., patients with identical attributes and only differ in treatments received) that are impractical to obtain and can instead operate on observational data, it is usually used in high-stake domains like medical treatment effect prediction. Nevertheless, in those high-stake domains, gathering a decently sized, fully labelled observational dataset remains challenging due to hurdles associated with costs, ethics, expertise and time needed, etc., of which medical treatment surveys are a typical example. Consequently, if the training dataset is small in scale, low generalization risks can hardly be achieved on any CEE algorithms. Unlike existing CEE methods that assume the constant availability of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Control Systems and Identification · Bayesian Modeling and Causal Inference
