Enhancing Treatment Effect Estimation via Active Learning: A   Counterfactual Covering Perspective

Hechuan Wen; Tong Chen; Mingming Gong; Li Kheng Chai; Shazia Sadiq,; Hongzhi Yin

arXiv:2505.05242·cs.LG·May 9, 2025

Enhancing Treatment Effect Estimation via Active Learning: A Counterfactual Covering Perspective

Hechuan Wen, Tong Chen, Mingming Gong, Li Kheng Chai, Shazia Sadiq,, Hongzhi Yin

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel active learning framework for treatment effect estimation that minimizes labeling costs by optimizing factual and counterfactual coverage, supported by theoretical analysis and empirical validation.

Contribution

It formalizes treatment effect estimation as an active learning problem using coverage radii, and proposes FCCM, a new algorithm for effective data acquisition under budget constraints.

Findings

01

FCCM outperforms baselines on synthetic datasets

02

Theoretical bounds guide effective data selection

03

Proposed methods reduce labeling costs significantly

Abstract

Although numerous complex algorithms for treatment effect estimation have been developed in recent years, their effectiveness remains limited when handling insufficiently labeled training sets due to the high cost of labeling the effect after treatment, e.g., expensive tumor imaging or biopsy procedures needed to evaluate treatment effects. Therefore, it becomes essential to actively incorporate more high-quality labeled data, all while adhering to a constrained labeling budget. To enable data-efficient treatment effect estimation, we formalize the problem through rigorous theoretical analysis within the active learning context, where the derived key measures -- \textit{factual} and \textit{counterfactual covering radius} determine the risk upper bound. To reduce the bound, we propose a greedy radius reduction algorithm, which excels under an idealized, balanced data distribution. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uqhwen2/FCCM
pytorchOfficial

Videos

Enhancing Treatment Effect Estimation via Active Learning: A Counterfactual Covering Perspective· slideslive

Taxonomy

TopicsAdvanced Causal Inference Techniques · Machine Learning and Algorithms · Statistical Methods and Inference