Identifying counterfactual probabilities using bivariate distributions and uplift modeling
Th\'eo Verhelst, Gianluca Bontempi

TL;DR
This paper introduces a novel method for estimating joint distributions of potential outcomes using uplift modeling and bivariate beta distributions, enhancing counterfactual analysis without additional causal assumptions.
Contribution
It proposes a counterfactual estimator that leverages uplift scores with bivariate beta distributions, providing richer causal insights.
Findings
Effective in simulations for customer churn analysis
Reveals insights beyond standard ML and uplift models
Requires no additional causal assumptions
Abstract
Uplift modeling estimates the causal effect of an intervention as the difference between potential outcomes under treatment and control, whereas counterfactual identification aims to recover the joint distribution of these potential outcomes (e.g., "Would this customer still have churned had we given them a marketing offer?"). This joint counterfactual distribution provides richer information than the uplift but is harder to estimate. However, the two approaches are synergistic: uplift models can be leveraged for counterfactual estimation. We propose a counterfactual estimator that fits a bivariate beta distribution to predicted uplift scores, yielding posterior distributions over counterfactual outcomes. Our approach requires no causal assumptions beyond those of uplift modeling. Simulations show the efficacy of the approach, which can be applied, for example, to the problem of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCustomer churn and segmentation · Advanced Causal Inference Techniques · Consumer Market Behavior and Pricing
