Evaluating Uplift Modeling under Structural Biases: Insights into Metric Stability and Model Robustness
Yuxuan Yang, Dugang Liu, Yiyan Huang

TL;DR
This paper systematically evaluates uplift modeling in biased real-world data, highlighting the importance of metric stability and model robustness, and introduces a benchmarking framework using semi-synthetic data to assess performance under structural biases.
Contribution
It introduces a semi-synthetic benchmarking framework for uplift models under biases and provides insights into model robustness and metric stability in such settings.
Findings
TARNet shows notable robustness to biases.
Uplift prediction and targeting are distinct objectives.
Metrics aligned with ATE offer more stable model rankings.
Abstract
In personalized marketing, uplift models estimate incremental effects by modeling how customer behavior changes under alternative treatments. However, real-world data often exhibit biases - such as selection bias, spillover effects, and unobserved confounding - which adversely affect both estimation accuracy and metric validity. Despite the importance of bias-aware assessment, a lack of systematic studies persists. To bridge this gap, we design a systematic benchmarking framework. Unlike standard predictive tasks, real-world uplift datasets lack counterfactual ground truth, rendering direct metric validation infeasible. Therefore, a semi-synthetic approach serves as a critical enabler for systematic benchmarking, effectively bridging the gap by retaining real-world feature dependencies while providing the ground truth needed to isolate structural biases. Our investigations reveal that:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCustomer churn and segmentation · Consumer Market Behavior and Pricing · Advanced Causal Inference Techniques
