Evaluating Uplift Modeling under Structural Biases: Insights into Metric Stability and Model Robustness

Yuxuan Yang; Dugang Liu; Yiyan Huang

arXiv:2603.20775 · cs.LG · March 24, 2026

TL;DR

This paper systematically evaluates uplift modeling in biased real-world data, highlighting the importance of metric stability and model robustness, and introduces a benchmarking framework using semi-synthetic data to assess performance under structural biases.

Contribution

It introduces a semi-synthetic benchmarking framework for uplift models under biases and provides insights into model robustness and metric stability in such settings.

Findings

01. TARNet shows notable robustness to biases.

02. Uplift prediction and targeting are distinct objectives.

03. Metrics aligned with ATE offer more stable model rankings.
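
To illustrate why uplift prediction and targeting can diverge, here is a minimal sketch on purely synthetic numbers. It is not the paper's benchmark or its metrics: the two "models", the helper names `mse` and `top_k_gain`, and all constants are hypothetical choices made for this example. Model A keeps the true ordering of units but reports uplift on the wrong scale; Model B is close to the truth on average but noisy.

```python
import random

def mse(pred, truth):
    """Mean squared error between predicted and true uplift (prediction quality)."""
    return sum((p - t) ** 2 for p, t in zip(pred, truth)) / len(truth)

def top_k_gain(scores, truth, k):
    """Total true uplift among the k units ranked highest by `scores` (targeting quality)."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sum(truth[i] for i in ranked[:k])

rng = random.Random(0)

# Hypothetical ground-truth uplift for 1000 units (known here only because the data is synthetic).
true_uplift = [rng.uniform(-0.1, 0.3) for _ in range(1000)]

# Model A: a monotone transform of the truth, so the ranking is right but the values are far off.
model_a = [10.0 * u + 5.0 for u in true_uplift]

# Model B: unbiased but noisy, so the values are close on average while the ranking is scrambled.
model_b = [u + rng.gauss(0.0, 0.15) for u in true_uplift]

k = 200
oracle_gain = top_k_gain(true_uplift, true_uplift, k)
gain_a = top_k_gain(model_a, true_uplift, k)
gain_b = top_k_gain(model_b, true_uplift, k)
mse_a = mse(model_a, true_uplift)
mse_b = mse(model_b, true_uplift)

print(f"oracle gain={oracle_gain:.2f}  gain A={gain_a:.2f}  gain B={gain_b:.2f}")
print(f"MSE A={mse_a:.3f}  MSE B={mse_b:.3f}")
```

In this toy setup Model A targets as well as an oracle despite a much larger prediction error, while Model B predicts better and targets worse. A prediction-error metric and a ranking-based metric can therefore order the same two models differently.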

Abstract

In personalized marketing, uplift models estimate incremental effects by modeling how customer behavior changes under alternative treatments. However, real-world data often exhibit biases, such as selection bias, spillover effects, and unobserved confounding, which adversely affect both estimation accuracy and metric validity. Despite the importance of bias-aware assessment, systematic studies remain scarce. To bridge this gap, we design a systematic benchmarking framework. Unlike standard predictive tasks, real-world uplift datasets lack counterfactual ground truth, rendering direct metric validation infeasible. Therefore, a semi-synthetic approach serves as a critical enabler for systematic benchmarking, effectively bridging the gap by retaining real-world feature dependencies while providing the ground truth needed to isolate structural biases. Our investigations reveal that:…
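
The semi-synthetic idea described in the abstract can be sketched in a few lines. The example below is an assumption-laden toy, not the authors' framework: the covariate `x` is simulated as a stand-in for real features, and the outcome functions `mu0` and `tau`, the propensity model, and the `bias_strength` parameter are all hypothetical. Because outcomes are generated from known functions, the true average treatment effect (ATE) is available, so the distortion caused by selection bias can be measured directly.

```python
import math
import random

rng = random.Random(42)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def mean(values):
    values = list(values)
    return sum(values) / len(values)

# Stand-in covariate; in a semi-synthetic benchmark this would come from a real dataset.
n = 20000
x = [rng.gauss(0.0, 1.0) for _ in range(n)]

def mu0(xi):
    """Hypothetical baseline (control) outcome."""
    return 0.5 * xi

def tau(xi):
    """Hypothetical true uplift; known only because we define it ourselves."""
    return 0.1 + 0.05 * xi

def naive_difference_in_means(bias_strength):
    """Assign treatment with propensity sigmoid(bias_strength * x) and return
    the naive estimate mean(Y | T=1) - mean(Y | T=0)."""
    treated, control = [], []
    for xi in x:
        is_treated = rng.random() < sigmoid(bias_strength * xi)
        y = mu0(xi) + (tau(xi) if is_treated else 0.0) + rng.gauss(0.0, 0.1)
        (treated if is_treated else control).append(y)
    return mean(treated) - mean(control)

true_ate = mean(tau(xi) for xi in x)
naive_rct = naive_difference_in_means(0.0)     # propensity 0.5 for everyone: randomized
naive_biased = naive_difference_in_means(2.0)  # high-x units are treated more often

print(f"true ATE={true_ate:.3f}  randomized={naive_rct:.3f}  selection-biased={naive_biased:.3f}")
```

With `bias_strength = 0.0` the assignment is randomized and the naive estimate should sit close to the true ATE. With a positive `bias_strength`, units with a higher baseline outcome are treated more often, and the naive estimate drifts away from the truth. That gap is the kind of quantity a ground-truth benchmark makes observable.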

Taxonomy

Topics: Customer churn and segmentation · Consumer Market Behavior and Pricing · Advanced Causal Inference Techniques