Calibration Matters: Tackling Maximization Bias in Large-scale Advertising Recommendation Systems
Yewen Fan, Nian Si, Kun Zhang

TL;DR
This paper addresses the issue of maximization bias in calibration for large-scale advertising recommendation systems, proposing a variance-adjusting debiasing algorithm that improves calibration accuracy without extra online costs.
Contribution
The paper introduces a novel variance-adjusting debiasing (VAD) algorithm to mitigate maximization bias, especially under covariate shifts, with theoretical analysis and practical validation.
Findings
VAD effectively reduces maximization bias in calibration.
The algorithm maintains ranking performance without additional online costs.
Empirical results show improved calibration on large-scale real-world data.
Abstract
Calibration is defined as the ratio of the average predicted click rate to the true click rate. The optimization of calibration is essential to many online advertising recommendation systems because it directly affects the downstream bids in ads auctions and the amount of money charged to advertisers. Despite its importance, calibration optimization often suffers from a problem called "maximization bias". Maximization bias refers to the phenomenon that the maximum of predicted values overestimates the true maximum. The problem is introduced because the calibration is computed on the set selected by the prediction model itself. It persists even if unbiased predictions can be achieved on every datapoint and worsens when covariate shifts exist between the training and test sets. To mitigate this problem, we theorize the quantification of maximization bias and propose a variance-adjusting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Consumer Market Behavior and Pricing
