Offline Recommender System Evaluation under Unobserved Confounding
Olivier Jeunen, Ben London

TL;DR
This paper investigates the impact of unobserved confounders on off-policy evaluation in recommender systems, highlighting biases and diagnostic limitations, and emphasizing the need for awareness and mitigation strategies.
Contribution
It characterizes the bias caused by unobserved confounders in off-policy estimators and demonstrates the limitations of existing diagnostics in detecting such biases.
Findings
Naive propensity estimation under confounding leads to severely biased metrics.
Existing diagnostics are ineffective at uncovering biases caused by unobserved confounders.
Bias depends on true, unobserved logging propensities, making it non-identifiable.
Abstract
Off-Policy Estimation (OPE) methods allow us to learn and evaluate decision-making policies from logged data. This makes them an attractive choice for the offline evaluation of recommender systems, and several recent works have reported successful adoption of OPE methods to this end. An important assumption that makes this work is the absence of unobserved confounders: random variables that influence both actions and rewards at data collection time. Because the data collection policy is typically under the practitioner's control, the unconfoundedness assumption is often left implicit, and its violations are rarely dealt with in the existing literature. This work aims to highlight the problems that arise when performing off-policy estimation in the presence of unobserved confounders, specifically focusing on a recommendation use-case. We focus on policy-based estimators, where the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Causal Inference Techniques · Distributed Sensor Networks and Detection Algorithms
MethodsFocus
