Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

Haruka Kiyohara; Masatoshi Uehara; Yusuke Narita; Nobuyuki Shimizu,; Yasuo Yamamoto; Yuta Saito

arXiv:2306.15098·stat.ML·June 28, 2023

Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

Haruka Kiyohara, Masatoshi Uehara, Yusuke Narita, Nobuyuki Shimizu,, Yasuo Yamamoto, Yuta Saito

PDF

1 Repo

TL;DR

This paper introduces Adaptive IPS (AIPS), a new unbiased and low-variance estimator for off-policy evaluation of ranking policies that accounts for diverse user behaviors, improving accuracy over existing methods.

Contribution

It proposes a general formulation for user behavior in OPE, develops AIPS which is unbiased and variance-minimizing, and provides a data-driven method to select user behavior models.

Findings

01

AIPS achieves lower MSE than existing estimators.

02

Empirical results show significant accuracy improvements.

03

Effective OPE under diverse user behaviors.

Abstract

Ranking interfaces are everywhere in online platforms. There is thus an ever growing interest in their Off-Policy Evaluation (OPE), aiming towards an accurate performance evaluation of ranking policies using logged data. A de-facto approach for OPE is Inverse Propensity Scoring (IPS), which provides an unbiased and consistent value estimate. However, it becomes extremely inaccurate in the ranking setup due to its high variance under large action spaces. To deal with this problem, previous studies assume either independent or cascade user behavior, resulting in some ranking versions of IPS. While these estimators are somewhat effective in reducing the variance, all existing estimators apply a single universal assumption to every user, causing excessive bias and variance. Therefore, this work explores a far more general formulation where user behavior is diverse and can vary depending on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aiueola/kdd2023-aips
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.