Counterfactual Evaluation of Slate Recommendations with Sequential   Reward Interactions

James McInerney; Brian Brost; Praveen Chandar; Rishabh Mehrotra; Ben; Carterette

arXiv:2007.12986·cs.LG·August 25, 2020

Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions

James McInerney, Brian Brost, Praveen Chandar, Rishabh Mehrotra, Ben, Carterette

PDF

1 Repo

TL;DR

This paper introduces a new counterfactual evaluation method for sequential slate recommendations that reduces variance and relaxes independence assumptions, improving bias and data efficiency.

Contribution

A novel counterfactual estimator for sequential rewards that leverages causal graphical assumptions to improve evaluation accuracy in recommendation systems.

Findings

01

Outperforms existing methods in simulation tests.

02

Achieves lower bias in reward estimation.

03

Demonstrates improved data efficiency in live system.

Abstract

Users of music streaming, video streaming, news recommendation, and e-commerce services often engage with content in a sequential manner. Providing and evaluating good sequences of recommendations is therefore a central problem for these services. Prior reweighting-based counterfactual evaluation methods either suffer from high variance or make strong independence assumptions about rewards. We propose a new counterfactual estimator that allows for sequential interactions in the rewards with lower variance in an asymptotically unbiased manner. Our method uses graphical assumptions about the causal relationships of the slate to reweight the rewards in the logging policy in a way that approximates the expected sum of rewards under the target policy. Extensive experiments in simulation and on a live recommender system show that our approach outperforms existing methods in terms of bias and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

spotify-research/RIPS_KDD2020
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.