Unbiased Offline Evaluation of Contextual-bandit-based News Article   Recommendation Algorithms

Lihong Li; Wei Chu; John Langford; Xuanhui Wang

arXiv:1003.5956·cs.LG·March 13, 2015·1 cites

Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms

Lihong Li, Wei Chu, John Langford, Xuanhui Wang

PDF

Open Access 5 Repos

TL;DR

This paper introduces a data-driven, unbiased offline evaluation method for contextual bandit algorithms in news recommendation, avoiding simulator biases and aligning well with online results.

Contribution

The paper presents a novel replay methodology for unbiased offline evaluation of contextual bandit algorithms, which is simple, adaptable, and provably accurate.

Findings

01

Empirical results match theoretical predictions.

02

Offline replay evaluation aligns with online performance.

03

Method outperforms traditional simulator-based evaluations.

Abstract

Contextual bandit algorithms have become popular for online recommendation systems such as Digg, Yahoo! Buzz, and news recommendation in general. \emph{Offline} evaluation of the effectiveness of new algorithms in these applications is critical for protecting online user experiences but very challenging due to their "partial-label" nature. Common practice is to create a simulator which simulates the online environment for the problem at hand and then run an algorithm against this simulator. However, creating simulator itself is often difficult and modeling bias is usually unavoidably introduced. In this paper, we introduce a \emph{replay} methodology for contextual bandit algorithm evaluation. Different from simulator-based approaches, our method is completely data-driven and very easy to adapt to different applications. More importantly, our method can provide provably unbiased…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Multimodal Machine Learning Applications