The Bayesian Linear Information Filtering Problem

Bangrui Chen; Peter I. Frazier

arXiv:1605.09088·cs.LG·October 25, 2016

The Bayesian Linear Information Filtering Problem

Bangrui Chen, Peter I. Frazier

PDF

TL;DR

This paper formulates the information filtering problem as a Bayesian sequential decision process, introduces new heuristic policies, and demonstrates their near-optimal performance through theoretical bounds and empirical evaluation.

Contribution

It introduces a Bayesian linear model for user preferences, develops new heuristic policies, and provides a computational upper bound to evaluate their optimality gap.

Findings

01

DTD-UCB and DTD-DP outperform benchmarks

02

Performance of heuristics is close to the theoretical upper bound

03

Significant improvement demonstrated on real and simulated data

Abstract

We present a Bayesian sequential decision-making formulation of the information filtering problem, in which an algorithm presents items (news articles, scientific papers, tweets) arriving in a stream, and learns relevance from user feedback on presented items. We model user preferences using a Bayesian linear model, similar in spirit to a Bayesian linear bandit. We compute a computational upper bound on the value of the optimal policy, which allows computing an optimality gap for implementable policies. We then use this analysis as motivation in introducing a pair of new Decompose-Then-Decide (DTD) heuristic policies, DTD-Dynamic-Programming (DTD-DP) and DTD-Upper-Confidence-Bound (DTD-UCB). We compare DTD-DP and DTD-UCB against several benchmarks on real and simulated data, demonstrating significant improvement, and show that the achieved performance is close to the upper bound.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.