A Markov Decision Process Analysis of the Cold Start Problem in Bayesian Information Filtering
Xiaoting Zhao, Peter I. Frazier

TL;DR
This paper models the cold-start problem in Bayesian information filtering as a Markov decision process, deriving optimal and heuristic policies to improve user relevance feedback with limited data.
Contribution
It introduces a Bayesian MDP framework for the cold-start filtering problem and provides efficient algorithms for optimal and heuristic forwarding policies.
Findings
Efficient computation of Bayes-optimal policies when users examine all forwarded items.
Upper bounds and heuristics for limited user examination scenarios.
Simulation results based on real-world arXiv data demonstrate effectiveness.
Abstract
We consider the information filtering problem, in which we face a stream of items, and must decide which ones to forward to a user to maximize the number of relevant items shown, minus a penalty for each irrelevant item shown. Forwarding decisions are made separately in a personalized way for each user. We focus on the cold-start setting for this problem, in which we have limited historical data on the user's preferences, and must rely on feedback from forwarded articles to learn which the fraction of items relevant to the user in each of several item categories. Performing well in this setting requires trading exploration vs. exploitation, forwarding items that are likely to be irrelevant, to allow learning that will improve later performance. In a Bayesian setting, and using Markov decision processes, we show how the Bayes-optimal forwarding algorithm can be computed efficiently when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Mobile Crowdsensing and Crowdsourcing
