Sequential Matrix Completion
Annie Marsden, Sergio Bacallado

TL;DR
This paper introduces a sequential matrix completion algorithm using bandit policies, notably Information-Directed Sampling, for recommender systems, demonstrating its effectiveness on large real datasets and highlighting challenges for theoretical regret bounds.
Contribution
First implementation of Information-Directed Sampling in large-scale matrix completion, advancing bandit-based collaborative filtering methods with practical experimental validation.
Findings
Information-Directed Sampling outperforms Thompson Sampling in simulations
The approach achieves state-of-the-art performance on real datasets
Challenges remain in establishing finite-horizon regret bounds
Abstract
We propose a novel algorithm for sequential matrix completion in a recommender system setting, where the th entry of the matrix corresponds to a user 's rating of product . The objective of the algorithm is to provide a sequential policy for user-product pair recommendation which will yield the highest possible ratings after a finite time horizon. The algorithm uses a Gamma process factor model with two posterior-focused bandit policies, Thompson Sampling and Information-Directed Sampling. While Thompson Sampling shows competitive performance in simulations, state-of-the-art performance is obtained from Information-Directed Sampling, which makes its recommendations based off a ratio between the expected reward and a measure of information gain. To our knowledge, this is the first implementation of Information Directed Sampling on large real datasets. This approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Neural Networks and Applications
