Dynamic Slate Recommendation with Gated Recurrent Units and Thompson Sampling
Simen Eide, David S. Leslie, Arnoldo Frigessi

TL;DR
This paper presents a Bayesian recurrent neural network for slate recommendation that models user interactions over time, incorporates exploration via Thompson Sampling, and demonstrates improved diversity and click rates in real-world and offline datasets.
Contribution
It introduces a scalable variational Bayesian RNN recommender with hierarchical priors, combines it with bandit strategies, and develops in-slate Thompson Sampling for enhanced exploration.
Findings
Explorative strategies match or outperform greedy methods.
Click rates improve due to increased diversity.
Model effectively learns user preferences and item groupings.
Abstract
We consider the problem of recommending relevant content to users of an internet platform in the form of lists of items, called slates. We introduce a variational Bayesian Recurrent Neural Net recommender system that acts on time series of interactions between the internet platform and the user, and which scales to real world industrial situations. The recommender system is tested both online on real users, and on an offline dataset collected from a Norwegian web-based marketplace, FINN.no, that is made public for research. This is one of the first publicly available datasets which includes all the slates that are presented to users as well as which items (if any) in the slates were clicked on. Such a data set allows us to move beyond the common assumption that implicitly assumes that users are considering all possible items at each interaction. Instead we build our likelihood using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms
