Representation Learning in Low-rank Slate-based Recommender Systems

Yijia Dai; Wen Sun

arXiv:2309.08622·cs.IR·September 20, 2023

Representation Learning in Low-rank Slate-based Recommender Systems

Yijia Dai, Wen Sun

PDF

Open Access

TL;DR

This paper introduces a sample-efficient representation learning algorithm for slate-based recommender systems modeled as low-rank MDPs, aiming to improve reinforcement learning efficiency in large state-action environments.

Contribution

It proposes a novel RL algorithm tailored for low-rank slate recommendation environments and constructs a simulation setup for evaluation.

Findings

01

Effective in large state-action spaces

02

Improves sample efficiency in RL for recommendations

03

Provides a new simulation environment for testing

Abstract

Reinforcement learning (RL) in recommendation systems offers the potential to optimize recommendations for long-term user engagement. However, the environment often involves large state and action spaces, which makes it hard to efficiently learn and explore. In this work, we propose a sample-efficient representation learning algorithm, using the standard slate recommendation setup, to treat this as an online RL problem with low-rank Markov decision processes (MDPs). We also construct the recommender simulation environment with the proposed setup and sampling method.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Reinforcement Learning in Robotics