On the Opportunities and Challenges of Offline Reinforcement Learning for Recommender Systems
Xiaocong Chen, Siyu Wang, Julian McAuley, Dietmar Jannach, Lina Yao

TL;DR
This paper reviews the emerging application of offline reinforcement learning in recommender systems, highlighting its potential to address data efficiency issues and discussing current challenges and future research directions.
Contribution
It provides an inclusive survey of offline reinforcement learning techniques applied to recommender systems, emphasizing challenges, opportunities, and future research pathways.
Findings
Offline RL can leverage existing datasets to improve recommender systems.
Current research in offline RL for recommender systems is limited but promising.
Identifies key challenges and future opportunities in the field.
Abstract
Reinforcement learning serves as a potent tool for modeling dynamic user interests within recommender systems, garnering increasing research attention of late. However, a significant drawback persists: its poor data efficiency, stemming from its interactive nature. The training of reinforcement learning-based recommender systems demands expensive online interactions to amass adequate trajectories, essential for agents to learn user preferences. This inefficiency renders reinforcement learning-based recommender systems a formidable undertaking, necessitating the exploration of potential solutions. Recent strides in offline reinforcement learning present a new perspective. Offline reinforcement learning empowers agents to glean insights from offline datasets and deploy learned policies in online settings. Given that recommender systems possess extensive offline datasets, the framework of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Mental Health Interventions · Smart Grid Energy Management · Advanced Bandit Algorithms Research
