Choosing the Best of Both Worlds: Diverse and Novel Recommendations through Multi-Objective Reinforcement Learning
Dusan Stamenkovic, Alexandros Karatzoglou, Ioannis Arapakis, Xin Xin,, Kleomenis Katevas

TL;DR
This paper introduces a multi-objective reinforcement learning framework for session-based recommender systems that balances accuracy, diversity, and novelty, improving user experience by reducing filter bubbles.
Contribution
The paper presents SMORL, a novel RL framework that integrates multiple objectives into recommendation models, addressing the limitations of single-objective approaches.
Findings
Significant increase in recommendation diversity.
Moderate improvement in recommendation accuracy.
Reduced repetitiveness of recommendations.
Abstract
Since the inception of Recommender Systems (RS), the accuracy of the recommendations in terms of relevance has been the golden criterion for evaluating the quality of RS algorithms. However, by focusing on item relevance, one pays a significant price in terms of other important metrics: users get stuck in a "filter bubble" and their array of options is significantly reduced, hence degrading the quality of the user experience and leading to churn. Recommendation, and in particular session-based/sequential recommendation, is a complex task with multiple - and often conflicting objectives - that existing state-of-the-art approaches fail to address. In this work, we take on the aforementioned challenge and introduce Scalarized Multi-Objective Reinforcement Learning (SMORL) for the RS setting, a novel Reinforcement Learning (RL) framework that can effectively address multi-objective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research
