Deep Reinforcement Learning for List-wise Recommendations

Xiangyu Zhao; Liang Zhang; Long Xia; Zhuoye Ding; Dawei; Yin; Jiliang Tang

arXiv:1801.00209·cs.LG·June 28, 2019·109 cites

Deep Reinforcement Learning for List-wise Recommendations

Xiangyu Zhao, Liang Zhang, Long Xia, Zhuoye Ding, Dawei, Yin, Jiliang Tang

PDF

Open Access 5 Repos

TL;DR

This paper introduces a reinforcement learning-based recommender system that dynamically improves its strategies through user interactions, utilizing list-wise recommendations and an online simulation environment to enhance personalization.

Contribution

It presents a novel RL framework for list-wise recommendations that adaptively learns from user feedback and incorporates list-wide strategies, validated on real-world e-commerce data.

Findings

01

Reinforcement learning effectively improves recommendation strategies.

02

List-wise recommendations outperform point-wise approaches.

03

The online environment simulator enhances offline training and evaluation.

Abstract

Recommender systems play a crucial role in mitigating the problem of information overload by suggesting users' personalized items or services. The vast majority of traditional recommender systems consider the recommendation procedure as a static process and make recommendations following a fixed strategy. In this paper, we propose a novel recommender system with the capability of continuously improving its strategies during the interactions with users. We model the sequential interactions between users and a recommender system as a Markov Decision Process (MDP) and leverage Reinforcement Learning (RL) to automatically learn the optimal strategies via recommending trial-and-error items and receiving reinforcements of these items from users' feedbacks. In particular, we introduce an online user-agent interacting environment simulator, which can pre-train and evaluate model parameters…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Expert finding and Q&A systems