D2RLIR : an improved and diversified ranking function in interactive   recommendation systems based on deep reinforcement learning

Vahid Baghi; Seyed Mohammad Seyed Motehayeri; Ali Moeini; Rooholah; Abedian

arXiv:2110.15089·cs.IR·November 1, 2021

D2RLIR : an improved and diversified ranking function in interactive recommendation systems based on deep reinforcement learning

Vahid Baghi, Seyed Mohammad Seyed Motehayeri, Ali Moeini, Rooholah, Abedian

PDF

Open Access

TL;DR

This paper introduces D2RLIR, a deep reinforcement learning-based interactive recommendation system that enhances ranking efficiency and diversity by combining Actor-Critic architecture, similarity search, and diversity algorithms, validated on MovieLens.

Contribution

It presents a novel recommendation framework integrating Actor-Critic RL, similarity search, and diversity optimization, addressing efficiency and diversity issues in interactive systems.

Findings

01

Improved recommendation diversity and relevance on MovieLens dataset.

02

Enhanced ranking efficiency using Spotify's ANNoy algorithm.

03

Effective modeling of user interactions with positional encoding.

Abstract

Recently, interactive recommendation systems based on reinforcement learning have been attended by researchers due to the consider recommendation procedure as a dynamic process and update the recommendation model based on immediate user feedback, which is neglected in traditional methods. The existing works have two significant drawbacks. Firstly, inefficient ranking function to produce the Top-N recommendation list. Secondly, focusing on recommendation accuracy and inattention to other evaluation metrics such as diversity. This paper proposes a deep reinforcement learning based recommendation system by utilizing Actor-Critic architecture to model dynamic users' interaction with the recommender agent and maximize the expected long-term reward. Furthermore, we propose utilizing Spotify's ANNoy algorithm to find the most similar items to generated action by actor-network. After that, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics