Alleviating Matthew Effect of Offline Reinforcement Learning in   Interactive Recommendation

Chongming Gao; Kexin Huang; Jiawei Chen; Yuan Zhang; Biao Li; Peng; Jiang; Shiqi Wang; Zhong Zhang; Xiangnan He

arXiv:2307.04571·cs.IR·July 11, 2023

Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation

Chongming Gao, Kexin Huang, Jiawei Chen, Yuan Zhang, Biao Li, Peng, Jiang, Shiqi Wang, Zhong Zhang, Xiangnan He

PDF

2 Repos

TL;DR

This paper introduces DORL, a novel offline reinforcement learning method for interactive recommendation that reduces the Matthew effect by promoting diversity and long-term user satisfaction.

Contribution

The paper proposes a debiased model-based offline RL approach that relaxes conservatism to mitigate popularity bias in recommendation systems.

Findings

01

DORL effectively captures user interests.

02

DORL alleviates the Matthew effect in recommendations.

03

Experimental results show improved diversity and satisfaction.

Abstract

Offline reinforcement learning (RL), a technology that offline learns a policy from logged data without the need to interact with online environments, has become a favorable choice in decision-making processes like interactive recommendation. Offline RL faces the value overestimation problem. To address it, existing methods employ conservatism, e.g., by constraining the learned policy to be close to behavior policies or punishing the rarely visited state-action pairs. However, when applying such offline RL to recommendation, it will cause a severe Matthew effect, i.e., the rich get richer and the poor get poorer, by promoting popular items or categories while suppressing the less popular ones. It is a notorious issue that needs to be addressed in practical recommender systems. In this paper, we aim to alleviate the Matthew effect in offline RL-based recommendation. Through theoretical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.