Causal Decision Transformer for Recommender Systems via Offline   Reinforcement Learning

Siyu Wang; Xiaocong Chen; Dietmar Jannach; Lina Yao

arXiv:2304.07920·cs.IR·August 29, 2023·1 cites

Causal Decision Transformer for Recommender Systems via Offline Reinforcement Learning

Siyu Wang, Xiaocong Chen, Dietmar Jannach, Lina Yao

PDF

Open Access

TL;DR

This paper introduces CDT4Rec, a causal decision transformer for offline reinforcement learning in recommender systems, leveraging causality and transformer architecture to improve learning from large offline datasets.

Contribution

The paper presents a novel offline RL model using transformers that captures causal relationships in user behavior for recommender systems, addressing data inefficiency issues.

Findings

01

Outperforms existing methods on six real-world datasets

02

Effectively captures long-term dependencies in user data

03

Demonstrates superior online simulation results

Abstract

Reinforcement learning-based recommender systems have recently gained popularity. However, the design of the reward function, on which the agent relies to optimize its recommendation policy, is often not straightforward. Exploring the causality underlying users' behavior can take the place of the reward function in guiding the agent to capture the dynamic interests of users. Moreover, due to the typical limitations of simulation environments (e.g., data inefficiency), most of the work cannot be broadly applied in large-scale situations. Although some works attempt to convert the offline dataset into a simulator, data inefficiency makes the learning process even slower. Because of the nature of reinforcement learning (i.e., learning by interaction), it cannot collect enough data to train during a single interaction. Furthermore, traditional reinforcement learning algorithms do not have a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Digital Mental Health Interventions