Causal Decision Transformer for Recommender Systems via Offline Reinforcement Learning
Siyu Wang, Xiaocong Chen, Dietmar Jannach, Lina Yao

TL;DR
This paper introduces CDT4Rec, a causal decision transformer for offline reinforcement learning in recommender systems, leveraging causality and transformer architecture to improve learning from large offline datasets.
Contribution
The paper presents a novel offline RL model using transformers that captures causal relationships in user behavior for recommender systems, addressing data inefficiency issues.
Findings
Outperforms existing methods on six real-world datasets
Effectively captures long-term dependencies in user data
Demonstrates superior online simulation results
Abstract
Reinforcement learning-based recommender systems have recently gained popularity. However, the design of the reward function, on which the agent relies to optimize its recommendation policy, is often not straightforward. Exploring the causality underlying users' behavior can take the place of the reward function in guiding the agent to capture the dynamic interests of users. Moreover, due to the typical limitations of simulation environments (e.g., data inefficiency), most of the work cannot be broadly applied in large-scale situations. Although some works attempt to convert the offline dataset into a simulator, data inefficiency makes the learning process even slower. Because of the nature of reinforcement learning (i.e., learning by interaction), it cannot collect enough data to train during a single interaction. Furthermore, traditional reinforcement learning algorithms do not have a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Digital Mental Health Interventions
