Future-Conditioned Recommendations with Multi-Objective Controllable   Decision Transformer

Chongming Gao; Kexin Huang; Ziang Fei; Jiaju Chen; Jiawei Chen,; Jianshan Sun; Shuchang Liu; Qingpeng Cai; Peng Jiang

arXiv:2501.07212·cs.IR·January 14, 2025

Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer

Chongming Gao, Kexin Huang, Ziang Fei, Jiaju Chen, Jiawei Chen,, Jianshan Sun, Shuchang Liu, Qingpeng Cai, Peng Jiang

PDF

TL;DR

This paper introduces MocDT, a novel offline RL model that enables future-conditioned, multi-objective recommendations by directly specifying future goals, improving control and adaptability in recommender systems.

Contribution

The paper proposes a future-conditioned, multi-objective recommendation approach using MocDT, a new offline RL model that maps multiple objectives to item sequences for better control.

Findings

01

MocDT effectively generates item sequences aligned with specified objectives.

02

The model maintains competitive performance across various objectives.

03

It demonstrates improved flexibility in multi-objective recommendation scenarios.

Abstract

Securing long-term success is the ultimate aim of recommender systems, demanding strategies capable of foreseeing and shaping the impact of decisions on future user satisfaction. Current recommendation strategies grapple with two significant hurdles. Firstly, the future impacts of recommendation decisions remain obscured, rendering it impractical to evaluate them through direct optimization of immediate metrics. Secondly, conflicts often emerge between multiple objectives, like enhancing accuracy versus exploring diverse recommendations. Existing strategies, trapped in a "training, evaluation, and retraining" loop, grow more labor-intensive as objectives evolve. To address these challenges, we introduce a future-conditioned strategy for multi-objective controllable recommendations, allowing for the direct specification of future objectives and empowering the model to generate item…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Absolute Position Encodings · Adam · Residual Connection · Dropout · Softmax · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer