Recommender Transformers with Behavior Pathways
Zhiyu Yao, Xinyang Chen, Sinan Wang, Qinyan Dai, Yumeng Li, Tanchao, Zhu, Mingsheng Long

TL;DR
This paper introduces Recommender Transformers with a novel Pathway Attention mechanism that dynamically identifies and activates pivotal user behavior pathways, significantly improving sequential recommendation accuracy.
Contribution
It proposes a new Pathway Attention mechanism for transformers that selectively focuses on key user behaviors, enhancing recommendation performance.
Findings
RETR achieves state-of-the-art results on seven real-world datasets.
The Pathway Attention mechanism effectively filters trivial behaviors.
RETR outperforms existing sequential recommendation models.
Abstract
Sequential recommendation requires the recommender to capture the evolving behavior characteristics from logged user behavior data for accurate recommendations. However, user behavior sequences are viewed as a script with multiple ongoing threads intertwined. We find that only a small set of pivotal behaviors can be evolved into the user's future action. As a result, the future behavior of the user is hard to predict. We conclude this characteristic for sequential behaviors of each user as the Behavior Pathway. Different users have their unique behavior pathways. Among existing sequential models, transformers have shown great capacity in capturing global-dependent characteristics. However, these models mainly provide a dense distribution over all previous behaviors using the self-attention mechanism, making the final predictions overwhelmed by the trivial behaviors not adjusted to each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Generative Adversarial Networks and Image Synthesis
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Label Smoothing · Softmax · Absolute Position Encodings · Dropout · Adam · Residual Connection · Byte Pair Encoding
