LinRec: Linear Attention Mechanism for Long-term Sequential Recommender   Systems

Langming Liu; Xiangyu Zhao; Chi Zhang; Jingtong Gao; Wanyu Wang; Wenqi; Fan; Yiqi Wang; Ming He; Zitao Liu; Qing Li

arXiv:2411.01537·cs.IR·November 5, 2024

LinRec: Linear Attention Mechanism for Long-term Sequential Recommender Systems

Langming Liu, Xiangyu Zhao, Chi Zhang, Jingtong Gao, Wanyu Wang, Wenqi, Fan, Yiqi Wang, Ming He, Zitao Liu, Qing Li

PDF

1 Repo

TL;DR

LinRec introduces a linear attention mechanism for long-term sequential recommender systems, significantly reducing computational costs while maintaining or improving recommendation performance.

Contribution

The paper proposes LinRec, a novel L2-Normalized Linear Attention that achieves linear complexity and preserves the effectiveness of traditional attention in Transformer-based SRSs.

Findings

01

LinRec achieves linear complexity in attention computation.

02

Experiments show LinRec outperforms or matches state-of-the-art models.

03

Significant improvements in time and memory efficiency.

Abstract

Transformer models have achieved remarkable success in sequential recommender systems (SRSs). However, computing the attention matrix in traditional dot-product attention mechanisms results in a quadratic complexity with sequence lengths, leading to high computational costs for long-term sequential recommendation. Motivated by the above observation, we propose a novel L2-Normalized Linear Attention for the Transformer-based Sequential Recommender Systems (LinRec), which theoretically improves efficiency while preserving the learning capabilities of the traditional dot-product attention. Specifically, by thoroughly examining the equivalence conditions of efficient attention mechanisms, we show that LinRec possesses linear complexity while preserving the property of attention mechanisms. In addition, we reveal its latent efficiency properties by interpreting the proposed LinRec mechanism…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Applied-Machine-Learning-Lab/LinRec
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Attention Is All You Need · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Dropout · Absolute Position Encodings