SOLAR: SVD-Optimized Lifelong Attention for Recommendation
Chenghao Zhang, Chao Feng, Yuanhao Pu, Xunyong Yang, Wenhui Yu, Xiang Li, Yongqi Liu, Lantao Hu, Kaiqiao Zhan, Han Li, Kun Gai

TL;DR
This paper introduces SOLAR, a novel low-rank SVD-based attention mechanism that reduces computational complexity in long sequence modeling, enabling scalable and efficient recommendation systems with improved performance.
Contribution
It presents SVD-Attention, a lossless low-rank approximation preserving softmax, and integrates it into SOLAR, a framework for scalable lifelong attention in recommendation systems.
Findings
SOLAR achieves 0.68% Video Views gain in online recommendation.
SVD-Attention reduces attention complexity from O(N^2 d) to O(N d r).
Framework supports sequences of ten-thousand scale without filtering.
Abstract
Attention mechanism remains the defining operator in Transformers since it provides expressive global credit assignment, yet its time and memory cost in sequence length makes long-context modeling expensive and often forces truncation or other heuristics. Linear attention reduces complexity to by reordering computation through kernel feature maps, but this reformulation drops the softmax mechanism and shifts the attention score distribution. In recommender systems, low-rank structure in matrices is not a rare case, but rather the default inductive bias in its representation learning, particularly explicit in the user behavior sequence modeling. Leveraging this structure, we introduce SVD-Attention, which is theoretically lossless on low-rank matrices and preserves softmax while reducing attention complexity from to . With SVD-Attention, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques
