Position-Aware Sequential Attention for Accurate Next Item Recommendations
Timur Nabiev, Evgeny Frolov

TL;DR
This paper introduces a kernelized self-attention mechanism for sequential models that disentangles positional information from item semantics, leading to improved accuracy in next-item recommendation tasks.
Contribution
The authors propose a novel positional kernel attention that enhances sequential modeling by directly modulating attention weights based on position, outperforming traditional additive positional embeddings.
Findings
Consistently outperforms baseline models on next-item prediction benchmarks.
Enables adaptive multi-scale sequential modeling.
Improves the sensitivity of attention to temporal order.
Abstract
Sequential self-attention models usually rely on additive positional embeddings, which inject positional information into item representations at the input. In the absence of positional signals, the attention block is permutation-equivariant over sequence positions and thus has no intrinsic notion of temporal order beyond causal masking. We argue that additive positional embeddings make the attention mechanism only superficially sensitive to sequence order: positional information is entangled with item embedding semantics, propagates weakly in deep architectures, and limits the ability to capture rich sequential patterns. To address these limitations, we introduce a kernelized self-attention mechanism, where a learnable positional kernel operates purely in the position space, disentangled from semantic similarity, and directly modulates attention weights. When applied per attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning · Recommender Systems and Techniques
