Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems
Hailing Cheng

TL;DR
This paper introduces two novel causal attention architectures for generative recommender systems that improve efficiency and performance by explicitly modeling item-action dependencies, reducing sequence complexity and training time.
Contribution
It proposes two new architectures, AttnLFA and AttnMVP, that eliminate interleaved dependencies, explicitly encode causality, and outperform traditional interleaving methods in large-scale recommendation tasks.
Findings
AttnLFA and AttnMVP outperform interleaved baselines in loss and entropy.
Both models reduce sequence complexity by 50% relative to interleaving.
Training time decreases by up to 23%.
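The arithmetic behind the sequence-length claim can be sketched as follows. This is a back-of-the-envelope illustration, not a measurement from the paper: the function names and the example history length of 512 are assumptions, and the count covers only query-key pairs in full self-attention, ignoring causal masking and every non-attention cost.

```python
def interleaved_length(num_interactions: int) -> int:
    """Sequence length when each interaction contributes an item token and an action token."""
    return 2 * num_interactions


def attention_pairs(seq_len: int) -> int:
    """Number of query-key pairs scored by full self-attention (grows quadratically)."""
    return seq_len * seq_len


# Hypothetical user history of 512 interactions (illustrative value only).
n = 512
interleaved = interleaved_length(n)      # item and action tokens interleaved
non_interleaved = n                      # one position per interaction

length_ratio = non_interleaved / interleaved
pair_ratio = attention_pairs(interleaved) / attention_pairs(non_interleaved)

print(f"length: {interleaved} -> {non_interleaved} (x{length_ratio})")
print(f"attention pairs: x{pair_ratio} more when interleaved")
```

Halving the sequence length cuts the number of scored pairs by a factor of four, but this does not translate directly into wall-clock savings, since attention is only one part of the training cost. The reported training-time reduction of up to 23% is the paper's own figure and is not derived from this calculation.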
Abstract
Generative Recommender Systems (GR) increasingly model user behavior as a sequence generation task by interleaving item and action tokens. While effective, this formulation introduces significant structural and computational inefficiencies: it doubles sequence length, incurs quadratic overhead, and relies on implicit attention to recover the causal relationship between an item and its associated action. Furthermore, interleaving heterogeneous tokens forces the Transformer to disentangle semantically incompatible signals, leading to increased attention noise and reduced representation efficiency. In this work, we propose a principled reformulation of generative recommendation that aligns sequence modeling with underlying causal structures and attention theory. We demonstrate that current interleaving mechanisms act as inefficient proxies for similarity-weighted action pooling. To address…
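One way to read "similarity-weighted action pooling" is as causal attention in which queries and keys come from items while values come from the actions taken on them. The sketch below is a minimal illustration under that reading only. It is not AttnLFA or AttnMVP (the abstract is truncated before either is described), and the function name, the use of raw item vectors as queries and keys, and the zero vector at the first position are all assumptions.

```python
import math


def causal_action_pooling(item_vecs, action_vecs):
    """Hypothetical sketch: for each position t, average the actions taken on
    earlier items (s < t), weighted by softmax of scaled item-item dot products.

    The mask is strictly causal, so position t never sees its own action.
    """
    dim = len(action_vecs[0])
    pooled = []
    for t, query in enumerate(item_vecs):
        if t == 0:
            # No history yet: assumption is to emit a zero vector.
            pooled.append([0.0] * dim)
            continue
        scale = math.sqrt(len(query))
        scores = [
            sum(q * k for q, k in zip(query, item_vecs[s])) / scale
            for s in range(t)
        ]
        # Numerically stable softmax over the earlier positions.
        top = max(scores)
        exps = [math.exp(x - top) for x in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        pooled.append(
            [sum(w * action_vecs[s][d] for s, w in enumerate(weights))
             for d in range(dim)]
        )
    return pooled


# Toy data (illustrative only): three items, one-dimensional action signal.
items = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
actions = [[1.0], [0.0], [1.0]]
pooled = causal_action_pooling(items, actions)
print(pooled)
```

In this toy setup each position holds a single item, and the action history is pooled into that position rather than occupying tokens of its own, which is why the sequence stays at one token per interaction.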
Taxonomy
Topics: Recommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Topic Modeling
