Attention Calibration for Transformer-based Sequential Recommendation
Peilin Zhou, Qichen Ye, Yueqi Xie, Jingqi Gao, Shoujin Wang, Jae Boum, Kim, Chenyu You, Sunghun Kim

TL;DR
This paper identifies issues with attention weight assignment in transformer-based sequential recommendation models and proposes a calibration framework, AC-TSR, to improve recommendation accuracy by correcting attention weights.
Contribution
The paper introduces AC-TSR, a novel framework with spatial and adversarial calibrators that directly calibrate attention weights in transformer-based SR models, enhancing recommendation performance.
Findings
AC-TSR significantly improves recommendation accuracy on four benchmark datasets.
The proposed calibrators effectively correct attention weight misassignments.
AC-TSR is easily integrated into existing transformer-based SR models.
Abstract
Transformer-based sequential recommendation (SR) has been booming in recent years, with the self-attention mechanism as its key component. Self-attention has been widely believed to be able to effectively select those informative and relevant items from a sequence of interacted items for next-item prediction via learning larger attention weights for these items. However, this may not always be true in reality. Our empirical analysis of some representative Transformer-based SR models reveals that it is not uncommon for large attention weights to be assigned to less relevant items, which can result in inaccurate recommendations. Through further in-depth analysis, we find two factors that may contribute to such inaccurate assignment of attention weights: sub-optimal position encoding and noisy input. To this end, in this paper, we aim to address this significant yet challenging gap in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Radiomics and Machine Learning in Medical Imaging · Advanced Graph Neural Networks
