Non-invasive Self-attention for Side Information Fusion in Sequential Recommendation
Chang Liu, Xiaoguang Li, Guohao Cai, Zhenhua Dong, Hong Zhu, Lifeng, Shang

TL;DR
This paper introduces NOVA, a non-invasive self-attention mechanism that effectively incorporates side information into sequential recommendation models based on BERT, improving performance without significant computational costs.
Contribution
The paper proposes NOVA, a novel non-invasive self-attention method that leverages side information in BERT-based recommender systems, addressing limitations of naive fusion approaches.
Findings
NOVA-BERT outperforms state-of-the-art models on multiple datasets.
The method achieves these improvements with negligible additional computational overhead.
Incorporating side information via NOVA enhances recommendation accuracy.
Abstract
Sequential recommender systems aim to model users' evolving interests from their historical behaviors, and hence make customized time-relevant recommendations. Compared with traditional models, deep learning approaches such as CNN and RNN have achieved remarkable advancements in recommendation tasks. Recently, the BERT framework also emerges as a promising method, benefited from its self-attention mechanism in processing sequential data. However, one limitation of the original BERT framework is that it only considers one input source of the natural language tokens. It is still an open question to leverage various types of information under the BERT framework. Nonetheless, it is intuitively appealing to utilize other side information, such as item category or tag, for more comprehensive depictions and better recommendations. In our pilot experiments, we found naive approaches, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Advanced Graph Neural Networks
MethodsLinear Layer · Residual Connection · Adam · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Weight Decay · Multi-Head Attention · Dense Connections · Softmax · Layer Normalization
