BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer
Fei Sun, Jun Liu, Jian Wu, Changhua Pei, Xiao Lin, Wenwu Ou, and Peng, Jiang

TL;DR
BERT4Rec introduces a bidirectional transformer-based model trained with a Cloze task to better capture users' evolving preferences for recommendation systems, outperforming previous unidirectional models.
Contribution
The paper presents BERT4Rec, a novel bidirectional transformer model for sequential recommendation, trained with a Cloze task to utilize both past and future context.
Findings
BERT4Rec outperforms state-of-the-art models on four benchmark datasets.
Bidirectional modeling improves recommendation accuracy.
Training with the Cloze task enhances the model's ability to learn from both directions.
Abstract
Modeling users' dynamic and evolving preferences from their historical behaviors is challenging and crucial for recommendation systems. Previous methods employ sequential neural networks (e.g., Recurrent Neural Network) to encode users' historical interactions from left to right into hidden representations for making recommendations. Although these methods achieve satisfactory results, they often assume a rigidly ordered sequence which is not always practical. We argue that such left-to-right unidirectional architectures restrict the power of the historical sequence representations. For this purpose, we introduce a Bidirectional Encoder Representations from Transformers for sequential Recommendation (BERT4Rec). However, jointly conditioning on both left and right context in deep bidirectional model would make the training become trivial since each item can indirectly "see the target…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Machine Learning in Healthcare · Topic Modeling
