UserBERT: Modeling Long- and Short-Term User Preferences via Self-Supervision
Tianyu Li, Ali Cevahir, Derek Cho, Hao Gong, DuyKhuong Nguyen, Bjorn, Stenger

TL;DR
UserBERT is a self-supervised model that captures both long- and short-term user preferences from e-commerce data, improving performance on various downstream tasks without requiring labeled data.
Contribution
The paper introduces a novel extension of BERT for user behavior modeling, integrating long- and short-term preferences in a unified self-supervised framework.
Findings
Significant improvements in downstream task performance
Effective modeling of diverse user behavior sequences
Elimination of labeled data requirement for pre-training
Abstract
E-commerce platforms generate vast amounts of customer behavior data, such as clicks and purchases, from millions of unique users every day. However, effectively using this data for behavior understanding tasks is challenging because there are usually not enough labels to learn from all users in a supervised manner. This paper extends the BERT model to e-commerce user data for pre-training representations in a self-supervised manner. By viewing user actions in sequences as analogous to words in sentences, we extend the existing BERT model to user behavior data. Further, our model adopts a unified structure to simultaneously learn from long-term and short-term user behavior, as well as user attributes. We propose methods for the tokenization of different types of user behavior sequences, the generation of input representation vectors, and a novel pretext task to enable the pre-trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Context-Aware Activity Recognition Systems · Innovative Human-Technology Interaction
MethodsAttention Is All You Need · Linear Layer · Softmax · Layer Normalization · Multi-Head Attention · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Dropout · Attention Dropout
