Learning-To-Embed: Adopting Transformer based models for E-commerce Products Representation Learning
Lakshya Kumar, Sreekanth Vempati

TL;DR
This paper explores the use of transformer-based models like BERT, RoBERTa, ALBERT, and XLNET to learn product representations from user sessions in e-commerce, improving tasks like product recommendation and ranking.
Contribution
It introduces a novel pre-training approach for transformer models using user session data for product embedding in e-commerce, outperforming traditional methods.
Findings
XLNET achieves highest performance with MRR of 0.5 for NPR
Transformer models outperform Word2Vec baseline
Pre-trained models can be fine-tuned for various downstream tasks
Abstract
Learning low-dimensional representation for large number of products present in an e-commerce catalogue plays a vital role as they are helpful in tasks like product ranking, product recommendation, finding similar products, modelling user-behaviour etc. Recently, a lot of tasks in the NLP field are getting tackled using the Transformer based models and these deep models are widely applicable in the industries setting to solve various problems. With this motivation, we apply transformer based model for learning contextual representation of products in an e-commerce setting. In this work, we propose a novel approach of pre-training transformer based model on a users generated sessions dataset obtained from a large fashion e-commerce platform to obtain latent product representation. Once pre-trained, we show that the low-dimension representation of the products can be obtained given the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Text and Document Classification Technologies · Web Data Mining and Analysis
MethodsMulti-Head Attention · Attention Is All You Need · LAMB · Label Smoothing · Softmax · Layer Normalization · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · SentencePiece
