Learning-To-Embed: Adopting Transformer based models for E-commerce   Products Representation Learning

Lakshya Kumar; Sreekanth Vempati

arXiv:2212.03725·cs.IR·December 8, 2022·1 cites

Learning-To-Embed: Adopting Transformer based models for E-commerce Products Representation Learning

Lakshya Kumar, Sreekanth Vempati

PDF

Open Access

TL;DR

This paper explores the use of transformer-based models like BERT, RoBERTa, ALBERT, and XLNET to learn product representations from user sessions in e-commerce, improving tasks like product recommendation and ranking.

Contribution

It introduces a novel pre-training approach for transformer models using user session data for product embedding in e-commerce, outperforming traditional methods.

Findings

01

XLNET achieves highest performance with MRR of 0.5 for NPR

02

Transformer models outperform Word2Vec baseline

03

Pre-trained models can be fine-tuned for various downstream tasks

Abstract

Learning low-dimensional representation for large number of products present in an e-commerce catalogue plays a vital role as they are helpful in tasks like product ranking, product recommendation, finding similar products, modelling user-behaviour etc. Recently, a lot of tasks in the NLP field are getting tackled using the Transformer based models and these deep models are widely applicable in the industries setting to solve various problems. With this motivation, we apply transformer based model for learning contextual representation of products in an e-commerce setting. In this work, we propose a novel approach of pre-training transformer based model on a users generated sessions dataset obtained from a large fashion e-commerce platform to obtain latent product representation. Once pre-trained, we show that the low-dimension representation of the products can be obtained given the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Text and Document Classification Technologies · Web Data Mining and Analysis

MethodsMulti-Head Attention · Attention Is All You Need · LAMB · Label Smoothing · Softmax · Layer Normalization · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · SentencePiece