Intent-based Product Collections for E-commerce using Pretrained Language Models
Hiun Kim, Jisu Jeong, Kyung-Min Kim, Dongjun Lee, Hyun Dong Lee,, Dongpil Seo, Jeeseung Han, Dong Wook Park, Ji Ae Heo, Rak Yeong Kim

TL;DR
This paper presents a method using pretrained language models, specifically BERT with triplet loss, to automatically generate product collections based on customer intent, improving relevance and diversity over manual methods.
Contribution
The paper introduces a novel approach leveraging pretrained language models with triplet loss, search-based negative sampling, and category-wise augmentation for intent-based product collection in e-commerce.
Findings
Outperforms baseline models in offline intent matching tasks.
Increases click-through rate (CTR) and conversion rate (CVR) in online tests.
Enhances order diversity in product collections.
Abstract
Building a shopping product collection has been primarily a human job. With the manual efforts of craftsmanship, experts collect related but diverse products with common shopping intent that are effective when displayed together, e.g., backpacks, laptop bags, and messenger bags for freshman bag gifts. Automatically constructing a collection requires an ML system to learn a complex relationship between the customer's intent and the product's attributes. However, there have been challenging points, such as 1) long and complicated intent sentences, 2) rich and diverse product attributes, and 3) a huge semantic gap between them, making the problem difficult. In this paper, we use a pretrained language model (PLM) that leverages textual attributes of web-scale products to make intent-based product collections. Specifically, we train a BERT with triplet loss by setting an intent sentence to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Text and Document Classification Technologies · Web Data Mining and Analysis
MethodsAttention Is All You Need · Linear Layer · Softmax · Weight Decay · Linear Warmup With Linear Decay · Residual Connection · WordPiece · Attention Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam
