Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce
Juntao Li, Chang Liu, Jian Wang, Lidong Bing, Hongsong Li, Xiaozhong, Liu, Dongyan Zhao, Rui Yan

TL;DR
This paper introduces a new cross-lingual set-to-description retrieval task for cross-border e-commerce, addressing the challenge of matching product attribute sets with descriptions across languages in a low-resource setting.
Contribution
It proposes a novel cross-lingual matching network (CLMN) with context-dependent mapping on BERT, and provides a high-quality dataset for this challenging task.
Findings
CLMN outperforms baseline models on the dataset.
Context-dependent cross-lingual mapping improves retrieval accuracy.
The dataset serves as a new benchmark for low-resource cross-lingual retrieval.
Abstract
With the prosperous of cross-border e-commerce, there is an urgent demand for designing intelligent approaches for assisting e-commerce sellers to offer local products for consumers from all over the world. In this paper, we explore a new task of cross-lingual information retrieval, i.e., cross-lingual set-to-description retrieval in cross-border e-commerce, which involves matching product attribute sets in the source language with persuasive product descriptions in the target language. We manually collect a new and high-quality paired dataset, where each pair contains an unordered product attribute set in the source language and an informative product description in the target language. As the dataset construction process is both time-consuming and costly, the new dataset only comprises of 13.5k pairs, which is a low-resource setting and can be viewed as a challenging testbed for model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Web Data Mining and Analysis · Text and Document Classification Technologies
MethodsLinear Layer · Weight Decay · Softmax · Adam · Multi-Head Attention · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Linear Warmup With Linear Decay · Dense Connections
