Short Text Pre-training with Extended Token Classification for E-commerce Query Understanding
Haoming Jiang, Tianyu Cao, Zheng Li, Chen Luo, Xianfeng Tang, Qingyu, Yin, Danqing Zhang, Rahul Goutam, Bing Yin

TL;DR
This paper introduces Extended Token Classification (ETC), a novel pre-training task for short text query understanding in e-commerce, improving upon traditional masked language models by extending input tokens rather than masking, leading to better intent inference.
Contribution
The paper proposes ETC, a new pre-training method tailored for short e-commerce queries, addressing the limitations of masked language models in such contexts.
Findings
ETC outperforms traditional MLM in query understanding tasks.
The approach improves semantic understanding of short search queries.
Experiments demonstrate enhanced performance in e-commerce search scenarios.
Abstract
E-commerce query understanding is the process of inferring the shopping intent of customers by extracting semantic meaning from their search queries. The recent progress of pre-trained masked language models (MLM) in natural language processing is extremely attractive for developing effective query understanding models. Specifically, MLM learns contextual text embedding via recovering the masked tokens in the sentences. Such a pre-training process relies on the sufficient contextual information. It is, however, less effective for search queries, which are usually short text. When applying masking to short search queries, most contextual information is lost and the intent of the search queries may be changed. To mitigate the above issues for MLM pre-training on search queries, we propose a novel pre-training task specifically designed for short text, called Extended Token Classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Topic Modeling · Data Quality and Management
MethodsAttention Is All You Need · Softmax · Linear Layer · Relative Position Encodings · Layer Normalization · Global-Local Attention · Position-Wise Feed-Forward Layer · Residual Connection · InfoNCE · Multi-Head Attention
