QUEACO: Borrowing Treasures from Weakly-labeled Behavior Data for Query Attribute Value Extraction
Danqing Zhang, Zheng Li, Tianyu Cao, Chen Luo, Tony Wu, Hanqing Lu,, Yiwei Song, Bing Yin, Tuo Zhao, Qiang Yang

TL;DR
This paper introduces QUEACO, a unified system for query attribute value extraction in e-commerce, leveraging weakly-labeled data and a teacher-student network to improve both entity recognition and normalization with less supervision.
Contribution
The paper proposes a novel unified approach combining NER and AVN phases, utilizing a teacher-student network and weakly-labeled data to enhance extraction performance in e-commerce search.
Findings
Effective in real-world large-scale e-commerce dataset
Improves extraction accuracy with less supervision
Demonstrates the benefit of joint NER and AVN modeling
Abstract
We study the problem of query attribute value extraction, which aims to identify named entities from user queries as diverse surface form attribute values and afterward transform them into formally canonical forms. Such a problem consists of two phases: {named entity recognition (NER)} and {attribute value normalization (AVN)}. However, existing works only focus on the NER phase but neglect equally important AVN. To bridge this gap, this paper proposes a unified query attribute value extraction system in e-commerce search named QUEACO, which involves both two phases. Moreover, by leveraging large-scale weakly-labeled behavior data, we further improve the extraction performance with less supervision cost. Specifically, for the NER phase, QUEACO adopts a novel teacher-student network, where a teacher network that is trained on the strongly-labeled data generates pseudo-labels to refine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
