Towards Better Query Classification with Multi-Expert Knowledge Condensation in JD Ads Search
Kun-Peng Ning, Ming Pang, Zheng Fang, Xue Jiang, Xi-Wei Zhao,, Chang-Ping Peng, Zhan-Gang Lin, Jing-He Hu, Jing-Ping Shao

TL;DR
This paper introduces a knowledge condensation framework that enhances FastText query classification in JD Ads Search by leveraging BERT for data retrieval and multi-expert learning, balancing accuracy and efficiency.
Contribution
The paper proposes a novel knowledge distillation method combining BERT-based data retrieval and multi-expert learning to improve FastText classification performance under low latency constraints.
Findings
Improved classification accuracy on low-frequency queries.
Effective data augmentation using BERT for better training.
Validated through offline and online experiments in JD search.
Abstract
Search query classification, as an effective way to understand user intents, is of great importance in real-world online ads systems. To ensure a lower latency, a shallow model (e.g. FastText) is widely used for efficient online inference. However, the representation ability of the FastText model is insufficient, resulting in poor classification performance, especially on some low-frequency queries and tailed categories. Using a deeper and more complex model (e.g. BERT) is an effective solution, but it will cause a higher online inference latency and more expensive computing costs. Thus, how to juggle both inference efficiency and classification performance is obviously of great practical importance. To overcome this challenge, in this paper, we propose knowledge condensation (KC), a simple yet effective knowledge distillation framework to boost the classification performance of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Advanced Image and Video Retrieval Techniques · Text and Document Classification Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Knowledge Distillation · WordPiece · fastText · Attention Dropout · Linear Warmup With Linear Decay
