SPM: Structured Pretraining and Matching Architectures for Relevance Modeling in Meituan Search
Wen Zan, Yaopeng Han, Xiaotian Jiang, Yao Xiao, Yang Yang, Dayao Chen,, Sheng Chen

TL;DR
This paper introduces a two-stage pretraining and matching architecture tailored for relevance modeling in e-commerce search with structured documents, significantly improving search relevance accuracy on Meituan.
Contribution
It proposes a novel pretraining method and a relevance matching architecture that effectively handle structured data and domain-specific query information, outperforming existing methods.
Findings
Improved relevance matching performance verified by offline experiments.
Online A/B tests show significant user engagement improvements.
Model deployed in production for over a year with sustained benefits.
Abstract
In e-commerce search, relevance between query and documents is an essential requirement for satisfying user experience. Different from traditional e-commerce platforms that offer products, users search on life service platforms such as Meituan mainly for product providers, which usually have abundant structured information, e.g. name, address, category, thousands of products. Modeling search relevance with these rich structured contents is challenging due to the following issues: (1) there is language distribution discrepancy among different fields of structured document, making it difficult to directly adopt off-the-shelf pretrained language model based methods like BERT. (2) different fields usually have different importance and their length vary greatly, making it difficult to extract document information helpful for relevance matching. To tackle these issues, in this paper we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Adam · Softmax · Linear Layer · Residual Connection · Dense Connections · Dropout
