HCMRM: A High-Consistency Multimodal Relevance Model for Search Ads
Guobing Gan, Kaiming Gao, Li Wang, Shen Jiang, Peng Jiang

TL;DR
This paper introduces HCMRM, a multimodal relevance model that improves query-video relevance matching for search ads, leading to better ad relevance and increased revenue in a real-world platform.
Contribution
It proposes a high-consistency multimodal relevance model with a novel pre-training and fine-tuning strategy for improved ranking in search advertising.
Findings
6.1% reduction in irrelevant ads
1.4% increase in ad revenue
Effective alignment of visual signals and text in relevance modeling
Abstract
Search advertising is essential for merchants to reach the target users on short video platforms. Short video ads aligned with user search intents are displayed through relevance matching and bid ranking mechanisms. This paper focuses on improving query-to-video relevance matching to enhance the effectiveness of ranking in ad systems. Recent vision-language pre-training models have demonstrated promise in various multimodal tasks. However, their contribution to downstream query-video relevance tasks is limited, as the alignment between the pair of visual signals and text differs from the modeling of the triplet of the query, visual signals, and video text. In addition, our previous relevance model provides limited ranking capabilities, largely due to the discrepancy between the binary cross-entropy fine-tuning objective and the ranking objective. To address these limitations, we design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Marketing and Social Media · Sentiment Analysis and Opinion Mining · Web Data Mining and Analysis
