Advancing Re-Ranking with Multimodal Fusion and Target-Oriented Auxiliary Tasks in E-Commerce Search
Enqiang Xu, Xinhui Li, Zhigong Zhou, Jiahao Ji, Jinyuan Zhao, Dadong, Miao, Songlin Wang, Lin Liu, Sulong Xu

TL;DR
This paper introduces ARMMT, a novel multimodal re-ranking model for e-commerce search that combines textual and visual data with auxiliary tasks to improve recommendation accuracy and user engagement.
Contribution
It presents a new attention-based multimodal fusion technique and auxiliary tasks, advancing the integration of multimodal information in e-commerce re-ranking models.
Findings
Achieved a 0.22% increase in Conversion Rate (CVR)
State-of-the-art performance in multimodal information integration
Enhanced product attribute understanding and personalization
Abstract
In the rapidly evolving field of e-commerce, the effectiveness of search re-ranking models is crucial for enhancing user experience and driving conversion rates. Despite significant advancements in feature representation and model architecture, the integration of multimodal information remains underexplored. This study addresses this gap by investigating the computation and fusion of textual and visual information in the context of re-ranking. We propose \textbf{A}dvancing \textbf{R}e-Ranking with \textbf{M}ulti\textbf{m}odal Fusion and \textbf{T}arget-Oriented Auxiliary Tasks (ARMMT), which integrates an attention-based multimodal fusion technique and an auxiliary ranking-aligned task to enhance item representation and improve targeting capabilities. This method not only enriches the understanding of product attributes but also enables more precise and personalized recommendations.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Web Data Mining and Analysis · Spam and Phishing Detection
