Neural Graph Matching for Video Retrieval in Large-Scale Video-driven E-commerce
Houye Ji, Ye Tang, Zhaoxin Chen, Lixi Deng, Jun Hu, Lei Su

TL;DR
This paper introduces a novel bi-level Graph Matching Network (GMN) that models user-video and user-item interactions as a graph matching problem to improve video retrieval in large-scale video-driven e-commerce, demonstrating significant performance gains.
Contribution
The paper proposes a new graph matching approach with a dual graph model and bi-level matching to better understand user preferences in video-driven e-commerce.
Findings
GMN outperforms state-of-the-art methods with +1.9% AUC and +7.15% CTR improvements.
The dual graph model effectively captures user-video and user-item interactions.
Bi-level graph matching enhances user embedding quality for retrieval tasks.
Abstract
With the rapid development of the short video industry, traditional e-commerce has encountered a new paradigm, video-driven e-commerce, which leverages attractive videos for product showcases and provides both video and item services for users. Benefitting from the dynamic and visualized introduction of items,video-driven e-commerce has shown huge potential in stimulating consumer confidence and promoting sales. In this paper, we focus on the video retrieval task, facing the following challenges: (1) Howto handle the heterogeneities among users, items, and videos? (2)How to mine the complementarity between items and videos for better user understanding? In this paper, we first leverage the dual graph to model the co-existing of user-video and user-item interactions in video-driven e-commerce and innovatively reduce user preference understanding to a graph matching problem. To solve it,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Video Analysis and Summarization
MethodsFocus
