Unleashing the Potential of Two-Tower Models: Diffusion-Based Cross-Interaction for Large-Scale Matching
Yihan Wang, Fei Xiong, Zhexin Han, Qi Song, Kaiqiao Zhan, Ben Wang

TL;DR
This paper introduces a novel two-tower matching model that incorporates diffusion-based cross-interaction and temporal behavior analysis, significantly improving large-scale matching accuracy in recommendation and search systems.
Contribution
It proposes a diffusion-based cross-interaction architecture within the two-tower framework, enhancing user-item representation interaction and next positive intent reconstruction.
Findings
Outperforms state-of-the-art two-tower models on real-world datasets
Diffusion module effectively reconstructs item representations
Temporal drift analysis improves next positive intent prediction
Abstract
Two-tower models are widely adopted in the industrial-scale matching stage across a broad range of application domains, such as content recommendations, advertisement systems, and search engines. This model efficiently handles large-scale candidate item screening by separating user and item representations. However, the decoupling network also leads to a neglect of potential information interaction between the user and item representations. Current state-of-the-art (SOTA) approaches include adding a shallow fully connected layer(i.e., COLD), which is limited by performance and can only be used in the ranking stage. For performance considerations, another approach attempts to capture historical positive interaction information from the other tower by regarding them as the input features(i.e., DAT). Later research showed that the gains achieved by this method are still limited because of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
