HIT Model: A Hierarchical Interaction-Enhanced Two-Tower Model for Pre-Ranking Systems
Haoqiang Yang, Congde Yuan, Kun Bai, Mengzhuo Guo, Wei Yang, Chao Zhou

TL;DR
The HIT model enhances the traditional two-tower pre-ranking architecture by integrating hierarchical interaction components, improving relevance and business metrics without sacrificing efficiency, and has been successfully deployed at Tencent.
Contribution
It introduces a hierarchical interaction-enhanced two-tower architecture with generators and multi-head representers for better modeling of user-ad relationships.
Findings
Significantly outperforms baselines in relevance metrics.
Achieves 1.66% increase in GMV and 1.55% ROI in online tests.
Maintains similar latency to vanilla two-tower models.
Abstract
Online display advertising platforms rely on pre-ranking systems to efficiently filter and prioritize candidate ads from large corpora, balancing relevance to users with strict computational constraints. The prevailing two-tower architecture, though highly efficient due to its decoupled design and pre-caching, suffers from cross-domain interaction and coarse similarity metrics, undermining its capacity to model complex user-ad relationships. In this study, we propose the Hierarchical Interaction-Enhanced Two-Tower (HIT) model, a new architecture that augments the two-tower paradigm with two key components: that pre-generate holistic vectors incorporating coarse-grained user-ad interactions through a dual-generator framework with a cosine-similarity-based generation loss as the training objective, and that project embeddings into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBusiness Process Modeling and Analysis · Big Data and Business Intelligence
