Not All Candidates are Created Equal: A Heterogeneity-Aware Approach to Pre-ranking in Recommender Systems
Pengfei Tong, Siyuan Chen, Chenwei Zhang, Bo Wang, Qi Pi, Pixun Li, Zuotao Liu

TL;DR
This paper introduces Heterogeneity-Aware Adaptive Pre-ranking (HAP), a novel framework that improves pre-ranking in recommender systems by addressing sample heterogeneity and gradient conflicts, leading to better efficiency and effectiveness.
Contribution
The paper proposes HAP, a unified method that separates easy and hard samples, applies adaptive models, and reduces computational costs in pre-ranking for large-scale recommender systems.
Findings
HAP improves pre-ranking accuracy and efficiency in industrial settings.
Deployment in Toutiao shows up to 0.4% increase in user engagement.
A large-scale dataset for studying candidate heterogeneity is released.
Abstract
Most large-scale recommender systems follow a multi-stage cascade of retrieval, pre-ranking, ranking, and re-ranking. A key challenge at the pre-ranking stage arises from the heterogeneity of training instances sampled from coarse-grained retrieval results, fine-grained ranking signals, and exposure feedback. Our analysis reveals that prevailing pre-ranking methods, which indiscriminately mix heterogeneous samples, suffer from gradient conflicts: hard samples dominate training while easy ones remain underutilized, leading to suboptimal performance. We further show that the common practice of uniformly scaling model complexity across all samples is inefficient, as it overspends computation on easy cases and slows training without proportional gains. To address these limitations, this paper presents Heterogeneity-Aware Adaptive Pre-ranking (HAP), a unified framework that mitigates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Mobile Crowdsensing and Crowdsourcing · Machine Learning and Data Classification
