Not All Candidates are Created Equal: A Heterogeneity-Aware Approach to Pre-ranking in Recommender Systems

Pengfei Tong; Siyuan Chen; Chenwei Zhang; Bo Wang; Qi Pi; Pixun Li; Zuotao Liu

arXiv:2603.03770·cs.IR·March 5, 2026

Not All Candidates are Created Equal: A Heterogeneity-Aware Approach to Pre-ranking in Recommender Systems

Pengfei Tong, Siyuan Chen, Chenwei Zhang, Bo Wang, Qi Pi, Pixun Li, Zuotao Liu

PDF

Open Access

TL;DR

This paper introduces Heterogeneity-Aware Adaptive Pre-ranking (HAP), a novel framework that improves pre-ranking in recommender systems by addressing sample heterogeneity and gradient conflicts, leading to better efficiency and effectiveness.

Contribution

The paper proposes HAP, a unified method that separates easy and hard samples, applies adaptive models, and reduces computational costs in pre-ranking for large-scale recommender systems.

Findings

01

HAP improves pre-ranking accuracy and efficiency in industrial settings.

02

Deployment in Toutiao shows up to 0.4% increase in user engagement.

03

A large-scale dataset for studying candidate heterogeneity is released.

Abstract

Most large-scale recommender systems follow a multi-stage cascade of retrieval, pre-ranking, ranking, and re-ranking. A key challenge at the pre-ranking stage arises from the heterogeneity of training instances sampled from coarse-grained retrieval results, fine-grained ranking signals, and exposure feedback. Our analysis reveals that prevailing pre-ranking methods, which indiscriminately mix heterogeneous samples, suffer from gradient conflicts: hard samples dominate training while easy ones remain underutilized, leading to suboptimal performance. We further show that the common practice of uniformly scaling model complexity across all samples is inefficient, as it overspends computation on easy cases and slows training without proportional gains. To address these limitations, this paper presents Heterogeneity-Aware Adaptive Pre-ranking (HAP), a unified framework that mitigates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Mobile Crowdsensing and Crowdsourcing · Machine Learning and Data Classification