Pre-trained Graphformer-based Ranking at Web-scale Search (Extended Abstract)
Yuchen Li, Haoyi Xiong, Linghe Kong, Zeyi Sun, Hongyang Chen,, Shuaiqiang Wang, Dawei Yin

TL;DR
This paper introduces MPGraf, a novel pre-training model that combines Transformers and GNNs for web-scale learning to rank, addressing domain shift challenges and improving ranking performance.
Contribution
The paper proposes MPGraf, a modular capsule-based pre-training approach that unifies Transformer and GNN models for learning to rank at web scale.
Findings
MPGraf outperforms baseline models in offline evaluations.
MPGraf demonstrates improved online ranking performance.
The approach effectively handles distributional shifts between domains.
Abstract
Both Transformer and Graph Neural Networks (GNNs) have been employed in the domain of learning to rank (LTR). However, these approaches adhere to two distinct yet complementary problem formulations: ranking score regression based on query-webpage pairs, and link prediction within query-webpage bipartite graphs, respectively. While it is possible to pre-train GNNs or Transformers on source datasets and subsequently fine-tune them on sparsely annotated LTR datasets, the distributional shifts between the pair-based and bipartite graph domains present significant challenges in integrating these heterogeneous models into a unified LTR framework at web scale. To address this, we introduce the novel MPGraf model, which leverages a modular and capsule-based pre-training strategy, aiming to cohesively integrate the regression capabilities of Transformers with the link prediction strengths of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Data Management and Algorithms · Text and Document Classification Technologies
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Softmax · Layer Normalization · Dropout · Dense Connections
