Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking
Jun Bai, Zhuofan Chen, Zhenzi Li, Hanhua Hong, Jianfei Zhang, Chen Li,, Chenghua Lin, Wenge Rong

TL;DR
This paper introduces AiRTran, a novel transferability estimation method that predicts a model's ranking ability directly, improving model selection in text ranking tasks over existing methods and human intuition.
Contribution
The paper proposes a new transferability estimation approach based on expected rank, specifically designed for text ranking, and demonstrates its effectiveness over existing methods.
Findings
AiRTran outperforms previous TE methods in model selection accuracy.
AiRTran surpasses human intuition and ChatGPT in selecting effective models.
The method effectively captures subtle differences between models.
Abstract
Text ranking has witnessed significant advancements, attributed to the utilization of dual-encoder enhanced by Pre-trained Language Models (PLMs). Given the proliferation of available PLMs, selecting the most effective one for a given dataset has become a non-trivial challenge. As a promising alternative to human intuition and brute-force fine-tuning, Transferability Estimation (TE) has emerged as an effective approach to model selection. However, current TE methods are primarily designed for classification tasks, and their estimated transferability may not align well with the objectives of text ranking. To address this challenge, we propose to compute the expected rank as transferability, explicitly reflecting the model's ranking capability. Furthermore, to mitigate anisotropy and incorporate training dynamics, we adaptively scale isotropic sentence embeddings to yield an accurate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsText and Document Classification Technologies · Advanced Text Analysis Techniques · Data Management and Algorithms
MethodsALIGN
