Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking

Jun Bai; Zhuofan Chen; Zhenzi Li; Hanhua Hong; Jianfei Zhang; Chen Li; Chenghua Lin; Wenge Rong

arXiv:2409.16198·cs.AI·September 25, 2024

TL;DR

This paper introduces AiRTran, a novel transferability estimation method that predicts a model's ranking ability directly, improving model selection in text ranking tasks over existing methods and human intuition.

Contribution

The paper proposes a new transferability estimation approach based on expected rank, specifically designed for text ranking, and demonstrates its effectiveness over existing methods.

Findings

01 AiRTran outperforms previous transferability estimation (TE) methods in model selection accuracy.

02 AiRTran surpasses both human intuition and ChatGPT in selecting effective models.

03 The method captures subtle differences between candidate models.

Abstract

Text ranking has witnessed significant advancements, attributed to the use of dual-encoders enhanced by Pre-trained Language Models (PLMs). Given the proliferation of available PLMs, selecting the most effective one for a given dataset has become a non-trivial challenge. As a promising alternative to human intuition and brute-force fine-tuning, Transferability Estimation (TE) has emerged as an effective approach to model selection. However, current TE methods are primarily designed for classification tasks, and their estimated transferability may not align well with the objectives of text ranking. To address this challenge, we propose to compute the expected rank as transferability, explicitly reflecting the model's ranking capability. Furthermore, to mitigate anisotropy and incorporate training dynamics, we adaptively scale isotropic sentence embeddings to yield an accurate…
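To make the idea of a rank-based transferability score concrete, here is a minimal sketch in plain Python. It is not AiRTran's actual formulation, which this page does not give: it scores each candidate encoder by the negated mean rank of the relevant document under dot-product similarity, and it omits the isotropization and adaptive scaling steps mentioned in the abstract. All function names, the candidate names `model_a` and `model_b`, and the toy embeddings are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Vector = Sequence[float]
# (query embeddings, document embeddings, index of the relevant doc per query)
Encoded = Tuple[List[Vector], List[Vector], List[int]]


def dot(u: Vector, v: Vector) -> float:
    """Dot-product similarity between two embeddings."""
    return sum(a * b for a, b in zip(u, v))


def positive_rank(query: Vector, docs: List[Vector], positive_idx: int) -> int:
    """1-based rank of the relevant document among all candidates."""
    scores = [dot(query, d) for d in docs]
    target = scores[positive_idx]
    # Rank = 1 + number of documents scoring strictly higher.
    return 1 + sum(1 for s in scores if s > target)


def mean_rank_score(queries: List[Vector], docs: List[Vector],
                    positives: List[int]) -> float:
    """Negated mean rank of relevant docs; higher means better ranking."""
    ranks = [positive_rank(q, docs, p) for q, p in zip(queries, positives)]
    return -sum(ranks) / len(ranks)


def select_model(candidates: Dict[str, Encoded]) -> str:
    """Return the candidate whose embeddings rank relevant docs highest."""
    return max(candidates, key=lambda name: mean_rank_score(*candidates[name]))


# Toy embeddings (assumed): model_a places each relevant doc first,
# model_b places it last.
toy_queries = [[1.0, 0.0], [0.0, 1.0]]
toy_candidates: Dict[str, Encoded] = {
    "model_a": (toy_queries, [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]], [0, 1]),
    "model_b": (toy_queries, [[0.0, 1.0], [1.0, 0.0], [0.5, 0.5]], [0, 1]),
}
best = select_model(toy_candidates)
```

In this toy setup `best` is `"model_a"`, since its relevant documents all sit at rank 1. In a realistic setting the embeddings would come from each candidate PLM applied to a sample of the target dataset, without fine-tuning.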

Code & Models

Repositories

ba1jun/model-selection-airtran
PyTorch · Official

Videos

Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking · underline

Taxonomy

Topics: Text and Document Classification Technologies · Advanced Text Analysis Techniques · Data Management and Algorithms

Methods: ALIGN