LimRank: Less is More for Reasoning-Intensive Information Reranking

Tingyu Song; Yilun Zhao; Siyue Zhang; Chen Zhao; Arman Cohan

arXiv:2510.23544·cs.CL·October 28, 2025

TL;DR

This paper introduces LIMRANK, a reasoning-intensive reranker trained with minimal supervision and synthetic data, achieving competitive results with significantly less training data and computational resources.

Contribution

The authors propose LIMRANK and LIMRANK-SYNTHESIZER, enabling effective reranking with minimal supervision and open-source synthetic data generation, reducing reliance on large-scale fine-tuning.

Findings

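01 LIMRANK performs competitively on the reasoning-intensive BRIGHT benchmark and the instruction-following FollowIR benchmark.

02 Training on less than 5% of the data typically used in prior work is enough to reach competitive performance.

03 LIMRANK generalizes well across various downstream tasks.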

Abstract

Existing approaches typically rely on large-scale fine-tuning to adapt LLMs for information reranking tasks, which is computationally expensive. In this work, we demonstrate that modern LLMs can be effectively adapted using only minimal, high-quality supervision. To enable this, we design LIMRANK-SYNTHESIZER, a reusable and open-source pipeline for generating diverse, challenging, and realistic reranking examples. Using this synthetic data, we fine-tune our reranker model, LIMRANK. We evaluate LIMRANK on two challenging benchmarks: BRIGHT for reasoning-intensive retrieval and FollowIR for instruction-following retrieval. Our experiments demonstrate that LIMRANK achieves competitive performance while being trained on less than 5% of the data typically used in prior work. Further ablation studies demonstrate the effectiveness of LIMRANK-SYNTHESIZER and the strong generalization…

Code & Models

Models

🤗 songtingyu/limrank (model, 5 downloads)

Videos

LimRank: Less is More for Reasoning-Intensive Information Reranking· underline