Rank4Gen: RAG-Preference-Aligned Document Set Selection and Ranking
Yongqi Fan, Yuxiang Chu, Zhentao Xia, Xiaoyang Chen, Jie Liu, Haijin Liang, Jin Ma, Ben He, Yingfei Sun, Dezhi Ye, Tong Ruan

TL;DR
Rank4Gen is a generator-aware document ranking model for RAG that optimizes for response quality and accounts for generator-specific preferences, improving evidence selection and response accuracy.
Contribution
It introduces a generator-specific ranking approach and a new dataset, PRISM, to align document ranking with generator preferences and response quality in RAG systems.
Findings
Rank4Gen outperforms existing models on five RAG benchmarks.
It effectively models generator-specific preferences for evidence ranking.
The PRISM dataset enables training and evaluation of generator-aware rankers.
Abstract
In the RAG paradigm, the information retrieval module provides context for generators by retrieving and ranking multiple documents to support the aggregation of evidence. However, existing ranking models are primarily optimized for query--document relevance, which often misaligns with generators' preferences for evidence selection and citation, limiting their impact on response quality. Moreover, most approaches do not account for preference differences across generators, resulting in unstable cross-generator performance. We propose \textbf{Rank4Gen}, a generator-aware ranker for RAG that targets the goal of \emph{Ranking for Generators}. Rank4Gen introduces two key preference modeling strategies: (1) \textbf{From Ranking Relevance to Response Quality}, which optimizes ranking with respect to downstream response quality rather than query--document relevance; and (2)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Topic Modeling · Expert finding and Q&A systems
