ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision

Dosung Lee; Wonjun Oh; Boyoung Kim; Minyoung Kim; Joonsuk Park; Paul Hongsuck Seo

arXiv:2505.21250·cs.CL·November 27, 2025

ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision

Dosung Lee, Wonjun Oh, Boyoung Kim, Minyoung Kim, Joonsuk Park, Paul Hongsuck Seo

PDF

Open Access 3 Models 1 Video

TL;DR

ReSCORE introduces a label-free training method for dense retrievers in multi-hop question answering, leveraging large language models to improve retrieval accuracy without needing labeled data, leading to state-of-the-art results.

Contribution

The paper presents ReSCORE, a novel iterative training approach for dense retrievers in MHQA that does not require labeled query-document pairs, addressing variability in questions.

Findings

01

Significant improvements in retrieval performance on three benchmarks.

02

Achieved state-of-the-art multi-hop QA results.

03

Effective training without labeled data.

Abstract

Multi-hop question answering (MHQA) involves reasoning across multiple documents to answer complex questions. Dense retrievers typically outperform sparse methods like BM25 by leveraging semantic embeddings; however, they require labeled query-document pairs for fine-tuning. This poses a significant challenge in MHQA due to the high variability of queries (reformulated) questions throughout the reasoning steps. To overcome this limitation, we introduce Retriever Supervision with Consistency and Relevance (ReSCORE), a novel method for training dense retrievers for MHQA without labeled documents. ReSCORE leverages large language models to capture each documents relevance to the question and consistency with the correct answer and use them to train a retriever within an iterative question-answering framework. Experiments on three MHQA benchmarks demonstrate the effectiveness of ReSCORE,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision· underline

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning