Query Generation Pipeline with Enhanced Answerability Assessment for Financial Information Retrieval

Hyunkyu Kim; Yeeun Yoo; Youngjun Kwak

arXiv:2511.05000·cs.IR·November 10, 2025

Query Generation Pipeline with Enhanced Answerability Assessment for Financial Information Retrieval

Hyunkyu Kim, Yeeun Yoo, Youngjun Kwak

PDF

Open Access

TL;DR

This paper introduces a methodology for creating domain-specific information retrieval benchmarks using LLM-based query generation, exemplified by KoBankIR, to improve financial IR evaluation and highlight retrieval challenges.

Contribution

It presents a systematic approach for constructing financial IR benchmarks via LLM-generated queries and an enhanced answerability assessment, improving alignment with human judgments.

Findings

01

Existing models struggle with complex multi-document queries in KoBankIR.

02

The proposed pipeline achieves better alignment with human judgments than prior methods.

03

KoBankIR contains 815 queries from 204 banking documents.

Abstract

As financial applications of large language models (LLMs) gain attention, accurate Information Retrieval (IR) remains crucial for reliable AI services. However, existing benchmarks fail to capture the complex and domain-specific information needs of real-world banking scenarios. Building domain-specific IR benchmarks is costly and constrained by legal restrictions on using real customer data. To address these challenges, we propose a systematic methodology for constructing domain-specific IR benchmarks through LLM-based query generation. As a concrete implementation of this methodology, our pipeline combines single and multi-document query generation with an enhanced and reasoning-augmented answerability assessment method, achieving stronger alignment with human judgments than prior approaches. Using this methodology, we construct KoBankIR, comprising 815 queries derived from 204…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Information Retrieval and Search Behavior · Natural Language Processing Techniques