Knowledge Compression via Question Generation: Enhancing Multihop Document Retrieval without Fine-tuning

Anvi Alex Eponon; Moein Shahiki-Tash; Ildar Batyrshin; Christian E. Maldonado-Sifuentes; Grigori Sidorov; Alexander Gelbukh

arXiv:2506.13778·cs.IR·June 18, 2025

Knowledge Compression via Question Generation: Enhancing Multihop Document Retrieval without Fine-tuning

Anvi Alex Eponon, Moein Shahiki-Tash, Ildar Batyrshin, Christian E. Maldonado-Sifuentes, Grigori Sidorov, Alexander Gelbukh

PDF

Open Access

TL;DR

This paper introduces a question-based knowledge encoding method that significantly improves multihop document retrieval and retrieval-augmented generation without fine-tuning, using question generation and reranking techniques.

Contribution

It presents a novel question generation approach for knowledge encoding that enhances retrieval performance and reduces storage needs without requiring model fine-tuning.

Findings

01

Achieves 0.84 Recall@3 in single-hop retrieval, outperforming chunking by 60%.

02

Increases BM25 MRR@3 from 0.56 to 0.85 with paper-cards.

03

Surpasses baselines in multihop F1 score with 0.52 on LongBench.

Abstract

This study presents a question-based knowledge encoding approach that improves retrieval-augmented generation (RAG) systems without requiring fine-tuning or traditional chunking. We encode textual content using generated questions that span the lexical and semantic space, creating targeted retrieval cues combined with a custom syntactic reranking method. In single-hop retrieval over 109 scientific papers, our approach achieves a Recall@3 of 0.84, outperforming traditional chunking methods by 60 percent. We also introduce "paper-cards", concise paper summaries under 300 characters, which enhance BM25 retrieval, increasing MRR@3 from 0.56 to 0.85 on simplified technical queries. For multihop tasks, our reranking method reaches an F1 score of 0.52 with LLaMA2-Chat-7B on the LongBench 2WikiMultihopQA dataset, surpassing chunking and fine-tuned baselines which score 0.328 and 0.412…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Semantic Web and Ontologies · Natural Language Processing Techniques

MethodsLinear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Byte Pair Encoding · Dense Connections · Softmax · Layer Normalization · Dropout · BERT · BART