Chunking, Retrieval, and Re-ranking: An Empirical Evaluation of RAG Architectures for Policy Document Question Answering

Anuj Maharjan; Umesh Yadav

arXiv:2601.15457·cs.CL·January 23, 2026·2 cites

Chunking, Retrieval, and Re-ranking: An Empirical Evaluation of RAG Architectures for Policy Document Question Answering

Anuj Maharjan, Umesh Yadav

PDF

Open Access

TL;DR

This study empirically evaluates RAG architectures for policy document question answering, demonstrating that advanced retrieval and re-ranking significantly improve answer faithfulness in high-stakes public health contexts.

Contribution

It provides a comparative analysis of RAG architectures with different chunking strategies, highlighting the importance of two-stage retrieval for accurate policy question answering.

Findings

01

Advanced RAG achieves higher faithfulness (0.797) than baseline models.

02

Chunking strategies impact retrieval effectiveness and answer accuracy.

03

Two-stage retrieval mechanisms are crucial for domain-specific accuracy.

Abstract

The integration of Large Language Models (LLMs) into the public health policy sector offers a transformative approach to navigating the vast repositories of regulatory guidance maintained by agencies such as the Centers for Disease Control and Prevention (CDC). However, the propensity for LLMs to generate hallucinations, defined as plausible but factually incorrect assertions, presents a critical barrier to the adoption of these technologies in high-stakes environments where information integrity is non-negotiable. This empirical evaluation explores the effectiveness of Retrieval-Augmented Generation (RAG) architectures in mitigating these risks by grounding generative outputs in authoritative document context. Specifically, this study compares a baseline Vanilla LLM against Basic RAG and Advanced RAG pipelines utilizing cross-encoder re-ranking. The experimental framework employs a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Topic Modeling · Misinformation and Its Impacts