Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval
Md. Asraful Haque, Aasar Mehdi, Maaz Mahboob, Tamkeen Fatima

TL;DR
This paper introduces a domain-grounded tiered retrieval system that significantly reduces hallucinations in large language models by verifying facts through a multi-stage, self-regulating pipeline across diverse benchmarks.
Contribution
It presents a novel four-phase retrieval and verification architecture that systematically improves factual accuracy in LLM outputs, especially in high-stakes domains.
Findings
Outperforms zero-shot baselines across all tested benchmarks
Achieves up to 83.7% win rate in TimeQA v2
Maintains high groundedness scores between 78.8% and 86.4%
Abstract
Large Language Models (LLMs) have achieved unprecedented fluency but remain susceptible to "hallucinations" - the generation of factually incorrect or ungrounded content. This limitation is particularly critical in high-stakes domains where reliability is paramount. We propose a domain-grounded tiered retrieval and verification architecture designed to systematically intercept factual inaccuracies by shifting LLMs from stochastic pattern-matchers to verified truth-seekers. The proposed framework utilizes a four-phase, self-regulating pipeline implemented via LangGraph: (I) Intrinsic Verification with Early-Exit logic to optimize compute, (II) Adaptive Search Routing utilizing a Domain Detector to target subject-specific archives, (III) Refined Context Filtering (RCF) to eliminate non-essential or distracting information, and (IV) Extrinsic Regeneration followed by atomic claim-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Misinformation and Its Impacts · Multimodal Machine Learning Applications
