QueryBandits for Hallucination Mitigation: Exploiting Semantic Features for No-Regret Rewriting

Nicole Cho; William Watson; Alec Koppel; Sumitra Ganesh; Manuela Veloso

arXiv:2508.16697·cs.CL·August 26, 2025

QueryBandits for Hallucination Mitigation: Exploiting Semantic Features for No-Regret Rewriting

Nicole Cho, William Watson, Alec Koppel, Sumitra Ganesh, Manuela Veloso

PDF

TL;DR

This paper introduces QueryBandits, a bandit-based framework that proactively rewrites queries to reduce hallucinations in Large Language Models by leveraging semantic features, outperforming static prompts and no-rewrite baselines.

Contribution

Proposes a novel bandit approach for query rewriting that effectively mitigates hallucinations in LLMs by exploiting semantic features, without retraining or gradient updates.

Findings

01

QueryBandits achieves an 87.5% win rate over no-rewrite baseline.

02

Static prompts can worsen hallucinations compared to no-rewrite.

03

Rewrites guided by semantic features induce significant output shifts.

Abstract

Advanced reasoning capabilities in Large Language Models (LLMs) have caused higher hallucination prevalence; yet most mitigation work focuses on after-the-fact filtering rather than shaping the queries that trigger them. We introduce QueryBandits, a bandit framework that designs rewrite strategies to maximize a reward model, that encapsulates hallucination propensity based upon the sensitivities of 17 linguistic features of the input query-and therefore, proactively steer LLMs away from generating hallucinations. Across 13 diverse QA benchmarks and 1,050 lexically perturbed queries per dataset, our top contextual QueryBandit (Thompson Sampling) achieves an 87.5% win rate over a no-rewrite baseline and also outperforms zero-shot static prompting ("paraphrase" or "expand") by 42.6% and 60.3% respectively. Therefore, we empirically substantiate the effectiveness of QueryBandits in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.