QueryBandits for Hallucination Mitigation: Exploiting Semantic Features for No-Regret Rewriting
Nicole Cho, William Watson, Alec Koppel, Sumitra Ganesh, Manuela Veloso

TL;DR
This paper introduces QueryBandits, a bandit-based framework that proactively rewrites queries to reduce hallucinations in Large Language Models by leveraging semantic features, outperforming static prompts and no-rewrite baselines.
Contribution
Proposes a novel bandit approach for query rewriting that effectively mitigates hallucinations in LLMs by exploiting semantic features, without retraining or gradient updates.
Findings
QueryBandits achieves an 87.5% win rate over no-rewrite baseline.
Static prompts can worsen hallucinations compared to no-rewrite.
Rewrites guided by semantic features induce significant output shifts.
Abstract
Advanced reasoning capabilities in Large Language Models (LLMs) have caused higher hallucination prevalence; yet most mitigation work focuses on after-the-fact filtering rather than shaping the queries that trigger them. We introduce QueryBandits, a bandit framework that designs rewrite strategies to maximize a reward model, that encapsulates hallucination propensity based upon the sensitivities of 17 linguistic features of the input query-and therefore, proactively steer LLMs away from generating hallucinations. Across 13 diverse QA benchmarks and 1,050 lexically perturbed queries per dataset, our top contextual QueryBandit (Thompson Sampling) achieves an 87.5% win rate over a no-rewrite baseline and also outperforms zero-shot static prompting ("paraphrase" or "expand") by 42.6% and 60.3% respectively. Therefore, we empirically substantiate the effectiveness of QueryBandits in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
