Is There No Such Thing as a Bad Question? H4R: HalluciBot For Ratiocination, Rewriting, Ranking, and Routing
William Watson, Nicole Cho, Nishan Srishankar

TL;DR
This paper introduces HalluciBot, a model that predicts the likelihood of hallucination in LLM queries, enabling improved query rewriting and routing to enhance response accuracy without relying on the LLM during inference.
Contribution
HalluciBot is a novel proxy model that estimates query quality and guides query rewriting and routing, significantly reducing hallucinations in LLM outputs.
Findings
Achieves 95.7% accuracy in multiple choice question responses
Reduces hallucination through query rewriting by 30.2%
Improves query routing effectiveness by 50.6%
Abstract
Hallucination continues to be one of the most critical challenges in the institutional adoption journey of Large Language Models (LLMs). While prior studies have primarily focused on the post-generation analysis and refinement of outputs, this paper centers on the effectiveness of queries in eliciting accurate responses from LLMs. We present HalluciBot, a model that estimates the query's propensity to hallucinate before generation, without invoking any LLMs during inference. HalluciBot can serve as a proxy reward model for query rewriting, offering a general framework to estimate query quality based on accuracy and consensus. In essence, HalluciBot investigates how poorly constructed queries can lead to erroneous outputs - moreover, by employing query rewriting guided by HalluciBot's empirical estimates, we demonstrate that 95.7% output accuracy can be achieved for Multiple Choice…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsPsychedelics and Drug Studies · Mental Health Research Topics · Digital Mental Health Interventions
