LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction
Jensen Zhang, Ningyuan Liu, Yijia Fan, Zihao Huang, Qinglin Zeng, Kaitong Cai, Jian Wang, Keze Wang

TL;DR
LLM-CAS introduces a hierarchical reinforcement learning framework that dynamically perturbs neurons during inference to correct hallucinations in large language models, significantly enhancing factual accuracy without permanent model changes.
Contribution
This work presents a novel, adaptive neuron perturbation method using reinforcement learning for real-time hallucination correction in LLMs, outperforming static and heuristic dynamic approaches.
Findings
Achieves nearly 11% improvement on StoryCloze accuracy
Gains over 2.7 points on TriviaQA accuracy
Outperforms prior static and dynamic correction methods
Abstract
Large language models (LLMs) often generate hallucinated content that lacks factual or contextual grounding, limiting their reliability in critical applications. Existing approaches such as supervised fine-tuning and reinforcement learning from human feedback are data intensive and computationally expensive, while static parameter editing methods struggle with context dependent errors and catastrophic forgetting. We propose LLM-CAS, a framework that formulates real-time hallucination correction as a hierarchical reinforcement learning problem. LLM-CAS trains an agent to learn a policy that dynamically selects temporary neuron perturbations during inference based on the current context. Unlike prior dynamic approaches that rely on heuristic or predefined adjustments, this policy driven mechanism enables adaptive and fine grained correction without permanent parameter modification.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Misinformation and Its Impacts
