LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction

Jensen Zhang; Ningyuan Liu; Yijia Fan; Zihao Huang; Qinglin Zeng; Kaitong Cai; Jian Wang; Keze Wang

arXiv:2512.18623·cs.CL·December 23, 2025

LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction

Jensen Zhang, Ningyuan Liu, Yijia Fan, Zihao Huang, Qinglin Zeng, Kaitong Cai, Jian Wang, Keze Wang

PDF

Open Access

TL;DR

LLM-CAS introduces a hierarchical reinforcement learning framework that dynamically perturbs neurons during inference to correct hallucinations in large language models, significantly enhancing factual accuracy without permanent model changes.

Contribution

This work presents a novel, adaptive neuron perturbation method using reinforcement learning for real-time hallucination correction in LLMs, outperforming static and heuristic dynamic approaches.

Findings

01

Achieves nearly 11% improvement on StoryCloze accuracy

02

Gains over 2.7 points on TriviaQA accuracy

03

Outperforms prior static and dynamic correction methods

Abstract

Large language models (LLMs) often generate hallucinated content that lacks factual or contextual grounding, limiting their reliability in critical applications. Existing approaches such as supervised fine-tuning and reinforcement learning from human feedback are data intensive and computationally expensive, while static parameter editing methods struggle with context dependent errors and catastrophic forgetting. We propose LLM-CAS, a framework that formulates real-time hallucination correction as a hierarchical reinforcement learning problem. LLM-CAS trains an agent to learn a policy that dynamically selects temporary neuron perturbations during inference based on the current context. Unlike prior dynamic approaches that rely on heuristic or predefined adjustments, this policy driven mechanism enables adaptive and fine grained correction without permanent parameter modification.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Misinformation and Its Impacts