NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models
Zheng Yi Ho, Siyuan Liang, Sen Zhang, Yibing Zhan, Dacheng Tao

TL;DR
NoVo is a lightweight, inference-only method that leverages attention head norms to significantly improve factual accuracy in large language models across diverse datasets, surpassing current state-of-the-art methods.
Contribution
The paper introduces Norm Voting (NoVo), a novel approach that uses attention head norms for scalable, zero-shot factual accuracy enhancement in LLMs, with minimal computational overhead.
Findings
NoVo surpasses state-of-the-art accuracy on TruthfulQA MC1 by at least 19 points.
NoVo generalizes well to 20 diverse datasets, improving accuracy in over 90% of cases.
Head norm voting enhances LLM interpretability, robustness, and reliability.
Abstract
Hallucinations in Large Language Models (LLMs) remain a major obstacle, particularly in high-stakes applications where factual accuracy is critical. While representation editing and reading methods have made strides in reducing hallucinations, their heavy reliance on specialised tools and training on in-domain samples, makes them difficult to scale and prone to overfitting. This limits their accuracy gains and generalizability to diverse datasets. This paper presents a lightweight method, Norm Voting (NoVo), which harnesses the untapped potential of attention head norms to dramatically enhance factual accuracy in zero-shot multiple-choice questions (MCQs). NoVo begins by automatically selecting truth-correlated head norms with an efficient, inference-only algorithm using only 30 random samples, allowing NoVo to effortlessly scale to diverse datasets. Afterwards, selected head norms are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · COVID-19 diagnosis using AI · Big Data and Digital Economy
MethodsSoftmax · Attention Is All You Need
