CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG
Boyi Deng, Wenjie Wang, Fengbin Zhu, Qifan Wang, Fuli Feng

TL;DR
This paper introduces CrAM, a method that enhances retrieval-augmented generation in LLMs by adjusting attention based on document credibility, effectively reducing misinformation influence and improving factual accuracy.
Contribution
CrAM is a novel plug-and-play approach that identifies influential attention heads and modulates their weights according to document credibility, improving LLM reliability without fine-tuning.
Findings
CrAM improves RAG performance by over 20% against misinformation.
CrAM surpasses supervised fine-tuning methods in accuracy.
Effective across multiple LLM architectures and datasets.
Abstract
Retrieval-Augmented Generation (RAG) can alleviate hallucinations of Large Language Models (LLMs) by referencing external documents. However, the misinformation in external documents may mislead LLMs' generation. To address this issue, we explore the task of "credibility-aware RAG", in which LLMs automatically adjust the influence of retrieved documents based on their credibility scores to counteract misinformation. To this end, we introduce a plug-and-play method named edibility-aware ttention odification (CrAM). CrAM identifies influential attention heads in LLMs and adjusts their attention weights based on the credibility of the documents, thereby reducing the impact of low-credibility documents. Experiments on Natual Questions and TriviaQA using Llama2-13B, Llama3-8B, and Qwen1.5-7B show that CrAM improves the RAG performance of LLMs against…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNetwork Security and Intrusion Detection · Adversarial Robustness in Machine Learning · Misinformation and Its Impacts
MethodsAttention Is All You Need · WordPiece · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Dropout
