CrAM: Credibility-Aware Attention Modification in LLMs for Combating   Misinformation in RAG

Boyi Deng; Wenjie Wang; Fengbin Zhu; Qifan Wang; Fuli Feng

arXiv:2406.11497·cs.CL·December 18, 2024

CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG

Boyi Deng, Wenjie Wang, Fengbin Zhu, Qifan Wang, Fuli Feng

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces CrAM, a method that enhances retrieval-augmented generation in LLMs by adjusting attention based on document credibility, effectively reducing misinformation influence and improving factual accuracy.

Contribution

CrAM is a novel plug-and-play approach that identifies influential attention heads and modulates their weights according to document credibility, improving LLM reliability without fine-tuning.

Findings

01

CrAM improves RAG performance by over 20% against misinformation.

02

CrAM surpasses supervised fine-tuning methods in accuracy.

03

Effective across multiple LLM architectures and datasets.

Abstract

Retrieval-Augmented Generation (RAG) can alleviate hallucinations of Large Language Models (LLMs) by referencing external documents. However, the misinformation in external documents may mislead LLMs' generation. To address this issue, we explore the task of "credibility-aware RAG", in which LLMs automatically adjust the influence of retrieved documents based on their credibility scores to counteract misinformation. To this end, we introduce a plug-and-play method named $Cr$ edibility-aware $A$ ttention $M$ odification (CrAM). CrAM identifies influential attention heads in LLMs and adjusts their attention weights based on the credibility of the documents, thereby reducing the impact of low-credibility documents. Experiments on Natual Questions and TriviaQA using Llama2-13B, Llama3-8B, and Qwen1.5-7B show that CrAM improves the RAG performance of LLMs against…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Aatrox103/CrAM
noneOfficial

Videos

CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG· underline

Taxonomy

TopicsNetwork Security and Intrusion Detection · Adversarial Robustness in Machine Learning · Misinformation and Its Impacts

MethodsAttention Is All You Need · WordPiece · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Dropout