ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

Pengcheng Huang; Zhenghao Liu; Yukun Yan; Haiyan Zhao; Xiaoyuan Yi; Hao Chen; Zhiyuan Liu; Maosong Sun; Tong Xiao; Ge Yu; Chenyan Xiong

arXiv:2502.15543·cs.CL·June 24, 2025

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

Pengcheng Huang, Zhenghao Liu, Yukun Yan, Haiyan Zhao, Xiaoyuan Yi, Hao Chen, Zhiyuan Liu, Maosong Sun, Tong Xiao, Ge Yu, Chenyan Xiong

PDF

Open Access 1 Repo 3 Models 1 Datasets

TL;DR

ParamMute is a novel method that suppresses specific internal FFNs in LLMs to improve the faithfulness of retrieval-augmented generation, reducing reliance on internal knowledge and aligning outputs with external evidence.

Contribution

This work identifies unfaithfulness-associated FFNs and introduces ParamMute, a framework that suppresses their activation to enhance LLM output faithfulness in RAG settings.

Findings

01

ParamMute significantly improves faithfulness on CoFaithfulQA.

02

It reduces reliance on internal parametric knowledge.

03

The approach outperforms existing methods on benchmark tests.

Abstract

Large language models (LLMs) integrated with retrieval-augmented generation (RAG) have improved factuality by grounding outputs in external evidence. However, they remain susceptible to unfaithful generation, where outputs contradict retrieved context despite its relevance and accuracy. Existing approaches aiming to improve faithfulness primarily focus on enhancing the utilization of external context, but often overlook the persistent influence of internal parametric knowledge during generation. In this work, we investigate the internal mechanisms behind unfaithful generation and identify a subset of mid-to-deep feed-forward networks (FFNs) that are disproportionately activated in such cases. Building on this insight, we propose Parametric Knowledge Muting through FFN Suppression (ParamMute), a framework that improves contextual faithfulness by suppressing the activation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

openbmb/pip-kag
jaxOfficial

Models

Datasets

chengpingan/CoConflictQA
dataset· 27 dl
27 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Advanced Graph Neural Networks

MethodsFocus