Distributed Multi-Layer Editing for Rule-Level Knowledge in Large Language Models

Yating Wang; Wenting Zhao; Yaqi Zhao; Yongshun Gong; Yilong Yin; Haoliang Sun

arXiv:2604.08284·cs.CL·April 10, 2026

Distributed Multi-Layer Editing for Rule-Level Knowledge in Large Language Models

Yating Wang, Wenting Zhao, Yaqi Zhao, Yongshun Gong, Yilong Yin, Haoliang Sun

PDF

1 Repo

TL;DR

This paper introduces Distributed Multi-Layer Editing (DMLE), a novel method for editing rule-level knowledge in large language models by applying targeted updates across multiple transformer layers.

Contribution

It reveals the layered organization of rule knowledge in transformers and proposes a multi-layer editing approach that significantly improves rule-level editing performance.

Findings

01

Rule knowledge is organized across different transformer layers.

02

Single-layer editing methods are insufficient for rule-level knowledge.

03

DMLE improves rule understanding and instance portability by over 13 and 50 percentage points.

Abstract

Large language models store not only isolated facts but also rules that support reasoning across symbolic expressions, natural language explanations, and concrete instances. Yet most model editing methods are built for fact-level knowledge, assuming that a target edit can be achieved through a localized intervention. This assumption does not hold for rule-level knowledge, where a single rule must remain consistent across multiple interdependent forms. We investigate this problem through a mechanistic study of rule-level knowledge editing. To support this study, we extend the RuleEdit benchmark from 80 to 200 manually verified rules spanning mathematics and physics. Fine-grained causal tracing reveals a form-specific organization of rule knowledge in transformer layers: formulas and descriptions are concentrated in earlier layers, while instances are more associated with middle layers.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Pepper66/DMLE
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.