PropMEND: Hypernetworks for Knowledge Propagation in LLMs
Zeyu Leo Liu, Greg Durrett, Eunsol Choi

TL;DR
PropMEND introduces a hypernetwork-based method that enhances knowledge propagation in large language models, enabling better reasoning with injected facts, especially for multi-hop questions, and demonstrates improved performance on new datasets.
Contribution
The paper proposes PropMEND, a novel hypernetwork approach that extends meta-learning to improve knowledge propagation in LLMs, especially for multi-hop reasoning tasks.
Findings
Almost 2x accuracy on multi-hop questions in RippleEdit
Outperforms existing methods on unseen entity-relation pairs
Performance gap decreases on generalization, indicating room for improvement
Abstract
Knowledge editing techniques for large language models (LLMs) can inject knowledge that is later reproducible verbatim, but they fall short on propagating that knowledge: models cannot answer questions that require reasoning with the injected knowledge. We present a hypernetwork-based approach for knowledge propagation, named PropMEND, where we meta-learn how to modify gradients of a language modeling loss to encourage injected information to propagate. Our approach extends the meta-objective of MEND [29] so that gradient updates on knowledge are transformed to enable answering multi-hop questions involving that knowledge. We show improved performance on the RippleEdit dataset, showing almost 2x accuracy on challenging multi-hop questions whose answers are not explicitly stated in the injected fact. We further introduce a new dataset, Controlled RippleEdit, to evaluate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Multimodal Machine Learning Applications
MethodsMODEL EDITOR NETWORKS WITH GRADIENT DECOMPOSITION · HyperNetwork
