How Well Can Knowledge Edit Methods Edit Perplexing Knowledge?
Huaizhi Ge, Frank Rudzicz, Zining Zhu

TL;DR
This paper investigates how the conflicting nature of new knowledge, termed perplexingness, affects the ability of LLM editing methods to update their knowledge, revealing higher resistance to edits that violate learned hierarchies.
Contribution
The study introduces HierarchyData, a dataset for analyzing knowledge editing, and systematically examines how perplexingness impacts editing effectiveness across models and methods.
Findings
Higher perplexingness correlates with lower editing success.
Abstract concept edits are more resistant to modification.
Knowledge conflicts with learned hierarchies are harder to encode.
Abstract
Large language models (LLMs) have demonstrated remarkable capabilities, but updating their knowledge post-training remains a critical challenge. While recent model editing techniques like Rank-One Model Editing (ROME) show promise, their effectiveness may vary based on the nature of the knowledge being edited. We introduce the concept of ``perplexingness'': the degree to which new knowledge conflicts with an LLM's learned conceptual hierarchies and categorical relationships. For instance, editing ``British Shorthair is a kind of cat'' to ``British Shorthair is a kind of dog'' represents a low-perplexingness edit within the same taxonomic level, while editing ``A cat is a kind of animal'' to ``A cat is a kind of plant'' represents a high-perplexingness edit that violates fundamental categorical boundaries. To systematically investigate this phenomenon, we introduce HierarchyData, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Biomedical Text Mining and Ontologies · Semantic Web and Ontologies
