Modifying Memories in Transformer Models
Chen Zhu, Ankit Singh Rawat, Manzil Zaheer, Srinadh Bhojanapalli,, Daliang Li, Felix Yu, Sanjiv Kumar

TL;DR
This paper introduces a new task for Transformer models to selectively update or erase specific factual knowledge without affecting overall performance, addressing issues like stale information, privacy, and bias.
Contribution
It proposes the task of explicit knowledge modification in Transformers, benchmarks baseline methods, and analyzes key components and training phases affecting knowledge editing.
Findings
Baseline approaches achieve natural performance on the task.
Certain model components are especially effective for knowledge modification.
Training phases influence the model's ability to memorize and modify facts.
Abstract
Large Transformer models have achieved impressive performance in many natural language tasks. In particular, Transformer based language models have been shown to have great capabilities in encoding factual knowledge in their vast amount of parameters. While the tasks of improving the memorization and generalization of Transformers have been widely studied, it is not well known how to make transformers forget specific old facts and memorize new ones. In this paper, we propose a new task of \emph{explicitly modifying specific factual knowledge in Transformer models while ensuring the model performance does not degrade on the unmodified facts}. This task is useful in many scenarios, such as updating stale knowledge, protecting privacy, and eliminating unintended biases stored in the models. We benchmarked several approaches that provide natural baseline performances on this task. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dense Connections · Attention Is All You Need · Adam · Softmax · Byte Pair Encoding · Label Smoothing
