Modifying Memories in Transformer Models

Chen Zhu; Ankit Singh Rawat; Manzil Zaheer; Srinadh Bhojanapalli,; Daliang Li; Felix Yu; Sanjiv Kumar

arXiv:2012.00363·cs.CL·December 2, 2020·42 cites

Modifying Memories in Transformer Models

Chen Zhu, Ankit Singh Rawat, Manzil Zaheer, Srinadh Bhojanapalli,, Daliang Li, Felix Yu, Sanjiv Kumar

PDF

Open Access

TL;DR

This paper introduces a new task for Transformer models to selectively update or erase specific factual knowledge without affecting overall performance, addressing issues like stale information, privacy, and bias.

Contribution

It proposes the task of explicit knowledge modification in Transformers, benchmarks baseline methods, and analyzes key components and training phases affecting knowledge editing.

Findings

01

Baseline approaches achieve natural performance on the task.

02

Certain model components are especially effective for knowledge modification.

03

Training phases influence the model's ability to memorize and modify facts.

Abstract

Large Transformer models have achieved impressive performance in many natural language tasks. In particular, Transformer based language models have been shown to have great capabilities in encoding factual knowledge in their vast amount of parameters. While the tasks of improving the memorization and generalization of Transformers have been widely studied, it is not well known how to make transformers forget specific old facts and memorize new ones. In this paper, we propose a new task of \emph{explicitly modifying specific factual knowledge in Transformer models while ensuring the model performance does not degrade on the unmodified facts}. This task is useful in many scenarios, such as updating stale knowledge, protecting privacy, and eliminating unintended biases stored in the models. We benchmarked several approaches that provide natural baseline performances on this task. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dense Connections · Attention Is All You Need · Adam · Softmax · Byte Pair Encoding · Label Smoothing