To Copy or Not to Copy: Copying Is Easier to Induce Than Recall
Mehrdad Farahani, Franziska Penzkofer, Richard Johansson

TL;DR
This paper investigates how language models choose between recalling stored knowledge and copying from context, introducing an arbitration vector to steer this behavior and analyzing the mechanisms behind copying and recall processes.
Contribution
The study presents a novel arbitration vector for controlling model behavior between copying and recalling, with detailed mechanistic insights into these processes.
Findings
Copying is easier to induce than recall in language models.
Inducing copying can be triggered at various input locations.
Restoring recall requires more fragile suppression techniques.
Abstract
Language models used in retrieval-augmented settings must arbitrate between parametric knowledge stored in their weights and contextual information in the prompt. This work presents a mechanistic study of that choice by extracting an \emph{arbitration vector} from model activations on a curated dataset designed to disentangle (i) irrelevant contexts that elicit parametric recall and (ii) relevant but false contexts that elicit copying. The vector is computed as the residual-stream centroid difference between these regimes across 27 relations, and is injected as an additive intervention at selected layers and token spans to steer behavior in two directions: CopyRecall (suppressing context use) and RecallCopy (inducing the model to copy any token from the context). Experiments on two architectures (decoder-only and encoder/decoder) and two open-domain QA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Multimodal Machine Learning Applications
