To Copy or Not to Copy: Copying Is Easier to Induce Than Recall

Mehrdad Farahani; Franziska Penzkofer; Richard Johansson

arXiv:2601.12075·cs.CL·January 21, 2026

To Copy or Not to Copy: Copying Is Easier to Induce Than Recall

Mehrdad Farahani, Franziska Penzkofer, Richard Johansson

PDF

Open Access

TL;DR

This paper investigates how language models choose between recalling stored knowledge and copying from context, introducing an arbitration vector to steer this behavior and analyzing the mechanisms behind copying and recall processes.

Contribution

The study presents a novel arbitration vector for controlling model behavior between copying and recalling, with detailed mechanistic insights into these processes.

Findings

01

Copying is easier to induce than recall in language models.

02

Inducing copying can be triggered at various input locations.

03

Restoring recall requires more fragile suppression techniques.

Abstract

Language models used in retrieval-augmented settings must arbitrate between parametric knowledge stored in their weights and contextual information in the prompt. This work presents a mechanistic study of that choice by extracting an \emph{arbitration vector} from model activations on a curated dataset designed to disentangle (i) irrelevant contexts that elicit parametric recall and (ii) relevant but false contexts that elicit copying. The vector is computed as the residual-stream centroid difference between these regimes across 27 relations, and is injected as an additive intervention at selected layers and token spans to steer behavior in two directions: Copy $\to$ Recall (suppressing context use) and Recall $\to$ Copy (inducing the model to copy any token from the context). Experiments on two architectures (decoder-only and encoder/decoder) and two open-domain QA…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Information Retrieval and Search Behavior · Multimodal Machine Learning Applications