Anchored Decoding: Provably Reducing Copyright Risk for Any Language Model

Jacqueline He; Jonathan Hayase; Wen-tau Yih; Sewoong Oh; Luke Zettlemoyer; Pang Wei Koh

arXiv:2602.07120·cs.CL·February 10, 2026

TL;DR

Anchored Decoding is a novel inference-time technique that reduces copyright risk in language models by constraining generation to stay close to a safe model, balancing risk reduction and utility.

Contribution

The paper introduces Anchored Decoding, a new method for suppressing verbatim copying in language models, along with a safe model and byte-level variant for practical deployment.

Findings

01 Reduces copying gap by up to 75% across six metrics

02 Preserves near-original fluency and factuality

03 Defines a new Pareto frontier for risk-utility trade-off

Abstract

Modern language models (LMs) tend to memorize portions of their training data and emit verbatim spans. When the underlying sources are sensitive or copyright-protected, such reproduction raises issues of consent and compensation for creators and compliance risks for developers. We propose Anchored Decoding, a plug-and-play inference-time method for suppressing verbatim copying: it enables decoding from any risky LM trained on mixed-license data by keeping generation in bounded proximity to a permissively trained safe LM. Anchored Decoding adaptively allocates a user-chosen information budget over the generation trajectory and enforces per-step constraints that yield a sequence-level guarantee, enabling a tunable risk-utility trade-off. To make Anchored Decoding practically useful, we introduce a new permissively trained safe model (TinyComma 1.8B), as well as Anchored$_{\text{Byte}}$ …
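The abstract describes the mechanism only at a high level, so the following is a minimal toy sketch of one way a per-step constraint and a rolling budget could fit together. It is not the paper's algorithm. Everything here is an assumption made for illustration: the use of KL divergence as the "information" measure, the geometric interpolation between the two next-token distributions, the even split of the remaining budget over the remaining steps, and all function names (`kl`, `geometric_mix`, `anchored_step`, `anchored_decode_budgets`). The sketch also assumes the safe distribution gives nonzero probability to every token, and it uses fixed per-step distributions instead of real model calls.

```python
import math

def kl(p, q):
    """KL(p || q) in nats. Assumes q > 0 wherever p > 0 (illustrative assumption)."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def geometric_mix(safe, risky, lam):
    """Normalized safe^(1-lam) * risky^lam; lam=0 is the safe LM, lam=1 the risky LM."""
    weights = [s ** (1 - lam) * r ** lam for s, r in zip(safe, risky)]
    total = sum(weights)
    return [w / total for w in weights]

def anchored_step(safe, risky, step_budget, iters=50):
    """Hypothetical per-step rule: move as far toward the risky distribution as
    possible while keeping KL(mixture || safe) within step_budget."""
    if kl(risky, safe) <= step_budget:
        return list(risky)
    lo, hi = 0.0, 1.0  # lo is always feasible because KL at lam=0 is zero
    for _ in range(iters):
        mid = (lo + hi) / 2
        if kl(geometric_mix(safe, risky, mid), safe) <= step_budget:
            lo = mid
        else:
            hi = mid
    return geometric_mix(safe, risky, lo)

def anchored_decode_budgets(safe_steps, risky_steps, total_budget):
    """Hypothetical allocation: split the remaining budget evenly over the
    remaining steps, so unspent budget rolls over to later steps."""
    remaining = total_budget
    n = len(safe_steps)
    mixed = []
    for t, (safe, risky) in enumerate(zip(safe_steps, risky_steps)):
        step_budget = remaining / (n - t)
        p = anchored_step(safe, risky, step_budget)
        remaining -= kl(p, safe)  # spend only what this step actually used
        mixed.append(p)
    return mixed, total_budget - remaining

# Toy 4-token vocabulary over 3 steps; the risky model is near-deterministic
# at steps 0 and 2 and agrees with the safe model at step 1.
uniform = [0.25, 0.25, 0.25, 0.25]
safe_steps = [uniform, uniform, uniform]
risky_steps = [
    [0.97, 0.01, 0.01, 0.01],
    uniform,
    [0.01, 0.97, 0.01, 0.01],
]
mixed, spent = anchored_decode_budgets(safe_steps, risky_steps, total_budget=0.6)
```

In this toy run, the step where the two models agree spends nothing, so the last step receives a larger allowance than the first. Because each step spends at most its allowance, the per-step divergences sum to at most `total_budget`, which is the kind of sequence-level accounting the abstract alludes to.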

Code & Models

Models

🤗 jacquelinehe/tinycomma-1.8b-llama3-tokenizer (model · 12 downloads · 1 like)

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)