ForgetMark: Stealthy Fingerprint Embedding via Targeted Unlearning in Language Models

Zhenhua Xu; Haobo Zhang; Zhebo Wang; Qichen Liu; Haitao Xu; Wenpeng Xing; Meng Han

arXiv:2601.08189·cs.CR·January 21, 2026

ForgetMark: Stealthy Fingerprint Embedding via Targeted Unlearning in Language Models

Zhenhua Xu, Haobo Zhang, Zhebo Wang, Qichen Liu, Haitao Xu, Wenpeng Xing, Meng Han

PDF

Open Access

TL;DR

ForgetMark introduces a stealthy fingerprinting method for language models that uses targeted unlearning and probabilistic traces, avoiding fixed triggers and improving robustness and detectability over existing backdoor approaches.

Contribution

It proposes a novel fingerprinting framework that encodes provenance via targeted unlearning with lightweight adapters, enhancing stealthiness and robustness compared to prior backdoor methods.

Findings

01

Achieves 100% ownership verification across diverse models.

02

Maintains standard model performance while embedding fingerprints.

03

Outperforms backdoor baselines in stealthiness and robustness.

Abstract

Existing invasive (backdoor) fingerprints suffer from high-perplexity triggers that are easily filtered, fixed response patterns exposed by heuristic detectors, and spurious activations on benign inputs. We introduce \textsc{ForgetMark}, a stealthy fingerprinting framework that encodes provenance via targeted unlearning. It builds a compact, human-readable key--value set with an assistant model and predictive-entropy ranking, then trains lightweight LoRA adapters to suppress the original values on their keys while preserving general capabilities. Ownership is verified under black/gray-box access by aggregating likelihood and semantic evidence into a fingerprint success rate. By relying on probabilistic forgetting traces rather than fixed trigger--response patterns, \textsc{ForgetMark} avoids high-perplexity triggers, reduces detectability, and lowers false triggers. Across diverse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiometric Identification and Security · Adversarial Robustness in Machine Learning · Forensic Fingerprint Detection Methods