MEraser: An Effective Fingerprint Erasure Approach for Large Language Models
Jingxuan Zhang, Zhenhua Xu, Rui Hu, Wenpeng Xing, Xuhong Zhang, Meng Han

TL;DR
MEraser is a novel two-phase fine-tuning method that effectively removes backdoor-based fingerprints from large language models with minimal data, preserving performance and enabling cross-model fingerprint erasure.
Contribution
This paper introduces MEraser, a new approach for removing backdoor fingerprints from LLMs using a two-phase fine-tuning strategy with minimal data and a transferable erasure mechanism.
Findings
Complete fingerprint removal achieved across multiple models.
Model performance maintained with fewer than 1,000 samples.
Transferable erasure enables cross-model fingerprint removal.
Abstract
Large Language Models (LLMs) have become increasingly prevalent across various sectors, raising critical concerns about model ownership and intellectual property protection. Although backdoor-based fingerprinting has emerged as a promising solution for model authentication, effective attacks for removing these fingerprints remain largely unexplored. Therefore, we present Mismatched Eraser (MEraser), a novel method for effectively removing backdoor-based fingerprints from LLMs while maintaining model performance. Our approach leverages a two-phase fine-tuning strategy utilizing carefully constructed mismatched and clean datasets. Through extensive evaluation across multiple LLM architectures and fingerprinting methods, we demonstrate that MEraser achieves complete fingerprinting removal while maintaining model performance with minimal training data of fewer than 1,000 samples.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Digital and Cyber Forensics · Biometric Identification and Security
