Defending Unauthorized Model Merging via Dual-Stage Weight Protection

Wei-Jia Chen; Min-Yen Tsai; Cheng-Yi Lee; Chia-Mu Yu

arXiv:2511.11851·cs.CV·March 13, 2026

Defending Unauthorized Model Merging via Dual-Stage Weight Protection

Wei-Jia Chen, Min-Yen Tsai, Cheng-Yi Lee, Chia-Mu Yu

PDF

Open Access

TL;DR

This paper introduces MergeGuard, a dual-stage weight protection framework that prevents unauthorized model merging by disrupting compatibility while preserving the original model's performance.

Contribution

The paper proposes a novel dual-stage method to protect models from unauthorized merging, combining gradient redistribution and structured perturbations.

Findings

01

Reduces merged model accuracy by up to 90%

02

Maintains less than 1.5% performance loss on the original model

03

Effective on both vision and language models

Abstract

The rapid proliferation of pretrained models and open repositories has made model merging a convenient yet risky practice, allowing free-riders to combine fine-tuned models into a new multi-capability model without authorization. Such unauthorized model merging not only violates intellectual property rights but also undermines model ownership and accountability. To address this issue, we present MergeGuard, a proactive dual-stage weight protection framework that disrupts merging compatibility while maintaining task fidelity. In the first stage, we redistribute task-relevant information across layers via L2-regularized optimization, ensuring that important gradients are evenly dispersed. In the second stage, we inject structured perturbations to misalign task subspaces, breaking curvature compatibility in the loss landscape. Together, these stages reshape the model's parameter geometry…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Big Data and Digital Economy