Do Not Merge My Model! Safeguarding Open-Source LLMs Against Unauthorized Model Merging

Qinfeng Li; Miao Pan; Jintao Chen; Fu Teng; Zhiqiang Shen; Ge Su; Hao Peng; Xuhong Zhang

arXiv:2511.10712 · cs.CR · November 21, 2025

TL;DR

This paper introduces MergeBarrier, a defense mechanism that proactively prevents unauthorized model merging in open-source LLMs by disrupting linear mode connectivity, effectively safeguarding models with minimal performance impact.

Contribution

The paper proposes MergeBarrier, a novel plug-and-play method that disrupts linear mode connectivity to prevent unauthorized model merging in open-source LLMs.

Findings

01. MergeBarrier prevents model merging stealing, in which free-riders exploit a protected model through unauthorized merging.

02. It achieves high security with negligible accuracy loss.

03. Extensive experiments validate its effectiveness.

Abstract

Model merging has emerged as an efficient technique for expanding large language models (LLMs) by integrating specialized expert models. However, it also introduces a new threat: model merging stealing, where free-riders exploit models through unauthorized model merging. Unfortunately, existing defense mechanisms fail to provide effective protection. Specifically, we identify three critical protection properties that existing methods fail to simultaneously satisfy: (1) proactively preventing unauthorized merging; (2) ensuring compatibility with general open-source settings; (3) achieving high security with negligible performance loss. To address the above issues, we propose MergeBarrier, a plug-and-play defense that proactively prevents unauthorized merging. The core design of MergeBarrier is to disrupt the Linear Mode Connectivity (LMC) between the protected model and its homologous…
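To make the two terms in the abstract concrete, the following is a minimal sketch of linear weight merging and of a loss barrier along the interpolation path, which is one common way to quantify Linear Mode Connectivity. Everything in it is an illustrative assumption: the two-unit ReLU network, the names `interpolate` and `loss_barrier`, and the use of a hidden-unit permutation to produce a functionally identical model that no longer merges cleanly. It is not claimed to be the transformation MergeBarrier applies.

```python
# Toy illustration only (hypothetical names, not the paper's method):
# a one-hidden-layer ReLU network with two hidden units and scalar input.

def relu(z):
    return z if z > 0.0 else 0.0

def forward(params, x):
    """params = (w, v): input weights and output weights, one per hidden unit."""
    w, v = params
    return sum(vi * relu(wi * x) for wi, vi in zip(w, v))

def interpolate(a, b, alpha):
    """Linear merge, parameter by parameter: (1 - alpha) * a + alpha * b."""
    return tuple(
        [(1.0 - alpha) * pa + alpha * pb for pa, pb in zip(ga, gb)]
        for ga, gb in zip(a, b)
    )

def mse(params, data):
    return sum((forward(params, x) - y) ** 2 for x, y in data) / len(data)

def loss_barrier(a, b, data, steps=11):
    """Largest gap between the merged model's loss and the straight line
    joining the two endpoint losses, over a grid of alpha in [0, 1]."""
    loss_a, loss_b = mse(a, data), mse(b, data)
    worst = 0.0
    for i in range(steps):
        alpha = i / (steps - 1)
        merged_loss = mse(interpolate(a, b, alpha), data)
        baseline = (1.0 - alpha) * loss_a + alpha * loss_b
        worst = max(worst, merged_loss - baseline)
    return worst

# "expert" computes relu(x) + 2 * relu(-x).
expert = ([1.0, -1.0], [1.0, 2.0])
# Same function with the two hidden units swapped (assumed transformation).
permuted = ([-1.0, 1.0], [2.0, 1.0])

xs = [-2.0, -1.0, -0.5, 0.5, 1.0, 2.0]
data = [(x, forward(expert, x)) for x in xs]

if __name__ == "__main__":
    print("barrier, expert vs. itself:  ", loss_barrier(expert, expert, data))
    print("barrier, expert vs. permuted:", loss_barrier(expert, permuted, data))
```

In this toy setting both parameter sets fit the data exactly, yet their midpoint has zero input weights and so outputs zero everywhere. That is the sense in which a model can keep its own accuracy while a linear merge with it degrades.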

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Topic Modeling