Do Not Merge My Model! Safeguarding Open-Source LLMs Against Unauthorized Model Merging
By: Qinfeng Li , Miao Pan , Jintao Chen and more
Potential Business Impact:
Stops others from copying AI models unfairly.
Model merging has emerged as an efficient technique for expanding large language models (LLMs) by integrating specialized expert models. However, it also introduces a new threat: model merging stealing, where free-riders exploit models through unauthorized model merging. Unfortunately, existing defense mechanisms fail to provide effective protection. Specifically, we identify three critical protection properties that existing methods fail to simultaneously satisfy: (1) proactively preventing unauthorized merging; (2) ensuring compatibility with general open-source settings; (3) achieving high security with negligible performance loss. To address the above issues, we propose MergeBarrier, a plug-and-play defense that proactively prevents unauthorized merging. The core design of MergeBarrier is to disrupt the Linear Mode Connectivity (LMC) between the protected model and its homologous counterparts, thereby eliminating the low-loss path required for effective model merging. Extensive experiments show that MergeBarrier effectively prevents model merging stealing with negligible accuracy loss.
Similar Papers
Do Not Merge My Model! Safeguarding Open-Source LLMs Against Unauthorized Model Merging
Cryptography and Security
Stops others from copying smart computer programs.
Defending Unauthorized Model Merging via Dual-Stage Weight Protection
CV and Pattern Recognition
Stops others from stealing and breaking your AI models.
Disrupting Model Merging: A Parameter-Level Defense Without Sacrificing Accuracy
Machine Learning (CS)
Stops others from stealing AI's special skills.