Towards Building Non-Fine-Tunable Foundation Models
Ziyao Wang, Nizhang Li, Pingzhi Li, Guoheng Sun, Tianlong Chen, Ang Li

TL;DR
This paper introduces a novel pre-training framework called Private Mask Pre-Training (PMP) that creates foundation models resistant to unauthorized fine-tuning, maintaining performance while limiting adaptation gains.
Contribution
The paper proposes PMP, a method that uses a private sparse subnetwork mask during pre-training to prevent effective unauthorized fine-tuning of foundation models.
Findings
PMP preserves base model performance.
PMP significantly degrades unauthorized fine-tuning.
The non-fine-tunability strength is adjustable via mask ratio.
Abstract
Open-sourcing foundation models (FMs) enables broad reuse but also exposes model trainers to economic and safety risks from unrestricted downstream fine-tuning. We address this problem by building non-fine-tunable foundation models: models that remain broadly usable in their released form while yielding limited adaptation gains under task-agnostic unauthorized fine-tuning. We propose Private Mask Pre-Training (PMP), a pre-training framework that concentrates representation learning into a sparse subnetwork identified early in training. The binary mask defining this subnetwork is kept private, and only the final dense weights are released. This forces unauthorized fine-tuning without access to the mask to update parameters misaligned with pretraining subspace, inducing an intrinsic mismatch between the fine-tuning objective and the pre-training geometry. We provide theoretical analysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques
