Model Provenance via Model DNA

Xin Mu; Yu Wang; Yehong Zhang; Jiaqi Zhang; Hui Wang; Yang Xiang; Yue; Yu

arXiv:2308.02121·cs.LG·July 19, 2024

Model Provenance via Model DNA

Xin Mu, Yu Wang, Yehong Zhang, Jiaqi Zhang, Hui Wang, Yang Xiang, Yue, Yu

PDF

Open Access

TL;DR

This paper introduces Model DNA, a novel representation capturing unique model characteristics to determine the provenance of machine learning models, with applications in security and intellectual property protection.

Contribution

It proposes a new concept of Model DNA and a data-driven framework for accurately identifying model provenance across tasks and model types.

Findings

01

Effective in identifying model provenance in vision and NLP tasks

02

High accuracy across diverse models and datasets

03

Enhances security and IP protection for ML models

Abstract

Understanding the life cycle of the machine learning (ML) model is an intriguing area of research (e.g., understanding where the model comes from, how it is trained, and how it is used). This paper focuses on a novel problem within this field, namely Model Provenance (MP), which concerns the relationship between a target model and its pre-training model and aims to determine whether a source model serves as the provenance for a target model. This is an important problem that has significant implications for ensuring the security and intellectual property of machine learning models but has not received much attention in the literature. To fill in this gap, we introduce a novel concept of Model DNA which represents the unique characteristics of a machine learning model. We utilize a data-driven and model-driven representation learning method to encode the model's training data and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Machine Learning in Materials Science · Software Engineering Research