"They've Stolen My GPL-Licensed Model!": Toward Standardized and Transparent Model Licensing
Moming Duan, Rui Zhao, Linshan Jiang, Nigel Shadbolt, Bingsheng He

TL;DR
This paper introduces ModelGo Licenses and MG Analyzer to standardize and improve transparency in ML model licensing, addressing compliance issues and enabling better rights management in collaborative ML development.
Contribution
It proposes a novel set of model-specific licenses and a reasoning-based tool for analyzing licensing compliance in ML workflows, enhancing transparency and flexibility.
Findings
MG Analyzer effectively detects licensing compliance issues.
ModelGo Licenses offer flexible licensing options for ML models.
Experiments demonstrate improved licensing clarity and compliance.
Abstract
As model parameter sizes scale into the billions and training consumes zettaFLOPs of computation, the reuse of Machine Learning (ML) assets and collaborative development have become increasingly prevalent in the ML community. These ML assets, including models, datasets, and software, may originate from various sources and be published under different licenses, which govern the use and distribution of licensed works and their derivatives. However, commonly chosen licenses, such as GPL and Apache, are software-specific and are not clearly defined or bounded in the context of model publishing. Meanwhile, the reused assets may also be under free-content licenses and model licenses, which pose a potential risk of license noncompliance and rights infringement within the model production workflow. In this paper, we address these challenges along two lines: 1) For ML workflow compliance, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Rights Management and Security · Scientific Computing and Data Management · Model-Driven Software Engineering Techniques
MethodsSparse Evolutionary Training
