The Case for Contextual Copyleft: Licensing Open Source Training Data and Generative AI
Grant Shanklin, Emmie Hine, Claudio Novelli, Tyler Schroder, Luciano Floridi

TL;DR
This paper proposes the Contextual Copyleft AI (CCAI) license, extending copyleft principles to AI training data and models, aiming to enhance open source AI development while addressing legal, policy, and risk considerations.
Contribution
Introduction of the CCAI license as a novel legal framework that applies copyleft to AI training data and models, with a comprehensive evaluation of its feasibility and implications.
Findings
CCAI enhances developer control over AI models.
Legal analysis shows CCAI is feasible under current law.
Regulatory measures are needed to mitigate misuse risks.
Abstract
The proliferation of generative AI systems has created new challenges for the Free and Open Source Software (FOSS) community, particularly regarding how traditional copyleft principles should apply when open source code is used to train AI models. This article introduces the Contextual Copyleft AI (CCAI) license, a novel licensing mechanism that extends copyleft requirements from training data to the resulting generative AI models. The CCAI license offers significant advantages, including enhanced developer control, incentivization of open source AI development, and mitigation of openwashing practices. This is demonstrated through a structured three-part evaluation framework that examines (1) legal feasibility under current copyright law, (2) policy justification comparing traditional software and AI contexts, and (3) synthesis of cross-contextual benefits and risks. However, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
