Tackling Copyright Issues in AI Image Generation Through Originality Estimation and Genericization
Hiroaki Chiba-Okabe, Weijie J. Su

TL;DR
This paper introduces a novel genericization method and a metric for originality to reduce copyright infringement in AI-generated images, significantly decreasing the likelihood of producing copyrighted characters.
Contribution
It proposes a new genericization technique combined with an originality metric, and implements PREGen to effectively mitigate copyright issues in AI image generation.
Findings
PREGen reduces copyrighted character generation by over 50% when prompts include character names.
PREGen nearly eliminates generation of copyrighted characters without explicit prompts.
The originality metric quantifies data uniqueness, aiding in copyright risk assessment.
Abstract
The rapid progress of generative AI technology has sparked significant copyright concerns, leading to numerous lawsuits filed against AI developers. Notably, generative AI's capacity for generating images of copyrighted characters has been well documented in the literature, and while various techniques for mitigating copyright issues have been studied, significant risks remain. Here, we propose a genericization method that modifies the outputs of a generative model to make them more generic and less likely to imitate distinctive features of copyrighted materials. To achieve this, we introduce a metric for quantifying the level of originality of data, estimated by drawing samples from a generative model, and applied in the genericization process. As a practical implementation, we introduce PREGen (Prompt Rewriting-Enhanced Genericization), which combines our genericization method with an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsResearch Data Management Practices
