Can Large Vision-Language Models Detect Images Copyright Infringement from GenAI?
Qipan Xu, Zhenting Wang, Xiaoxiao He, Ligong Han, Ruixiang Tang

TL;DR
This paper evaluates the ability of large vision-language models to detect image copyright infringement, introduces a new benchmark dataset, and analyzes their overfitting issues and failure cases.
Contribution
It constructs a comprehensive benchmark dataset for copyright detection and assesses the performance of state-of-the-art LVLMs on this task.
Findings
LVLMs tend to overfit, misclassifying non-infringing images as infringing.
The constructed dataset includes both infringement and ambiguous non-infringement samples.
Analysis of failure cases suggests potential directions for improving LVLM copyright detection.
Abstract
Generative AI models, renowned for their ability to synthesize high-quality content, have sparked growing concerns over the improper generation of copyright-protected material. While recent studies have proposed various approaches to address copyright issues, the capability of large vision-language models (LVLMs) to detect copyright infringements remains largely unexplored. In this work, we focus on evaluating the copyright detection abilities of state-of-the-art LVLMs using a various set of image samples. Recognizing the absence of a comprehensive dataset that includes both IP-infringement samples and ambiguous non-infringement negative samples, we construct a benchmark dataset comprising positive samples that violate the copyright protection of well-known IP figures, as well as negative samples that resemble these figures but do not raise copyright concerns. This dataset is created…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection
MethodsSparse Evolutionary Training · Focus
