Tracking the Copyright of Large Vision-Language Models through Parameter   Learning Adversarial Images

Yubo Wang; Jianting Tang; Chaohu Liu; Linli Xu

arXiv:2502.16593·cs.AI·February 25, 2025

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

Yubo Wang, Jianting Tang, Chaohu Liu, Linli Xu

PDF

Open Access

TL;DR

This paper introduces a novel method called Parameter Learning Attack (PLA) that enables copyright tracking of large vision-language models by creating adversarial images, effective even after models are fine-tuned, without altering the original model.

Contribution

The paper presents a new approach for copyright tracking of LVLMs using parameter learning adversarial images that remains effective post-fine-tuning, without modifying the original model.

Findings

01

PLA outperforms baseline methods in identifying original copyrights.

02

The method remains effective after models are fine-tuned.

03

It does not impact the original model's performance.

Abstract

Large vision-language models (LVLMs) have demonstrated remarkable image understanding and dialogue capabilities, allowing them to handle a variety of visual question answering tasks. However, their widespread availability raises concerns about unauthorized usage and copyright infringement, where users or individuals can develop their own LVLMs by fine-tuning published models. In this paper, we propose a novel method called Parameter Learning Attack (PLA) for tracking the copyright of LVLMs without modifying the original model. Specifically, we construct adversarial images through targeted attacks against the original model, enabling it to generate specific outputs. To ensure these attacks remain effective on potential fine-tuned models to trigger copyright tracking, we allow the original model to learn the trigger images by updating parameters in the opposite direction during the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Digital Media Forensic Detection