Securing Deep Generative Models with Universal Adversarial Signature

Yu Zeng; Mo Zhou; Yuan Xue; Vishal M. Patel

arXiv:2305.16310·cs.CV·May 26, 2023·2 cites

Securing Deep Generative Models with Universal Adversarial Signature

Yu Zeng, Mo Zhou, Yuan Xue, Vishal M. Patel

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces a universal adversarial signature that can be embedded into any pre-trained generative model to improve the detection and traceability of generated images across different models.

Contribution

It proposes a novel method to inject a universal adversarial signature into generative models, enhancing their detectability and traceability without affecting image quality.

Findings

01

Effective detection rate on FFHQ and ImageNet datasets

02

Universal signature generalizes to unseen models

03

Method is compatible with various state-of-the-art generators

Abstract

Recent advances in deep generative models have led to the development of methods capable of synthesizing high-quality, realistic images. These models pose threats to society due to their potential misuse. Prior research attempted to mitigate these threats by detecting generated images, but the varying traces left by different generative models make it challenging to create a universal detector capable of generalizing to new, unseen generative models. In this paper, we propose to inject a universal adversarial signature into an arbitrary pre-trained generative model, in order to make its generated contents more detectable and traceable. First, the imperceptible optimal signature for each image can be found by a signature injector through adversarial training. Subsequently, the signature can be incorporated into an arbitrary generator by fine-tuning it with the images processed by the…

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

The motivation is explained clearly. The paper is well-written.

Weaknesses

The performance with or without the adversarial signature should be presented. The term universal in used incorrectly since the signature depends on each image.

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 5

Strengths

1) The papers deals with an important problem regarding the safety of generative models and proposes an innovative solution. 2) The proposed solution intuitively makes sense and is also feasible in practice. 2) The paper is well-written and easy to follow.

Weaknesses

There are five main concerns about the proposed solution. 1) Firstly, the model F has been primarily described as a binary classifier whose goal is to distinguish between real images and "signed" synthetic images. What happens when the classifier F is fed with an "unsigned" synthetic image (i.e., an image generated from the original generator instead of the finetuned generator)? Shouldn't it be trained in such a way to detect "unsigned" synthetic images to the best possible extent along with "s

Reviewer 03Rating 6· marginally above the acceptance thresholdConfidence 4

Strengths

(1) This paper introduces adversarial examples into the detection of images generated by generative models, combined with joint training and watermark fine-tuning, which is novelty; (2) For the generation model, this paper investigates the latest diffusion-based generation model, which is of good practical significance under the common use of AIGC nowadays; (3) This paper is well-structured and logical. The author reviews the effectiveness and limitations of the proposed method from multiple per

Weaknesses

(1) The two SOTA detection methods compared in the experiment are against the CNN-based and GAN-based generative model, whether there is any relevant paper for the watermarking of the diffusion model at present, if so, please supplement; (2) Missing a lot of experimental data on ImageNet, please add results comparing with SOTA on ImageNet; (3) Some formatting issues: (1) Please cite the graphs in order; (2) Please distinguish between periods and semicolons within the algorithm; (3) Please give t

Code & Models

Repositories

zengxianyu/genwm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Face recognition and analysis