Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to   Evade AIGC Detection

Caiyun Xie; Dengpan Ye; Yunming Zhang; Long Tang; Yunna Lv; Jiacheng; Deng; Jiawei Song

arXiv:2412.06727·cs.CV·December 17, 2024

Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection

Caiyun Xie, Dengpan Ye, Yunming Zhang, Long Tang, Yunna Lv, Jiacheng, Deng, Jiawei Song

PDF

Open Access

TL;DR

This paper introduces R²BA, a novel black-box adversarial attack that uses real-world post-processing techniques to evade AIGC detectors effectively, improving robustness and invisibility over existing methods.

Contribution

The paper proposes R²BA, a realistic-like adversarial attack leveraging post-processing fusion optimization to evade AIGC detectors, addressing limitations of prior attacks on multi-class and diffusion-based models.

Findings

01

R²BA achieves 15-72% improvement in anti-detection performance.

02

It demonstrates strong robustness against various AIGC detectors.

03

The attack maintains high invisibility and effectiveness in real-world scenarios.

Abstract

The security of AI-generated content (AIGC) detection is crucial for ensuring multimedia content credibility. To enhance detector security, research on adversarial attacks has become essential. However, most existing adversarial attacks focus only on GAN-generated facial images detection, struggle to be effective on multi-class natural images and diffusion-based detectors, and exhibit poor invisibility. To fill this gap, we first conduct an in-depth analysis of the vulnerability of AIGC detectors and discover the feature that detectors vary in vulnerability to different post-processing. Then, considering that the detector is agnostic in real-world scenarios and given this discovery, we propose a Realistic-like Robust Black-box Adversarial attack (R $^{2}$ BA) with post-processing fusion optimization. Unlike typical perturbations, R $^{2}$ BA uses real-world post-processing, i.e., Gaussian blur,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Fault Detection and Control Systems

MethodsDiffusion · Focus