Take Fake as Real: Realistic-like Robust Black-box Adversarial Attack to Evade AIGC Detection
Caiyun Xie, Dengpan Ye, Yunming Zhang, Long Tang, Yunna Lv, Jiacheng, Deng, Jiawei Song

TL;DR
This paper introduces R²BA, a novel black-box adversarial attack that uses real-world post-processing techniques to evade AIGC detectors effectively, improving robustness and invisibility over existing methods.
Contribution
The paper proposes R²BA, a realistic-like adversarial attack leveraging post-processing fusion optimization to evade AIGC detectors, addressing limitations of prior attacks on multi-class and diffusion-based models.
Findings
R²BA achieves 15-72% improvement in anti-detection performance.
It demonstrates strong robustness against various AIGC detectors.
The attack maintains high invisibility and effectiveness in real-world scenarios.
Abstract
The security of AI-generated content (AIGC) detection is crucial for ensuring multimedia content credibility. To enhance detector security, research on adversarial attacks has become essential. However, most existing adversarial attacks focus only on GAN-generated facial images detection, struggle to be effective on multi-class natural images and diffusion-based detectors, and exhibit poor invisibility. To fill this gap, we first conduct an in-depth analysis of the vulnerability of AIGC detectors and discover the feature that detectors vary in vulnerability to different post-processing. Then, considering that the detector is agnostic in real-world scenarios and given this discovery, we propose a Realistic-like Robust Black-box Adversarial attack (RBA) with post-processing fusion optimization. Unlike typical perturbations, RBA uses real-world post-processing, i.e., Gaussian blur,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Fault Detection and Control Systems
MethodsDiffusion · Focus
