FBA$^2$D: Frequency-based Black-box Attack for AI-generated Image Detection
Xiaojing Chen, Dan Li, Lijun Peng, Jun Yan{\L}etter, Zhiqing Guo, Junyang Chen, Xiao Lan, Zhongjie Ba, Yunfeng Diao{\L}etter

TL;DR
This paper introduces FBA$^2$D, a frequency-based black-box attack method targeting AI-generated content detectors, utilizing spectral analysis to improve attack efficiency and effectiveness in real-world API scenarios.
Contribution
It proposes a novel frequency-domain decision-based attack leveraging DCT for spectral partitioning, filling a gap in black-box attacks on AIGC detectors.
Findings
Effective in reducing query counts
Preserves image quality during attack
Demonstrates success on multiple datasets
Abstract
The prosperous development of Artificial Intelligence-Generated Content (AIGC) has brought people's anxiety about the spread of false information on social media. Designing detectors for filtering is an effective defense method, but most detectors will be compromised by adversarial samples. Currently, most studies exposing AIGC security issues assume information on model structure and data distribution. In real applications, attackers query and interfere with models that provide services in the form of application programming interfaces (APIs), which constitutes the black-box decision-based attack paradigm. However, to the best of our knowledge, decision-based attacks on AIGC detectors remain unexplored. In this study, we propose \textbf{FBAD}: a frequency-based black-box attack method for AIGC detection to fill the research gap. Motivated by frequency-domain discrepancies between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis
