Stego Battlefield: Evaluating Image Steganography Attacks and Steganalysis Defenses
Zhen Sun, Zongmin Zhang, Leyi Sheng, Yule Liu, Yifan Liao, Ke Li, Xinhu Zheng, Jiaheng Wei, Wenyuan Yang, and Xinlei He

TL;DR
SADBench is a comprehensive benchmark framework for evaluating the effectiveness of image steganography attacks and steganalysis defenses across various tasks, payload types, and distributions, highlighting key vulnerabilities and robustness issues.
Contribution
It introduces a systematic, extensible evaluation framework for assessing both attack and defense capabilities in image steganography, addressing current gaps in security assessment.
Findings
Autoencoder-based methods show superior stability.
In-domain detection is nearly perfect and cost-effective.
Attacks transfer well across distributions, but detectors do not.
Abstract
Image steganography is widely used to protect user privacy and enable covert communication. However, it can also be abused by the adversary as a covert channel to bypass content moderation, disseminate harmful semantics, and even hide malicious instructions in images to elicit dangerous outputs from large models, posing a practical security risk that continues to evolve. To address the lack of a unified and systematic evaluation framework, we propose SADBench, a systematic benchmark that assesses the adversary's ability to inject harmful secrets via steganography and the defender's ability to detect such threats through steganalysis. Crucially, SADBench comprises core tasks, namely steganography attack capability evaluation, steganalysis defense capability evaluation, efficiency evaluation, and transferability evaluation. It evaluates both image-payload and text-payload…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
