Stego Battlefield: Evaluating Image Steganography Attacks and Steganalysis Defenses

Zhen Sun; Zongmin Zhang; Leyi Sheng; Yule Liu; Yifan Liao; Ke Li; Xinhu Zheng; Jiaheng Wei; Wenyuan Yang; and Xinlei He

arXiv:2605.05789·cs.CR·May 8, 2026

Stego Battlefield: Evaluating Image Steganography Attacks and Steganalysis Defenses

Zhen Sun, Zongmin Zhang, Leyi Sheng, Yule Liu, Yifan Liao, Ke Li, Xinhu Zheng, Jiaheng Wei, Wenyuan Yang, and Xinlei He

PDF

TL;DR

SADBench is a comprehensive benchmark framework for evaluating the effectiveness of image steganography attacks and steganalysis defenses across various tasks, payload types, and distributions, highlighting key vulnerabilities and robustness issues.

Contribution

It introduces a systematic, extensible evaluation framework for assessing both attack and defense capabilities in image steganography, addressing current gaps in security assessment.

Findings

01

Autoencoder-based methods show superior stability.

02

In-domain detection is nearly perfect and cost-effective.

03

Attacks transfer well across distributions, but detectors do not.

Abstract

Image steganography is widely used to protect user privacy and enable covert communication. However, it can also be abused by the adversary as a covert channel to bypass content moderation, disseminate harmful semantics, and even hide malicious instructions in images to elicit dangerous outputs from large models, posing a practical security risk that continues to evolve. To address the lack of a unified and systematic evaluation framework, we propose SADBench, a systematic benchmark that assesses the adversary's ability to inject harmful secrets via steganography and the defender's ability to detect such threats through steganalysis. Crucially, SADBench comprises $4$ core tasks, namely steganography attack capability evaluation, steganalysis defense capability evaluation, efficiency evaluation, and transferability evaluation. It evaluates both image-payload and text-payload…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.