Training-Free Color-Aware Adversarial Diffusion Sanitization for Diffusion Stegomalware Defense at Security Gateways
Vladimir Frants, Sos Agaian

TL;DR
This paper presents a training-free, color-aware adversarial sanitization method that neutralizes diffusion-based steganography payloads at security gateways with minimal perceptual distortion.
Contribution
It introduces ADS, a novel defense leveraging pretrained denoisers and quaternion-coupled updates to effectively neutralize diffusion steganography without training.
Findings
Drives decoder success rates to near zero against Pulsar steganography
Maintains minimal perceptual distortion in sanitized images
Offers a better security-utility trade-off than standard content transformations
Abstract
The rapid expansion of generative AI has normalized large-scale synthetic media creation, enabling new forms of covert communication. Recent generative steganography methods, particularly those based on diffusion models, can embed high-capacity payloads without fine-tuning or auxiliary decoders, creating significant challenges for detection and remediation. Coverless diffusion-based techniques are difficult to counter because they generate image carriers directly from secret data, enabling attackers to deliver stegomalware for command-and-control, payload staging, and data exfiltration while bypassing detectors that rely on cover-stego discrepancies. This work introduces Adversarial Diffusion Sanitization (ADS), a training-free defense for security gateways that neutralizes hidden payloads rather than detecting them. ADS employs an off-the-shelf pretrained denoiser as a differentiable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting · Digital Media Forensic Detection
