Leveraging Data Geometry to Mitigate CSM in Steganalysis
Rony Abecidan (CRIStAL, CNRS), Vincent Itier (IMT Nord Europe,, CRIStAL), J\'er\'emie Boulanger (CRIStAL), Patrick Bas (CRIStAL, CNRS),, Tom\'a\v{s} Pevn\'y (CTU)

TL;DR
This paper introduces a geometry-based method to select training datasets that improve the generalization of steganalysis models under cover source mismatch conditions, addressing real-world operational challenges.
Contribution
It develops a novel geometrical metric based on subspace chordal distance to optimize training data selection, enhancing model robustness to CSM without requiring cover-stego balance information.
Findings
Geometry-based metric correlates with operational regret
Method outperforms traditional data selection approaches
Effective in scenarios with diverse cover sources
Abstract
In operational scenarios, steganographers use sets of covers from various sensors and processing pipelines that differ significantly from those used by researchers to train steganalysis models. This leads to an inevitable performance gap when dealing with out-of-distribution covers, commonly referred to as Cover Source Mismatch (CSM). In this study, we consider the scenario where test images are processed using the same pipeline. However, knowledge regarding both the labels and the balance between cover and stego is missing. Our objective is to identify a training dataset that allows for maximum generalization to our target. By exploring a grid of processing pipelines fostering CSM, we discovered a geometrical metric based on the chordal distance between subspaces spanned by DCTr features, that exhibits high correlation with operational regret while being not affected by the cover-stego…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis
