Analysis of the Scalability of a Deep-Learning Network for Steganography "Into the Wild"
Hugo Ruiz, Marc Chaumont, Mehdi Yedroudj, Ahmed Oulad Amara,, Fr\'ed\'eric Comby, G\'erard Subsol

TL;DR
This paper investigates how the performance and scalability of deep learning networks for steganalysis change with larger datasets and network sizes, revealing that error behavior follows a power-law even in complex scenarios.
Contribution
It provides the first analysis of CNN scalability in steganalysis on large, diverse datasets, confirming power-law error behavior in this context.
Findings
Error follows a power-law with respect to dataset and network size.
CNN performance remains consistent in ranking when scaled to larger datasets.
A minimum dataset and network size are identified for surpassing random guessing.
Abstract
Since the emergence of deep learning and its adoption in steganalysis fields, most of the reference articles kept using small to medium size CNN, and learn them on relatively small databases. Therefore, benchmarks and comparisons between different deep learning-based steganalysis algorithms, more precisely CNNs, are thus made on small to medium databases. This is performed without knowing: 1. if the ranking, with a criterion such as accuracy, is always the same when the database is larger, 2. if the efficiency of CNNs will collapse or not if the training database is a multiple of magnitude larger, 3. the minimum size required for a database or a CNN, in order to obtain a better result than a random guesser. In this paper, after a solid discussion related to the observed behaviour of CNNs as a function of their sizes and the database size, we confirm that the error's power-law…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
