On the Steganographic Capacity of Selected Learning Models
Rishit Agrawal, Kelvin Jou, Tanush Obili, Daksh Parikh and, Samarth Prajapati, Yash Seth, Charan Sridhar, Nathan Zhang, Mark, Stamp

TL;DR
This paper investigates how much information can be hidden in the parameters of various machine learning models without affecting their performance, revealing significant steganographic capacity across models.
Contribution
It provides a comprehensive analysis of the steganographic capacity of diverse learning models by quantifying how many low-order bits can be overwritten without degrading accuracy.
Findings
Most bits of trained parameters can be overwritten before accuracy drops.
Steganographic capacity varies widely among models, from KBs to MBs.
Results suggest potential for covert information embedding in models.
Abstract
Machine learning and deep learning models are potential vectors for various attack scenarios. For example, previous research has shown that malware can be hidden in deep learning models. Hiding information in a learning model can be viewed as a form of steganography. In this research, we consider the general question of the steganographic capacity of learning models. Specifically, for a wide range of models, we determine the number of low-order bits of the trained parameters that can be overwritten, without adversely affecting model performance. For each model considered, we graph the accuracy as a function of the number of low-order bits that have been overwritten, and for selected models, we also analyze the steganographic capacity of individual layers. The models that we test include the classic machine learning techniques of Linear Regression (LR) and Support Vector Machine (SVM);…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Internet Traffic Analysis and Secure E-voting
MethodsLinear Regression · Auxiliary Classifier
