On deceiving malware classification with section injection
Adeilson Antonio da Silva, Mauricio Pamplona Segundo

TL;DR
This paper presents a method to inject random bytes into malware files to deceive classification systems and improve robustness, revealing vulnerabilities in current malware detection approaches.
Contribution
It introduces a novel byte injection technique that both attacks and enhances malware classifiers, respecting file formats to maintain malware functionality.
Findings
Injection causes 25-40% accuracy drop in classification
7% size increase significantly impacts detection accuracy
Combining section reordering and injection improves classifier robustness
Abstract
We investigate how to modify executable files to deceive malware classification systems. This work's main contribution is a methodology to inject bytes across a malware file randomly and use it both as an attack to decrease classification accuracy but also as a defensive method, augmenting the data available for training. It respects the operating system file format to make sure the malware will still execute after our injection and will not change its behavior. We reproduced five state-of-the-art malware classification approaches to evaluate our injection scheme: one based on GIST+KNN, three CNN variations and one Gated CNN. We performed our experiments on a public dataset with 9,339 malware samples from 25 different families. Our results show that a mere increase of 7% in the malware size causes an accuracy drop between 25% and 40% for malware family classification. They show that a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Adversarial Robustness in Machine Learning
