Explaining Vulnerabilities of Deep Learning to Adversarial Malware Binaries
Luca Demetrio, Battista Biggio, Giovanni Lagorio, Fabio Roli,, Alessandro Armando

TL;DR
This paper investigates why deep learning models for malware detection are vulnerable to adversarial attacks by using explainable AI techniques, revealing that models rely on superficial features like file headers rather than meaningful malware characteristics.
Contribution
It introduces an explainability-based analysis revealing the reliance on header features and proposes a novel, efficient adversarial attack that modifies only a few header bytes.
Findings
Models rely on header features rather than malware characteristics.
The proposed attack modifies only tens of header bytes, improving efficiency.
Explainability reveals why models are susceptible to adversarial examples.
Abstract
Recent work has shown that deep-learning algorithms for malware detection are also susceptible to adversarial examples, i.e., carefully-crafted perturbations to input malware that enable misleading classification. Although this has questioned their suitability for this task, it is not yet clear why such algorithms are easily fooled also in this particular application domain. In this work, we take a first step to tackle this issue by leveraging explainable machine-learning algorithms developed to interpret the black-box decisions of deep neural networks. In particular, we use an explainable technique known as feature attribution to identify the most influential input features contributing to each decision, and adapt it to provide meaningful explanations to the classification of malware binaries. In this case, we find that a recently-proposed convolutional neural network does not learn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications
