Explaining Vulnerabilities of Deep Learning to Adversarial Malware   Binaries

Luca Demetrio; Battista Biggio; Giovanni Lagorio; Fabio Roli,; Alessandro Armando

arXiv:1901.03583·cs.CR·January 25, 2019·24 cites

Explaining Vulnerabilities of Deep Learning to Adversarial Malware Binaries

Luca Demetrio, Battista Biggio, Giovanni Lagorio, Fabio Roli,, Alessandro Armando

PDF

Open Access 2 Repos

TL;DR

This paper investigates why deep learning models for malware detection are vulnerable to adversarial attacks by using explainable AI techniques, revealing that models rely on superficial features like file headers rather than meaningful malware characteristics.

Contribution

It introduces an explainability-based analysis revealing the reliance on header features and proposes a novel, efficient adversarial attack that modifies only a few header bytes.

Findings

01

Models rely on header features rather than malware characteristics.

02

The proposed attack modifies only tens of header bytes, improving efficiency.

03

Explainability reveals why models are susceptible to adversarial examples.

Abstract

Recent work has shown that deep-learning algorithms for malware detection are also susceptible to adversarial examples, i.e., carefully-crafted perturbations to input malware that enable misleading classification. Although this has questioned their suitability for this task, it is not yet clear why such algorithms are easily fooled also in this particular application domain. In this work, we take a first step to tackle this issue by leveraging explainable machine-learning algorithms developed to interpret the black-box decisions of deep neural networks. In particular, we use an explainable technique known as feature attribution to identify the most influential input features contributing to each decision, and adapt it to provide meaningful explanations to the classification of malware binaries. In this case, we find that a recently-proposed convolutional neural network does not learn…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications