Explanation-Guided Backdoor Poisoning Attacks Against Malware   Classifiers

Giorgio Severi; Jim Meyer; Scott Coull; Alina Oprea

arXiv:2003.01031·cs.CR·January 12, 2021·55 cites

Explanation-Guided Backdoor Poisoning Attacks Against Malware Classifiers

Giorgio Severi, Jim Meyer, Scott Coull, Alina Oprea

PDF

Open Access 2 Repos

TL;DR

This paper explores how explainable AI techniques can be used to craft effective backdoor poisoning attacks on malware classifiers, highlighting vulnerabilities in feature-based models and demonstrating practical attack methods across multiple file types.

Contribution

It introduces a novel explainability-guided approach for creating backdoor triggers in malware classifiers, applicable across diverse datasets and model types, with practical watermarking implementations.

Findings

01

Effective backdoor attacks demonstrated on Windows PE, PDFs, and Android classifiers

02

Explainability techniques improve trigger selection for model-agnostic attacks

03

Defensive strategies face significant challenges against these sophisticated attacks

Abstract

Training pipelines for machine learning (ML) based malware classification often rely on crowdsourced threat feeds, exposing a natural attack injection point. In this paper, we study the susceptibility of feature-based ML malware classifiers to backdoor poisoning attacks, specifically focusing on challenging "clean label" attacks where attackers do not control the sample labeling process. We propose the use of techniques from explainable machine learning to guide the selection of relevant features and values to create effective backdoor triggers in a model-agnostic fashion. Using multiple reference datasets for malware classification, including Windows PE files, PDFs, and Android applications, we demonstrate effective attacks against a diverse set of machine learning models and evaluate the effect of various constraints imposed on the attacker. To demonstrate the feasibility of our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications