Multi-view Representation Learning from Malware to Defend Against   Adversarial Variants

James Lee Hu; Mohammadreza Ebrahimi; Weifeng Li; Xin Li; Hsinchun Chen

arXiv:2210.15429·cs.CR·October 28, 2022

Multi-view Representation Learning from Malware to Defend Against Adversarial Variants

James Lee Hu, Mohammadreza Ebrahimi, Weifeng Li, Xin Li, Hsinchun Chen

PDF

Open Access

TL;DR

This paper introduces ARMD, a multi-view learning framework that enhances deep learning malware detectors' robustness against adversarial variants by leveraging multiple views of malware files, such as source code and binary data.

Contribution

The paper proposes a novel multi-view learning approach, ARMD, to significantly improve the robustness of malware detectors against adversarial modifications.

Findings

01

ARMD improves adversarial robustness up to seven times.

02

Experiments conducted on three malware detectors across six categories.

03

Multi-view approach exploits untouched source code features.

Abstract

Deep learning-based adversarial malware detectors have yielded promising results in detecting never-before-seen malware executables without relying on expensive dynamic behavior analysis and sandbox. Despite their abilities, these detectors have been shown to be vulnerable to adversarial malware variants - meticulously modified, functionality-preserving versions of original malware executables generated by machine learning. Due to the nature of these adversarial modifications, these adversarial methods often use a \textit{single view} of malware executables (i.e., the binary/hexadecimal view) to generate adversarial malware variants. This provides an opportunity for the defenders (i.e., malware detectors) to detect the adversarial variants by utilizing more than one view of a malware file (e.g., source code view in addition to the binary view). The rationale behind this idea is that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications