MADAR: Efficient Continual Learning for Malware Analysis with Distribution-Aware Replay
Mohammad Saidur Rahman, Scott Coull, Qi Yu, Matthew Wright

TL;DR
MADAR is a novel continual learning framework tailored for malware analysis that effectively handles the diverse and evolving nature of malware data, outperforming existing methods on large-scale datasets.
Contribution
This paper introduces MADAR, a distribution-aware replay method specifically designed for malware classification, addressing the limitations of prior CL techniques in this domain.
Findings
MADAR significantly outperforms prior continual learning methods on malware datasets.
Understanding malware data distribution is crucial for designing effective CL techniques.
MADAR demonstrates robustness across Windows and Android malware datasets.
Abstract
Millions of new pieces of malicious software (i.e., malware) are introduced each year. This poses significant challenges for antivirus vendors, who use machine learning to detect and analyze malware, and must keep up with changes in the distribution while retaining knowledge of older variants. Continual learning (CL) holds the potential to address this challenge by reducing the storage and computational costs of regularly retraining over all the collected data. Prior work, however, shows that CL techniques, which are designed primarily for computer vision tasks, fare poorly when applied to malware classification. To address these issues, we begin with an exploratory analysis of a typical malware dataset, which reveals that malware families are diverse and difficult to characterize, requiring a wide variety of samples to learn a robust representation. Based on these findings, we propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Influenza Virus Research Studies
