Can Drift-Adaptive Malware Detectors Be Made Robust? Attacks and Defenses Under White-Box and Black-Box Threats
Adrian Shuai Li, Md Ajwad Akil, Elisa Bertino

TL;DR
This paper investigates the robustness of drift-adaptive malware detectors against adversarial attacks, proposing a universal framework that enhances defenses under different threat models and attack types.
Contribution
It introduces AdvDA, a malware detection method using adversarial domain adaptation, and a universal robustification framework to improve robustness against white-box and black-box attacks.
Findings
AdvDA is vulnerable to PGD attacks with 100% success rate.
The proposed framework reduces attack success rates to as low as 3.2%.
Different training strategies are optimal for different threat models.
Abstract
Concept drift and adversarial evasion are two major challenges for deploying machine learning-based malware detectors. While both have been studied separately, their combination, the adversarial robustness of drift-adaptive detectors, remains unexplored. We address this problem with AdvDA, a recent malware detector that uses adversarial domain adaptation to align a labeled source domain with a target domain with limited labels. The distribution shift between domains poses a unique challenge: robustness learned on the source may not transfer to the target, and existing defenses assume a fixed distribution. To address this, we propose a universal robustification framework that fine-tunes a pretrained AdvDA model on adversarially transformed inputs, agnostic to the attack type and choice of transformations. We instantiate it with five defense variants spanning two threat models: white-box…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
