Can Drift-Adaptive Malware Detectors Be Made Robust? Attacks and Defenses Under White-Box and Black-Box Threats

Adrian Shuai Li; Md Ajwad Akil; Elisa Bertino

arXiv:2604.06599·cs.CR·April 9, 2026

Can Drift-Adaptive Malware Detectors Be Made Robust? Attacks and Defenses Under White-Box and Black-Box Threats

Adrian Shuai Li, Md Ajwad Akil, Elisa Bertino

PDF

TL;DR

This paper investigates the robustness of drift-adaptive malware detectors against adversarial attacks, proposing a universal framework that enhances defenses under different threat models and attack types.

Contribution

It introduces AdvDA, a malware detection method using adversarial domain adaptation, and a universal robustification framework to improve robustness against white-box and black-box attacks.

Findings

01

AdvDA is vulnerable to PGD attacks with 100% success rate.

02

The proposed framework reduces attack success rates to as low as 3.2%.

03

Different training strategies are optimal for different threat models.

Abstract

Concept drift and adversarial evasion are two major challenges for deploying machine learning-based malware detectors. While both have been studied separately, their combination, the adversarial robustness of drift-adaptive detectors, remains unexplored. We address this problem with AdvDA, a recent malware detector that uses adversarial domain adaptation to align a labeled source domain with a target domain with limited labels. The distribution shift between domains poses a unique challenge: robustness learned on the source may not transfer to the target, and existing defenses assume a fixed distribution. To address this, we propose a universal robustification framework that fine-tunes a pretrained AdvDA model on adversarially transformed inputs, agnostic to the attack type and choice of transformations. We instantiate it with five defense variants spanning two threat models: white-box…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.