LFreeDA: Label-Free Drift Adaptation for Windows Malware Detection
Adrian Shuai Li, Elisa Bertino

TL;DR
LFreeDA is a novel framework that enables malware classifiers to adapt to concept drift without manual labeling by leveraging unlabeled data through unsupervised domain adaptation and pseudo-labeling, maintaining high detection accuracy.
Contribution
It introduces a fully label-free, end-to-end malware detection adaptation method combining image-based domain adaptation and CFG-based classifier fine-tuning.
Findings
Improves accuracy by up to 12.6% over no-adaptation baselines.
Achieves near-supervised performance with minimal labeled data.
Maintains detection performance over evolving malware without human labels.
Abstract
Machine learning (ML)-based malware detectors degrade over time as concept drift introduces new and evolving families unseen during training. Retraining is limited by the cost and time of manual labeling or sandbox analysis. Existing approaches mitigate this via drift detection and selective labeling, but fully label-free adaptation remains largely unexplored. Recent self-training methods use a previously trained model to generate pseudo-labels for unlabeled data and then train a new model on these labels. The unlabeled data are used only for inference and do not participate in training the earlier model. We argue that these unlabeled samples still carry valuable information that can be leveraged when incorporated appropriately into training. This paper introduces LFreeDA, an end-to-end framework that adapts malware classifiers to drift without manual labeling or drift detection.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Data Stream Mining Techniques · Anomaly Detection Techniques and Applications
