Efficient Concept Drift Handling for Batch Android Malware Detection Models
Molina-Coronado B., Mori U., Mendiburu A., Miguel-Alonso J

TL;DR
This paper investigates efficient retraining strategies, including concept drift detection and sample selection, to maintain the effectiveness of static Android malware detectors amid evolving app environments.
Contribution
It introduces a combination of concept drift detection and active learning for effective, resource-efficient model retraining in Android malware detection.
Findings
Concept drift detection reduces unnecessary retraining.
Active learning helps maintain diverse training data with fewer samples.
Retraining strategies improve detector performance over time.
Abstract
The rapidly evolving nature of Android apps poses a significant challenge to static batch machine learning algorithms employed in malware detection systems, as they quickly become obsolete. Despite this challenge, the existing literature pays limited attention to addressing this issue, with many advanced Android malware detection approaches, such as Drebin, DroidDet and MaMaDroid, relying on static models. In this work, we show how retraining techniques are able to maintain detector capabilities over time. Particularly, we analyze the effect of two aspects in the efficiency and performance of the detectors: 1) the frequency with which the models are retrained, and 2) the data used for retraining. In the first experiment, we compare periodic retraining with a more advanced concept drift detection method that triggers retraining only when necessary. In the second experiment, we analyze…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Malware Detection Techniques · Biosensors and Analytical Detection
