Dynamic detection of mobile malware using smartphone data and machine learning
J.S. Panman de Wit, J. van der Ham, D. Bucur

TL;DR
This paper evaluates machine learning techniques using dynamic hardware features to detect mobile malware on Android devices without privileged access, achieving high classification accuracy across multiple malware subtypes.
Contribution
It provides an empirical analysis of ML classifiers on real-world data, highlighting the effectiveness of Random Forest and other models for malware detection using non-privileged device features.
Findings
Random Forest achieves an F1 score of 0.73 for malware detection.
All classifiers maintain low false positive rates below 0.02.
Detection performance varies across malware subtypes, with FNR below 0.33.
Abstract
Mobile malware are malicious programs that target mobile devices. They are an increasing problem, as seen in the rise of detected mobile malware samples per year. The number of active smartphone users is expected to grow, stressing the importance of research on the detection of mobile malware. Detection methods for mobile malware exist but are still limited. In this paper, we provide an overview of the performance of machine learning (ML) techniques to detect malware on Android, without using privileged access. The ML-classifiers use device information such as the CPU usage, battery usage, and memory usage for the detection of 10 subtypes of Mobile Trojans on the Android Operating System (OS). We use a real-life dataset containing device and malware data from 47 users for a year (2016). We examine which features, i.e. aspects, of a device, are most important to monitor to detect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
