Dynamic detection of mobile malware using smartphone data and machine   learning

J.S. Panman de Wit; J. van der Ham; D. Bucur

arXiv:2107.11167·cs.CR·February 15, 2022

Dynamic detection of mobile malware using smartphone data and machine learning

J.S. Panman de Wit, J. van der Ham, D. Bucur

PDF

TL;DR

This paper evaluates machine learning techniques using dynamic hardware features to detect mobile malware on Android devices without privileged access, achieving high classification accuracy across multiple malware subtypes.

Contribution

It provides an empirical analysis of ML classifiers on real-world data, highlighting the effectiveness of Random Forest and other models for malware detection using non-privileged device features.

Findings

01

Random Forest achieves an F1 score of 0.73 for malware detection.

02

All classifiers maintain low false positive rates below 0.02.

03

Detection performance varies across malware subtypes, with FNR below 0.33.

Abstract

Mobile malware are malicious programs that target mobile devices. They are an increasing problem, as seen in the rise of detected mobile malware samples per year. The number of active smartphone users is expected to grow, stressing the importance of research on the detection of mobile malware. Detection methods for mobile malware exist but are still limited. In this paper, we provide an overview of the performance of machine learning (ML) techniques to detect malware on Android, without using privileged access. The ML-classifiers use device information such as the CPU usage, battery usage, and memory usage for the detection of 10 subtypes of Mobile Trojans on the Android Operating System (OS). We use a real-life dataset containing device and malware data from 47 users for a year (2016). We examine which features, i.e. aspects, of a device, are most important to monitor to detect…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.