A Non-Intrusive Machine Learning Solution for Malware Detection and Data   Theft Classification in Smartphones

Sai Vishwanath Venkatesh; Prasanna D. Kumaran; Joish J Bosco; Pravin; R. Kumaar; Vineeth Vijayaraghavan

arXiv:2102.06511·cs.CR·February 15, 2021

A Non-Intrusive Machine Learning Solution for Malware Detection and Data Theft Classification in Smartphones

Sai Vishwanath Venkatesh, Prasanna D. Kumaran, Joish J Bosco, Pravin, R. Kumaar, Vineeth Vijayaraghavan

PDF

TL;DR

This paper presents a non-intrusive machine learning approach that detects malware in smartphones and classifies stolen data types with high accuracy, using Android usage data from the SherLock framework.

Contribution

It introduces a novel, accessible ML solution for simultaneous malware detection and data theft classification on smartphones, utilizing publicly available Android usage data.

Findings

01

Less than 9% inaccuracy in malware detection

02

83% certainty in data theft classification

03

Effective on real-world multi-user data

Abstract

Smartphones contain information that is more sensitive and personal than those found on computers and laptops. With an increase in the versatility of smartphone functionality, more data has become vulnerable and exposed to attackers. Successful mobile malware attacks could steal a user's location, photos, or even banking information. Due to a lack of post-attack strategies firms also risk going out of business due to data theft. Thus, there is a need besides just detecting malware intrusion in smartphones but to also identify the data that has been stolen to assess, aid in recovery and prevent future attacks. In this paper, we propose an accessible, non-intrusive machine learning solution to not only detect malware intrusion but also identify the type of data stolen for any app under supervision. We do this with Android usage data obtained by utilising publicly available data collection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.