Android HIV: A Study of Repackaging Malware for Evading Machine-Learning   Detection

Xiao Chen; Chaoran Li; Derui Wang; Sheng Wen; Jun Zhang; Surya Nepal,; Yang Xiang; Kui Ren

arXiv:1808.04218·cs.CR·November 17, 2021

Android HIV: A Study of Repackaging Malware for Evading Machine-Learning Detection

Xiao Chen, Chaoran Li, Derui Wang, Sheng Wen, Jun Zhang, Surya Nepal,, Yang Xiang, Kui Ren

PDF

TL;DR

This paper presents a novel automated attack method that generates adversarial Android malware examples by applying optimal perturbations to APKs, successfully evading state-of-the-art machine learning detectors relying on semantic features.

Contribution

The study introduces a new attack technique that manipulates APKs at the bytecode level to deceive advanced malware detectors, surpassing previous methods limited to manifest modifications.

Findings

01

Detection rates dropped from 96% to 0% for MaMaDroid.

02

Detection rates dropped from 97% to 0% for Drebin.

03

Adversarial APKs can evade both syntactic and semantic feature-based detectors.

Abstract

Machine learning based solutions have been successfully employed for automatic detection of malware on Android. However, machine learning models lack robustness to adversarial examples, which are crafted by adding carefully chosen perturbations to the normal inputs. So far, the adversarial examples can only deceive detectors that rely on syntactic features (e.g., requested permissions, API calls, etc), and the perturbations can only be implemented by simply modifying application's manifest. While recent Android malware detectors rely more on semantic features from Dalvik bytecode rather than manifest, existing attacking/defending methods are no longer effective. In this paper, we introduce a new attacking method that generates adversarial examples of Android malware and evades being detected by the current models. To this end, we propose a method of applying optimal perturbations onto…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.