Op2Vec: An Opcode Embedding Technique and Dataset Design for End-to-End   Detection of Android Malware

Kaleem Nawaz Khan; Najeeb Ullah; Sikandar Ali; Muhammad Salman Khan,; Mohammad Nauman; Anwar Ghani

arXiv:2104.04798·cs.CR·May 31, 2022

Op2Vec: An Opcode Embedding Technique and Dataset Design for End-to-End Detection of Android Malware

Kaleem Nawaz Khan, Najeeb Ullah, Sikandar Ali, Muhammad Salman Khan,, Mohammad Nauman, Anwar Ghani

PDF

TL;DR

This paper introduces Op2Vec, a novel opcode embedding technique, and a dataset for automated Android malware detection, achieving high accuracy and reducing the need for manual feature engineering.

Contribution

The paper presents Op2Vec, a new opcode embedding method, and a comprehensive dataset for end-to-end Android malware detection using deep learning.

Findings

01

Achieved an average detection accuracy of 97.47%

02

Precision of 0.976 and F1 score of 0.979

03

Reduced reliance on handcrafted features

Abstract

Android is one of the leading operating systems for smart phones in terms of market share and usage. Unfortunately, it is also an appealing target for attackers to compromise its security through malicious applications. To tackle this issue, domain experts and researchers are trying different techniques to stop such attacks. All the attempts of securing Android platform are somewhat successful. However, existing detection techniques have severe shortcomings, including the cumbersome process of feature engineering. Designing representative features require expert domain knowledge. There is a need for minimizing human experts' intervention by circumventing handcrafted feature engineering. Deep learning could be exploited by extracting deep features automatically. Previous work has shown that operational codes (opcodes) of executables provide key information to be used with deep learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.