Op2Vec: An Opcode Embedding Technique and Dataset Design for End-to-End Detection of Android Malware
Kaleem Nawaz Khan, Najeeb Ullah, Sikandar Ali, Muhammad Salman Khan,, Mohammad Nauman, Anwar Ghani

TL;DR
This paper introduces Op2Vec, a novel opcode embedding technique, and a dataset for automated Android malware detection, achieving high accuracy and reducing the need for manual feature engineering.
Contribution
The paper presents Op2Vec, a new opcode embedding method, and a comprehensive dataset for end-to-end Android malware detection using deep learning.
Findings
Achieved an average detection accuracy of 97.47%
Precision of 0.976 and F1 score of 0.979
Reduced reliance on handcrafted features
Abstract
Android is one of the leading operating systems for smart phones in terms of market share and usage. Unfortunately, it is also an appealing target for attackers to compromise its security through malicious applications. To tackle this issue, domain experts and researchers are trying different techniques to stop such attacks. All the attempts of securing Android platform are somewhat successful. However, existing detection techniques have severe shortcomings, including the cumbersome process of feature engineering. Designing representative features require expert domain knowledge. There is a need for minimizing human experts' intervention by circumventing handcrafted feature engineering. Deep learning could be exploited by extracting deep features automatically. Previous work has shown that operational codes (opcodes) of executables provide key information to be used with deep learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
