A3Ident: A Two-phased Approach to Identify the Leading Authors of Android Apps
Wei Wang, Guozhu Meng, Haoyu Wang, Kai Chen, Weimin Ge and, Xiaohong Li

TL;DR
This paper presents A3Ident, a two-phased method for identifying primary authors of Android apps by analyzing code relationships and developer coding habits, achieving high accuracy even on obfuscated apps.
Contribution
The paper introduces a novel two-phased approach combining package relationship analysis and coding style features for authorship attribution in Android apps, addressing challenges from third-party libraries and inheritance.
Findings
Achieves 92.5% accuracy on original datasets.
Classifies obfuscated apps with 80.4% accuracy.
Effectively clusters packages by authorship using new algorithms.
Abstract
Authorship identification is the process of identifying and classifying authors through given codes. Authorship identification can be used in a wide range of software domains, e.g., code authorship disputes, plagiarism detection, exposure of attackers' identity. Besides the inherent challenges from legacy software development, framework programming and crowdsourcing mode in Android raise the difficulties of authorship identification significantly. More specifically, widespread third party libraries and inherited components (e.g., classes, methods, and variables) dilute the primary code within the entire Android app and blur the boundaries of code written by different authors. However, prior research has not well addressed these challenges. To this end, we design a two-phased approach to attribute the primary code of an Android app to the specific developer. In the first phase, we put…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Spam and Phishing Detection
