On labeling Android malware signatures using minhashing and further classification with Structural Equation Models
Ignacio Mart\'in, Jos\'e Alberto Hern\'andez, Sergio de los Santos

TL;DR
This paper analyzes over 250,000 Android malware signatures from 61 antivirus engines, classifies malware into categories, and uses structural equation modeling to evaluate antivirus effectiveness and malware classification.
Contribution
It introduces a novel approach combining minhashing, community detection, and structural equation models to analyze antivirus signatures and malware categories.
Findings
Identified 41 malware classes grouped into three categories
Determined which antivirus engines are most effective for each category
Showed how models can predict malware type based on signatures
Abstract
Multi-scanner Antivirus systems provide insightful information on the nature of a suspect application; however there is often a lack of consensus and consistency between different Anti-Virus engines. In this article, we analyze more than 250 thousand malware signatures generated by 61 different Anti-Virus engines after analyzing 82 thousand different Android malware applications. We identify 41 different malware classes grouped into three major categories, namely Adware, Harmful Threats and Unknown or Generic signatures. We further investigate the relationships between such 41 classes using community detection algorithms from graph theory to identify similarities between them; and we finally propose a Structure Equation Model to identify which Anti-Virus engines are more powerful at detecting each macro-category. As an application, we show how such models can help in identifying whether…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Software Testing and Debugging Techniques
