MalVis: A Large-Scale Image-Based Framework and Dataset for Advancing Android Malware Classification
Saleh J. Makkawy, Michael J. De Lucia, Kenneth E. Barner

TL;DR
MalVis introduces a large-scale dataset and a visualization framework that significantly improve Android malware classification accuracy and interpretability using deep learning and ensemble strategies.
Contribution
This paper presents MalVis, a novel visualization framework and extensive dataset that enhance malware detection and interpretability over existing methods.
Findings
Achieved 95.19% accuracy in malware classification
Demonstrated improved interpretability of malicious features
Validated effectiveness across multiple CNN models
Abstract
As technology advances, Android malware continues to pose significant threats to devices and sensitive data. The open-source nature of the Android OS and the availability of its SDK contribute to this rapid growth. Traditional malware detection techniques, such as signature-based, static, and dynamic analysis, struggle to detect obfuscated threats that use encryption, packing, or compression. While deep learning (DL)-based visualization methods have been proposed, they often fail to highlight the critical malicious features effectively. This research introduces MalVis, a unified visualization framework that integrates entropy and N-gram analysis to emphasize structural and anomalous patterns in malware bytecode. MalVis addresses key limitations of prior methods, including insufficient feature representation, poor interpretability, and limited data accessibility. The framework leverages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Digital and Cyber Forensics · Software Engineering Research
