MalVis: A Large-Scale Image-Based Framework and Dataset for Advancing Android Malware Classification

Saleh J. Makkawy; Michael J. De Lucia; Kenneth E. Barner

arXiv:2505.12106·cs.CR·May 20, 2025

MalVis: A Large-Scale Image-Based Framework and Dataset for Advancing Android Malware Classification

Saleh J. Makkawy, Michael J. De Lucia, Kenneth E. Barner

PDF

Open Access 1 Repo

TL;DR

MalVis introduces a large-scale dataset and a visualization framework that significantly improve Android malware classification accuracy and interpretability using deep learning and ensemble strategies.

Contribution

This paper presents MalVis, a novel visualization framework and extensive dataset that enhance malware detection and interpretability over existing methods.

Findings

01

Achieved 95.19% accuracy in malware classification

02

Demonstrated improved interpretability of malicious features

03

Validated effectiveness across multiple CNN models

Abstract

As technology advances, Android malware continues to pose significant threats to devices and sensitive data. The open-source nature of the Android OS and the availability of its SDK contribute to this rapid growth. Traditional malware detection techniques, such as signature-based, static, and dynamic analysis, struggle to detect obfuscated threats that use encryption, packing, or compression. While deep learning (DL)-based visualization methods have been proposed, they often fail to highlight the critical malicious features effectively. This research introduces MalVis, a unified visualization framework that integrates entropy and N-gram analysis to emphasize structural and anomalous patterns in malware bytecode. MalVis addresses key limitations of prior methods, including insufficient feature representation, poor interpretability, and limited data accessibility. The framework leverages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

makkawysaleh/MalVis
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Digital and Cyber Forensics · Software Engineering Research