Interpretable by Design: MH-AutoML for Transparent and Efficient Android Malware Detection without Compromising Performance

Joner Assolin; Gabriel Canto; Diego Kreutz; Eduardo Feitosa; Hendrio Bragan\c{c}a; Angelo Nogueira; Vanderson Rocha

arXiv:2506.23314·cs.CR·July 1, 2025

Interpretable by Design: MH-AutoML for Transparent and Efficient Android Malware Detection without Compromising Performance

Joner Assolin, Gabriel Canto, Diego Kreutz, Eduardo Feitosa, Hendrio Bragan\c{c}a, Angelo Nogueira, Vanderson Rocha

PDF

Open Access

TL;DR

This paper introduces MH-AutoML, a domain-specific AutoML framework for Android malware detection that emphasizes interpretability, transparency, and efficiency, outperforming existing solutions in recall and control.

Contribution

The paper presents MH-AutoML, a novel AutoML framework tailored for Android malware detection that integrates interpretability and experiment traceability without sacrificing performance.

Findings

01

MH-AutoML achieves higher recall rates than seven established AutoML frameworks.

02

It provides enhanced transparency, interpretability, and control over the ML pipeline.

03

The framework maintains computational efficiency comparable to existing AutoML solutions.

Abstract

Malware detection in Android systems requires both cybersecurity expertise and machine learning (ML) techniques. Automated Machine Learning (AutoML) has emerged as an approach to simplify ML development by reducing the need for specialized knowledge. However, current AutoML solutions typically operate as black-box systems with limited transparency, interpretability, and experiment traceability. To address these limitations, we present MH-AutoML, a domain-specific framework for Android malware detection. MH-AutoML automates the entire ML pipeline, including data preprocessing, feature engineering, algorithm selection, and hyperparameter tuning. The framework incorporates capabilities for interpretability, debugging, and experiment tracking that are often missing in general-purpose solutions. In this study, we compare MH-AutoML against seven established AutoML frameworks: Auto-Sklearn,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Machine Learning and Data Classification