McNdroid: A Longitudinal Multimodal Benchmark for Robust Drift Detection in Android Malware
Md Mahmuduzzaman Kamol, Jesus Lopez, Saeefa Rubaiyet Nowmi, Emilia Rivas, Md Ahsanul Haque, Edward Raff, Aritran Piplai, Mohammad Saidur Rahman

TL;DR
McNdroid is a comprehensive longitudinal benchmark for Android malware detection that evaluates model robustness over time using multimodal data and analyzes drift effects across multiple feature modalities.
Contribution
It introduces the largest multimodal Android malware benchmark spanning 2013-2025, enabling analysis of temporal drift and multimodal fusion effectiveness in malware detection.
Findings
Multimodal fusion outperforms single modalities over long-term temporal gaps.
Temporal degradation affects both individual features and inter-modality agreement.
Drift impacts model explanations and malware-family evolution.
Abstract
Machine learning (ML) in real-world systems must contend with concept drift, adversarial actors, and a spectrum of potential features with varying costs and benefits. Malware naturally exhibits all of these complexities, but for the same reason, it is challenging to curate and organize data to study these factors. We present McNdroid, to our knowledge the largest longitudinal multimodal Android malware benchmark for malware detection and drift analysis. McNdroid spans 2013--2025, excluding 2015, and represents each application with three aligned modalities--static features from manifests and smali code, dynamic behavioral features from sandbox execution, and graph-based features from function-call graphs. Using temporally separated splits, we evaluate standard ML and deep-learning detectors across increasing train--test time gaps. Results show clear temporal degradation, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
