Enhancing Decision-Making in Windows PE Malware Classification During Dataset Shifts with Uncertainty Estimation
Rahul Yumlembam, Biju Issac, and Seibu Mary Jacob

TL;DR
This paper improves Windows PE malware classification robustness during dataset shifts by integrating ensemble-based uncertainty estimates with conformal prediction, significantly reducing misclassification acceptance rates in challenging scenarios.
Contribution
It introduces a novel ensemble-based uncertainty estimation method within conformal prediction to enhance malware classifier reliability under dataset shifts.
Findings
Ensemble-based uncertainty estimates reduce incorrect acceptance rates by ~30%.
The method maintains competitive correct acceptance rates under dataset shifts.
Effective in detecting packed malware with distributional differences.
Abstract
Artificial intelligence techniques have achieved strong performance in classifying Windows Portable Executable (PE) malware, but their reliability often degrades under dataset shifts, leading to misclassifications with severe security consequences. To address this, we enhance an existing LightGBM (LGBM) malware detector by integrating Neural Networks (NN), PriorNet, and Neural Network Ensembles, evaluated across three benchmark datasets: EMBER, BODMAS, and UCSB. The UCSB dataset, composed mainly of packed malware, introduces a substantial distributional shift relative to EMBER and BODMAS, making it a challenging testbed for robustness. We study uncertainty-aware decision strategies, including probability thresholding, PriorNet, ensemble-derived estimates, and Inductive Conformal Evaluation (ICE). Our main contribution is the use of ensemble-based uncertainty estimates as Non-Conformity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Security and Verification in Computing · Network Security and Intrusion Detection
