MAB-Malware: A Reinforcement Learning Framework for Attacking Static Malware Classifiers
Wei Song, Xuezixiang Li, Sadia Afroz, Deepali Garg, Dmitry Kuznetsov,, Heng Yin

TL;DR
This paper introduces a reinforcement learning framework for black-box adversarial attacks on malware classifiers, achieving high evasion rates and providing insights into attack transferability.
Contribution
It presents a novel RL-based black-box attack framework that models malware evasion as a multi-armed bandit problem, improving attack success rates against commercial antivirus systems.
Findings
Achieves 74-97% evasion rate on ML malware detectors.
Attains 32-48% evasion rate on commercial antivirus in black-box settings.
Higher transferability among ML classifiers than between ML and commercial AVs.
Abstract
Modern commercial antivirus systems increasingly rely on machine learning to keep up with the rampant inflation of new malware. However, it is well-known that machine learning models are vulnerable to adversarial examples (AEs). Previous works have shown that ML malware classifiers are fragile to the white-box adversarial attacks. However, ML models used in commercial antivirus products are usually not available to attackers and only return hard classification labels. Therefore, it is more practical to evaluate the robustness of ML models and real-world AVs in a pure black-box manner. We propose a black-box Reinforcement Learning (RL) based framework to generate AEs for PE malware classifiers and AV engines. It regards the adversarial attack problem as a multi-armed bandit problem, which finds an optimal balance between exploiting the successful patterns and exploring more varieties.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Network Security and Intrusion Detection
