Enhancing the Insertion of NOP Instructions to Obfuscate Malware via Deep Reinforcement Learning
Daniel Gibert, Matt Fredrikson, Carles Mateu, Jordi Planes, and Quan Le

TL;DR
This paper introduces a deep reinforcement learning framework that strategically inserts dead code into malware to evade detection by neural network classifiers, significantly reducing their accuracy and achieving perfect evasion rates.
Contribution
It presents a novel double Q-network based method for optimizing dead code insertion to fool malware classifiers, demonstrating high evasion success and efficiency improvements.
Findings
Classifier accuracy drops to 56.53%
Evasion rate reaches 100% for targeted malware families
33% reduction in instructions needed for successful evasion
Abstract
Current state-of-the-art research for tackling the problem of malware detection and classification is centered on the design, implementation and deployment of systems powered by machine learning because of its ability to generalize to never-before-seen malware families and polymorphic mutations. However, it has been shown that machine learning models, in particular deep neural networks, lack robustness against crafted inputs (adversarial examples). In this work, we have investigated the vulnerability of a state-of-the-art shallow convolutional neural network malware classifier against the dead code insertion technique. We propose a general framework powered by a Double Q-network to induce misclassification over malware families. The framework trains an agent through a convolutional neural network to select the optimal positions in a code sequence to insert dead code instructions so that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
