Enhancing the Insertion of NOP Instructions to Obfuscate Malware via   Deep Reinforcement Learning

Daniel Gibert; Matt Fredrikson; Carles Mateu; Jordi Planes; and Quan Le

arXiv:2111.09626·cs.CR·June 14, 2022

Enhancing the Insertion of NOP Instructions to Obfuscate Malware via Deep Reinforcement Learning

Daniel Gibert, Matt Fredrikson, Carles Mateu, Jordi Planes, and Quan Le

PDF

TL;DR

This paper introduces a deep reinforcement learning framework that strategically inserts dead code into malware to evade detection by neural network classifiers, significantly reducing their accuracy and achieving perfect evasion rates.

Contribution

It presents a novel double Q-network based method for optimizing dead code insertion to fool malware classifiers, demonstrating high evasion success and efficiency improvements.

Findings

01

Classifier accuracy drops to 56.53%

02

Evasion rate reaches 100% for targeted malware families

03

33% reduction in instructions needed for successful evasion

Abstract

Current state-of-the-art research for tackling the problem of malware detection and classification is centered on the design, implementation and deployment of systems powered by machine learning because of its ability to generalize to never-before-seen malware families and polymorphic mutations. However, it has been shown that machine learning models, in particular deep neural networks, lack robustness against crafted inputs (adversarial examples). In this work, we have investigated the vulnerability of a state-of-the-art shallow convolutional neural network malware classifier against the dead code insertion technique. We propose a general framework powered by a Double Q-network to induce misclassification over malware families. The framework trains an agent through a convolutional neural network to select the optimal positions in a code sequence to insert dead code instructions so that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.