ADVERSARIALuscator: An Adversarial-DRL Based Obfuscator and Metamorphic Malware SwarmGenerator
Mohit Sewak, Sanjay K. Sahay, Hemant Rathore

TL;DR
ADVERSARIALuscator is a novel adversarial deep reinforcement learning system that generates metamorphic malware variants at the opcode level, enhancing evasion capabilities and aiding in cybersecurity defense testing.
Contribution
It introduces the first Markov Decision Process-based approach for opcode-level malware obfuscation and employs continuous action control with deep RL like PPO in cybersecurity.
Findings
Increased metamorphic probability of malware by over 0.45.
Over 33% of generated malware variants evaded advanced IDS.
Demonstrated effectiveness of adversarial DRL in creating potent malware obfuscations.
Abstract
Advanced metamorphic malware and ransomware, by using obfuscation, could alter their internal structure with every attack. If such malware could intrude even into any of the IoT networks, then even if the original malware instance gets detected, by that time it can still infect the entire network. It is challenging to obtain training data for such evasive malware. Therefore, in this paper, we present ADVERSARIALuscator, a novel system that uses specialized Adversarial-DRL to obfuscate malware at the opcode level and create multiple metamorphic instances of the same. To the best of our knowledge, ADVERSARIALuscator is the first-ever system that adopts the Markov Decision Process-based approach to convert and find a solution to the problem of creating individual obfuscations at the opcode level. This is important as the machine language level is the least at which functionality could be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
