NeuroUnlock: Unlocking the Architecture of Obfuscated Deep Neural Networks
Mahya Morid Ahmadi, Lilas Alrahis, Alessio Colucci, Ozgur Sinanoglu,, Muhammad Shafique

TL;DR
NeuroUnlock reveals vulnerabilities in current DNN obfuscation methods by effectively reversing them to recover original architectures, which enhances the success of subsequent attacks and highlights the need for more resilient obfuscation techniques.
Contribution
This work introduces NeuroUnlock, a novel sequence-to-sequence attack that can reverse state-of-the-art DNN obfuscation, and proposes ReDLock, a more resilient obfuscation method.
Findings
NeuroUnlock successfully recovers architectures of various obfuscated DNNs.
ReDLock increases resilience, reducing NeuroUnlock's success rate.
Recovered models maintain high accuracy with minimal performance loss.
Abstract
The advancements of deep neural networks (DNNs) have led to their deployment in diverse settings, including safety and security-critical applications. As a result, the characteristics of these models have become sensitive intellectual properties that require protection from malicious users. Extracting the architecture of a DNN through leaky side-channels (e.g., memory access) allows adversaries to (i) clone the model, and (ii) craft adversarial attacks. DNN obfuscation thwarts side-channel-based architecture stealing (SCAS) attacks by altering the run-time traces of a given DNN while preserving its functionality. In this work, we expose the vulnerability of state-of-the-art DNN obfuscation methods to these attacks. We present NeuroUnlock, a novel SCAS attack against obfuscated DNNs. Our NeuroUnlock employs a sequence-to-sequence model that learns the obfuscation procedure and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
