Polynomial Time Cryptanalytic Extraction of Deep Neural Networks in the Hard-Label Setting
Nicholas Carlini, Jorge Ch\'avez-Saab, Anna Hambitzer, Francisco, Rodr\'iguez-Henr\'iquez, Adi Shamir

TL;DR
This paper presents a novel polynomial-time cryptanalytic method to extract parameters from deep neural networks in the hard-label black-box setting, significantly advancing the security understanding of DNNs.
Contribution
It introduces the first polynomial-time attack in the hard-label setting, enabling extraction of nearly one million parameters from complex DNNs.
Findings
Successfully extracted parameters from a CIFAR-10 trained DNN with 832 neurons.
Demonstrated that decision boundary geometry reveals all network weights.
Achieved polynomial query complexity and computational efficiency.
Abstract
Deep neural networks (DNNs) are valuable assets, yet their public accessibility raises security concerns about parameter extraction by malicious actors. Recent work by Carlini et al. (crypto'20) and Canales-Mart\'inez et al. (eurocrypt'24) has drawn parallels between this issue and block cipher key extraction via chosen plaintext attacks. Leveraging differential cryptanalysis, they demonstrated that all the weights and biases of black-box ReLU-based DNNs could be inferred using a polynomial number of queries and computational time. However, their attacks relied on the availability of the exact numeric value of output logits, which allowed the calculation of their derivatives. To overcome this limitation, Chen et al. (asiacrypt'24) tackled the more realistic hard-label scenario, where only the final classification label (e.g., "dog" or "car") is accessible to the attacker. They proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Neural Networks and Applications · Statistical and Computational Modeling
MethodsSparse Evolutionary Training
