Polynomial Time Cryptanalytic Extraction of Deep Neural Networks in the   Hard-Label Setting

Nicholas Carlini; Jorge Ch\'avez-Saab; Anna Hambitzer; Francisco; Rodr\'iguez-Henr\'iquez; Adi Shamir

arXiv:2410.05750·cs.CR·October 10, 2024

Polynomial Time Cryptanalytic Extraction of Deep Neural Networks in the Hard-Label Setting

Nicholas Carlini, Jorge Ch\'avez-Saab, Anna Hambitzer, Francisco, Rodr\'iguez-Henr\'iquez, Adi Shamir

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel polynomial-time cryptanalytic method to extract parameters from deep neural networks in the hard-label black-box setting, significantly advancing the security understanding of DNNs.

Contribution

It introduces the first polynomial-time attack in the hard-label setting, enabling extraction of nearly one million parameters from complex DNNs.

Findings

01

Successfully extracted parameters from a CIFAR-10 trained DNN with 832 neurons.

02

Demonstrated that decision boundary geometry reveals all network weights.

03

Achieved polynomial query complexity and computational efficiency.

Abstract

Deep neural networks (DNNs) are valuable assets, yet their public accessibility raises security concerns about parameter extraction by malicious actors. Recent work by Carlini et al. (crypto'20) and Canales-Mart\'inez et al. (eurocrypt'24) has drawn parallels between this issue and block cipher key extraction via chosen plaintext attacks. Leveraging differential cryptanalysis, they demonstrated that all the weights and biases of black-box ReLU-based DNNs could be inferred using a polynomial number of queries and computational time. However, their attacks relied on the availability of the exact numeric value of output logits, which allowed the calculation of their derivatives. To overcome this limitation, Chen et al. (asiacrypt'24) tackled the more realistic hard-label scenario, where only the final classification label (e.g., "dog" or "car") is accessible to the attacker. They proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ai-lab-y/nn_cryptanalytic_extraction
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Neural Networks and Applications · Statistical and Computational Modeling

MethodsSparse Evolutionary Training