Efficiently Learning Any One Hidden Layer ReLU Network From Queries
Sitan Chen, Adam R Klivans, Raghu Meka

TL;DR
This paper presents the first polynomial-time algorithm for learning any one hidden layer ReLU neural network from queries with guarantees on efficiency and accuracy, even in worst-case and overparameterized settings.
Contribution
It introduces a novel polynomial-time algorithm for learning arbitrary one hidden layer ReLU networks from black-box queries with provable guarantees.
Findings
Algorithm has polynomial query complexity and runtime.
Achieves low square loss relative to the original network.
Works in overparameterized and worst-case scenarios.
Abstract
Model extraction attacks have renewed interest in the classic problem of learning neural networks from queries. In this work we give the first polynomial-time algorithm for learning arbitrary one hidden layer neural networks activations provided black-box access to the network. Formally, we show that if is an arbitrary one hidden layer neural network with ReLU activations, there is an algorithm with query complexity and running time that is polynomial in all parameters that outputs a network achieving low square loss relative to with respect to the Gaussian measure. While a number of works in the security literature have proposed and empirically demonstrated the effectiveness of certain algorithms for this problem, ours is the first with fully polynomial-time guarantees of efficiency even for worst-case networks (in particular our algorithm succeeds in the overparameterized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Machine Learning and Algorithms
