Data Augmentation Techniques to Reverse-Engineer Neural Network Weights from Input-Output Queries
Alexander Beiser, Flavio Martinelli, Wulfram Gerstner, Johanni Brea

TL;DR
This paper introduces novel data augmentation techniques tailored for reverse-engineering neural network weights from input-output queries, significantly improving the ability to recover larger networks than previously possible.
Contribution
The authors propose new augmentation methods specifically designed to sample neural network representations, enabling the recovery of networks with up to 100 times more parameters than training data.
Findings
Standard augmentations offer little improvement.
Tailored augmentations extend recoverable network size.
Able to recover networks with 100x more parameters than data points.
Abstract
Network weights can be reverse-engineered given enough informative samples of a network's input-output function. In a teacher-student setup, this translates into collecting a dataset of the teacher mapping -- querying the teacher -- and fitting a student to imitate such mapping. A sensible choice of queries is the dataset the teacher is trained on. But current methods fail when the teacher parameters are more numerous than the training data, because the student overfits to the queries instead of aligning its parameters to the teacher. In this work, we explore augmentation techniques to best sample the input-output mapping of a teacher network, with the goal of eliciting a rich set of representations from the teacher hidden layers. We discover that standard augmentations such as rotation, flipping, and adding noise, bring little to no improvement to the identification problem. We design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
