Inverting Neural Networks: New Methods to Generate Neural Network Inputs from Prescribed Outputs
Rebecca Pattichis, Sebastian Janampa, Constantinos S. Pattichis, Marios S. Pattichis

TL;DR
This paper introduces two novel methods for inverting neural networks to generate input images from specific outputs, revealing vulnerabilities and providing deeper understanding of network decision boundaries.
Contribution
The paper presents two new general methods for solving the inverse problem in neural networks, enabling the generation of input images from prescribed outputs.
Findings
Methods produce random-like images with high classification accuracy
Reveals vulnerabilities in neural network architectures
Applicable to transformer and linear layer networks
Abstract
Neural network systems describe complex mappings that can be very difficult to understand. In this paper, we study the inverse problem of determining the input images that get mapped to specific neural network classes. Ultimately, we expect that these images contain recognizable features that are associated with their corresponding class classifications. We introduce two general methods for solving the inverse problem. In our forward pass method, we develop an inverse method based on a root-finding algorithm and the Jacobian with respect to the input image. In our backward pass method, we iteratively invert each layer, at the top. During the inversion process, we add random vectors sampled from the null-space of each linear layer. We demonstrate our new methods on both transformer architectures and sequential networks based on linear layers. Unlike previous methods, we show that our new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
