A Symbolic Neural Network Representation and its Application to Understanding, Verifying, and Patching Networks
Matthew Sotoudeh, Aditya V. Thakur

TL;DR
This paper introduces a symbolic representation for piecewise-linear neural networks that enables analysis, verification, and patching, facilitating better understanding and manipulation of complex neural network behaviors.
Contribution
The paper presents a novel symbolic representation for neural networks that simplifies analysis and enables new applications like verification and patching.
Findings
Exact visualization of network advisories for aircraft collision avoidance.
Performed bounded model checking on neural network controllers.
Demonstrated neural network patching to correct specific behaviors.
Abstract
Analysis and manipulation of trained neural networks is a challenging and important problem. We propose a symbolic representation for piecewise-linear neural networks and discuss its efficient computation. With this representation, one can translate the problem of analyzing a complex neural network into that of analyzing a finite set of affine functions. We demonstrate the use of this representation for three applications. First, we apply the symbolic representation to computing weakest preconditions on network inputs, which we use to exactly visualize the advisories made by a network meant to operate an aircraft collision avoidance system. Second, we use the symbolic representation to compute strongest postconditions on the network outputs, which we use to perform bounded model checking on standard neural network controllers. Finally, we show how the symbolic representation can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Neural Networks and Applications · Machine Learning and Algorithms
