Towards Reliable Neural Specifications
Chuqin Geng, Nham Le, Xiaojie Xu, Zhaoyue Wang, Arie Gurfinkel, Xujie, Si

TL;DR
This paper introduces neural activation patterns (NAPs) as a new, more reliable specification method for neural networks, enabling larger verifiable regions and better practical applicability than existing data-based specifications.
Contribution
The paper proposes NAPs as a novel specification approach using neural activation patterns, improving verification scope and reliability over traditional data-centric methods.
Findings
Verified larger input regions on MNIST and CIFAR10 datasets.
Achieved 84% data recall on MNIST with NAPs.
Expanded verifiable bounds to ten times larger on CIFAR10.
Abstract
Having reliable specifications is an unavoidable challenge in achieving verifiable correctness, robustness, and interpretability of AI systems. Existing specifications for neural networks are in the paradigm of data as specification. That is, the local neighborhood centering around a reference input is considered to be correct (or robust). While existing specifications contribute to verifying adversarial robustness, a significant problem in many research domains, our empirical study shows that those verified regions are somewhat tight, and thus fail to allow verification of test set inputs, making them impractical for some real-world applications. To this end, we propose a new family of specifications called neural representation as specification, which uses the intrinsic information of neural networks - neural activation patterns (NAPs), rather than input data to specify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
Methodsfail · Test · High-Order Consensuses
