Neural Network Repair with Reachability Analysis
Xiaodong Yang, Tom Yamaguchi, Hoang-Dung Tran, Bardh Hoxha, Taylor T, Johnson, Danil Prokhorov

TL;DR
This paper introduces a reachability analysis-based framework to repair unsafe neural networks in safety-critical systems, improving safety and robustness with minimal performance loss.
Contribution
It presents a novel repair method that uses reachability analysis to identify unsafe regions and enhance adversarial training, improving safety in neural networks.
Findings
Successfully repaired all safety instances in benchmarks.
Achieved five-fold faster reachability analysis runtime.
Doubled memory efficiency in analysis process.
Abstract
Safety is a critical concern for the next generation of autonomy that is likely to rely heavily on deep neural networks for perception and control. Formally verifying the safety and robustness of well-trained DNNs and learning-enabled systems under attacks, model uncertainties, and sensing errors is essential for safe autonomy. This research proposes a framework to repair unsafe DNNs in safety-critical systems with reachability analysis. The repair process is inspired by adversarial training which has demonstrated high effectiveness in improving the safety and robustness of DNNs. Different from traditional adversarial training approaches where adversarial examples are utilized from random attacks and may not be representative of all unsafe behaviors, our repair process uses reachability analysis to compute the exact unsafe regions and identify sufficiently representative examples to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Cardiac Arrest and Resuscitation
MethodsRepair · Random Convolutional Kernel Transform
