Adaptive Verifiable Training Using Pairwise Class Similarity
Shiqi Wang, Kevin Eykholt, Taesung Lee, Jiyong Jang, and Ian Molloy

TL;DR
This paper introduces a novel method that leverages class similarity to enhance verifiable adversarial training, resulting in more robust and accurate neural networks across multiple datasets.
Contribution
It proposes a new approach using class clustering and tailored robustness criteria to improve verifiable training performance and robustness in neural networks.
Findings
Improves clean accuracy by up to 30.89% on CIFAR10.
Enhances robustness and performance on Fashion-MNIST and CIFAR100.
Introduces two methods: inter-group robustness prioritization and neural decision trees.
Abstract
Verifiable training has shown success in creating neural networks that are provably robust to a given amount of noise. However, despite only enforcing a single robustness criterion, its performance scales poorly with dataset complexity. On CIFAR10, a non-robust LeNet model has a 21.63% error rate, while a model created using verifiable training and a L-infinity robustness criterion of 8/255, has an error rate of 57.10%. Upon examination, we find that when labeling visually similar classes, the model's error rate is as high as 61.65%. We attribute the loss in performance to inter-class similarity. Similar classes (i.e., close in the feature space) increase the difficulty of learning a robust model. While it's desirable to train a robust model for a large robustness region, pairwise class similarities limit the potential gains. Also, consideration must be made regarding the relative cost…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
