Improving the Certified Robustness of Neural Networks via Consistency Regularization
Mengting Xu, Tao Zhang, Zhongnian Li, Daoqiang Zhang

TL;DR
This paper introduces a novel regularization method called MAAR that improves the certified robustness of neural networks by addressing the inconsistency in robustness constraints between correctly classified and misclassified examples.
Contribution
The paper proposes a new consistency regularization term, MAAR, that enhances certified robustness by leveraging misclassified examples more effectively.
Findings
MAAR achieves superior certified robustness on CIFAR-10 and MNIST.
MAAR maintains comparable accuracy to state-of-the-art methods.
The approach effectively utilizes misclassified examples to improve robustness.
Abstract
A range of defense methods have been proposed to improve the robustness of neural networks on adversarial examples, among which provable defense methods have been demonstrated to be effective to train neural networks that are certifiably robust to the attacker. However, most of these provable defense methods treat all examples equally during training process, which ignore the inconsistent constraint of certified robustness between correctly classified (natural) and misclassified examples. In this paper, we explore this inconsistency caused by misclassified examples and add a novel consistency regularization term to make better use of the misclassified examples. Specifically, we identified that the certified robustness of network can be significantly improved if the constraint of certified robustness on misclassified examples and correctly classified examples is consistent. Motivated by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsAttentive Walk-Aggregating Graph Neural Network
