Et Tu Certifications: Robustness Certificates Yield Better Adversarial   Examples

Andrew C. Cullen; Shijie Liu; Paul Montague; Sarah M. Erfani; Benjamin; I.P. Rubinstein

arXiv:2302.04379·cs.LG·June 13, 2024·1 cites

Et Tu Certifications: Robustness Certificates Yield Better Adversarial Examples

Andrew C. Cullen, Shijie Liu, Paul Montague, Sarah M. Erfani, Benjamin, I.P. Rubinstein

PDF

Open Access 1 Repo

TL;DR

This paper introduces a certification-aware attack that exploits neural network robustness certificates to generate more effective adversarial examples, revealing potential security vulnerabilities in certification methods.

Contribution

It presents a novel attack method that leverages certification information to produce more efficient adversarial examples, challenging the assumption that certifications always enhance security.

Findings

01

The attack produces adversarial examples 74% more often than comparable methods.

02

Median perturbation norm is reduced by over 10% using the attack.

03

Releasing certifications can paradoxically decrease model security.

Abstract

In guaranteeing the absence of adversarial examples in an instance's neighbourhood, certification mechanisms play an important role in demonstrating neural net robustness. In this paper, we ask if these certifications can compromise the very models they help to protect? Our new \emph{Certification Aware Attack} exploits certifications to produce computationally efficient norm-minimising adversarial examples $74%$ more often than comparable attacks, while reducing the median perturbation norm by more than $10%$ . While these attacks can be used to assess the tightness of certification bounds, they also highlight that releasing certifications can paradoxically reduce security.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andrew-cullen/attacking-certified-robustness
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDisaster Response and Management

MethodsAttentive Walk-Aggregating Graph Neural Network