CertViT: Certified Robustness of Pre-Trained Vision Transformers

Kavya Gupta; Sagar Verma

arXiv:2302.10287·cs.CV·February 22, 2023

CertViT: Certified Robustness of Pre-Trained Vision Transformers

Kavya Gupta, Sagar Verma

PDF

Open Access 1 Repo

TL;DR

CertViT introduces a scalable method to achieve certified robustness in large pre-trained vision transformers by fine-tuning with a two-step proximal-projection approach, improving robustness without extensive retraining.

Contribution

The paper proposes CertViT, a novel two-step proximal-projection method enabling certified robustness of large pre-trained vision transformers without full retraining.

Findings

01

CertViT outperforms state-of-the-art Lipschitz-trained networks in certified accuracy.

02

The method effectively enhances adversarial robustness of various vision transformer variants.

03

CertViT maintains high clean accuracy while providing robustness guarantees.

Abstract

Lipschitz bounded neural networks are certifiably robust and have a good trade-off between clean and certified accuracy. Existing Lipschitz bounding methods train from scratch and are limited to moderately sized networks (< 6M parameters). They require a fair amount of hyper-parameter tuning and are computationally prohibitive for large networks like Vision Transformers (5M to 660M parameters). Obtaining certified robustness of transformers is not feasible due to the non-scalability and inflexibility of the current methods. This work presents CertViT, a two-step proximal-projection method to achieve certified robustness from pre-trained weights. The proximal step tries to lower the Lipschitz bound and the projection step tries to maintain the clean accuracy of pre-trained weights. We show that CertViT networks have better certified accuracy than state-of-the-art Lipschitz trained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sagarverma/transformer-lipschitz
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning