SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers

Bhavna Gopal; Huanrui Yang; Mark Horton; Yiran Chen

arXiv:2501.01529·cs.CV·January 6, 2025

Open Access

TL;DR

SAFER is a layer-selective fine-tuning method for vision transformers (ViTs). It improves adversarial robustness by applying sharpness-aware minimization only to the small subset of layers most susceptible to adversarial overfitting, while the rest of the model stays frozen.

Contribution

The paper introduces SAFER, a layer-selective fine-tuning approach that mitigates adversarial overfitting in ViTs by selectively optimizing vulnerable layers.

Findings

1. Enhances clean and adversarial accuracy by around 5% on average.
2. Achieves up to 20% improvement in certain ViT architectures.
3. Effective across various datasets and model architectures.

Abstract

Vision transformers (ViTs) have become essential backbones in advanced computer vision applications and multi-modal foundation models. Despite their strengths, ViTs remain vulnerable to adversarial perturbations, comparable to or even exceeding the vulnerability of convolutional neural networks (CNNs). Furthermore, the large parameter count and complex architecture of ViTs make them particularly prone to adversarial overfitting, often compromising both clean and adversarial accuracy. This paper mitigates adversarial overfitting in ViTs through a novel, layer-selective fine-tuning approach: SAFER. Instead of optimizing the entire model, we identify and selectively fine-tune a small subset of layers most susceptible to overfitting, applying sharpness-aware minimization to these layers while freezing the rest of the model. Our method consistently enhances both clean and adversarial…
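The core idea in the abstract, sharpness-aware minimization (SAM) applied to a chosen subset of layers with the rest frozen, can be illustrated with a small sketch. This is not the authors' implementation: the excerpt does not say how the vulnerable layers are identified, so the `selected` set is simply taken as given, and the toy quadratic loss, the layer names `block0` and `block1`, the helper `sam_step`, and the values of `lr` and `rho` are all hypothetical choices made for illustration. The update itself follows the standard two-step SAM recipe: take the gradient, step to a nearby worst-case point within an L2 ball of radius `rho`, then descend from the original weights using the gradient measured at that point.

```python
import math


def sam_step(params, grad_fn, selected, lr=0.1, rho=0.05):
    """One SAM update restricted to the layers named in `selected`.

    params:   dict mapping layer name -> list of float weights
    grad_fn:  callable(params) -> dict of the same shape holding gradients
    selected: set of layer names to fine-tune; every other layer is frozen
    """
    # 1) Gradient at the current weights.
    grads = grad_fn(params)

    # 2) L2 norm of the gradient, taken over the selected layers only.
    norm = math.sqrt(sum(g * g for name in selected for g in grads[name]))
    if norm == 0.0:
        # Already at a stationary point for the selected layers.
        return {name: list(ws) for name, ws in params.items()}

    # 3) Ascent step: perturb only the selected layers by rho * g / ||g||.
    perturbed = {
        name: ([w + rho * g / norm for w, g in zip(ws, grads[name])]
               if name in selected else list(ws))
        for name, ws in params.items()
    }

    # 4) Gradient at the perturbed weights.
    sharp_grads = grad_fn(perturbed)

    # 5) Descent step from the ORIGINAL weights using the perturbed gradient.
    #    Frozen layers are copied through unchanged.
    return {
        name: ([w - lr * g for w, g in zip(ws, sharp_grads[name])]
               if name in selected else list(ws))
        for name, ws in params.items()
    }


# Toy stand-in for a model (assumption): each "layer" has a quadratic loss
# 0.5 * c * w^2 with its own curvature c.
CURVATURE = {"block0": 1.0, "block1": 4.0}


def toy_loss(params):
    return sum(0.5 * CURVATURE[n] * w * w for n, ws in params.items() for w in ws)


def toy_grad(params):
    return {n: [CURVATURE[n] * w for w in ws] for n, ws in params.items()}


params = {"block0": [1.0, -2.0], "block1": [0.5, 0.5]}
selected = {"block1"}  # hypothetical choice of the layer to fine-tune
new_params = sam_step(params, toy_grad, selected)
```

In this toy run only `block1` changes and `block0` is returned untouched, which is the layer-selective behavior the abstract describes. In a real ViT the same pattern would operate on the parameters of the chosen transformer blocks, with the loss computed on adversarial examples; those details are outside what this excerpt specifies.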

Taxonomy

Topics: CCD and CMOS Imaging Sensors · Industrial Vision Systems and Defect Detection · Infrared Target Detection Methodologies

Methods: Sharpness-Aware Minimization