Adversarial Attacks on Foundational Vision Models

Nathan Inkawhich; Gwendolyn McDonald; Ryan Luley

arXiv:2308.14597·cs.CV·August 29, 2023·2 cites

Adversarial Attacks on Foundational Vision Models

Nathan Inkawhich, Gwendolyn McDonald, Ryan Luley

PDF

Open Access

TL;DR

This paper investigates vulnerabilities in large, pretrained vision models like CLIP and DINOv2, demonstrating effective adversarial attacks that manipulate model outputs and highlight the need for robustness improvements.

Contribution

It introduces novel adversarial attack methods targeting foundational vision models, revealing their susceptibility to manipulation in both whitebox and blackbox scenarios.

Findings

01

Attacks can fool models into misclassifying in-distribution images as out-of-distribution.

02

Methods transfer effectively across different foundational model types.

03

Attacks are successful under low-knowledge threat models.

Abstract

Rapid progress is being made in developing large, pretrained, task-agnostic foundational vision models such as CLIP, ALIGN, DINOv2, etc. In fact, we are approaching the point where these models do not have to be finetuned downstream, and can simply be used in zero-shot or with a lightweight probing head. Critically, given the complexity of working at this scale, there is a bottleneck where relatively few organizations in the world are executing the training then sharing the models on centralized platforms such as HuggingFace and torch.hub. The goal of this work is to identify several key adversarial vulnerabilities of these models in an effort to make future designs more robust. Intuitively, our attacks manipulate deep feature representations to fool an out-of-distribution (OOD) detector which will be required when using these open-world-aware models to solve closed-set downstream…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research

MethodsALIGN · Contrastive Language-Image Pre-training