Attacking Attention of Foundation Models Disrupts Downstream Tasks

Hondamunige Prasanna Silva; Federico Becattini; Lorenzo Seidenari

arXiv:2506.05394·cs.CR·September 15, 2025

Attacking Attention of Foundation Models Disrupts Downstream Tasks

Hondamunige Prasanna Silva, Federico Becattini, Lorenzo Seidenari

PDF

Open Access 1 Repo

TL;DR

This paper reveals that attacking the attention mechanisms of foundation models like CLIP and ViTs can significantly disrupt various downstream tasks, highlighting critical security vulnerabilities in these models.

Contribution

It introduces a novel, task-agnostic attack targeting transformer attention structures and demonstrates its effectiveness across multiple downstream applications.

Findings

01

Attacks significantly disrupt downstream task performance

02

Transferability of adversarial examples to various tasks

03

Vulnerabilities in transformer attention mechanisms

Abstract

Foundation models represent the most prominent and recent paradigm shift in artificial intelligence. Foundation models are large models, trained on broad data that deliver high accuracy in many downstream tasks, often without fine-tuning. For this reason, models such as CLIP , DINO or Vision Transfomers (ViT), are becoming the bedrock of many industrial AI-powered applications. However, the reliance on pre-trained foundation models also introduces significant security concerns, as these models are vulnerable to adversarial attacks. Such attacks involve deliberately crafted inputs designed to deceive AI systems, jeopardizing their reliability. This paper studies the vulnerabilities of vision foundation models, focusing specifically on CLIP and ViTs, and explores the transferability of adversarial attacks to downstream tasks. We introduce a novel attack, targeting the structure of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hondamunigeprasannasilva/attack-attention
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)

MethodsLinear Layer · Softmax · Attention Is All You Need · Multi-Head Attention · Dense Connections · Residual Connection · Layer Normalization · Contrastive Language-Image Pre-training · Vision Transformer · self-DIstillation with NO labels