Attacking Attention of Foundation Models Disrupts Downstream Tasks
Hondamunige Prasanna Silva, Federico Becattini, Lorenzo Seidenari

TL;DR
This paper reveals that attacking the attention mechanisms of foundation models like CLIP and ViTs can significantly disrupt various downstream tasks, highlighting critical security vulnerabilities in these models.
Contribution
It introduces a novel, task-agnostic attack targeting transformer attention structures and demonstrates its effectiveness across multiple downstream applications.
Findings
Attacks significantly disrupt downstream task performance
Transferability of adversarial examples to various tasks
Vulnerabilities in transformer attention mechanisms
Abstract
Foundation models represent the most prominent and recent paradigm shift in artificial intelligence. Foundation models are large models, trained on broad data that deliver high accuracy in many downstream tasks, often without fine-tuning. For this reason, models such as CLIP , DINO or Vision Transfomers (ViT), are becoming the bedrock of many industrial AI-powered applications. However, the reliance on pre-trained foundation models also introduces significant security concerns, as these models are vulnerable to adversarial attacks. Such attacks involve deliberately crafted inputs designed to deceive AI systems, jeopardizing their reliability. This paper studies the vulnerabilities of vision foundation models, focusing specifically on CLIP and ViTs, and explores the transferability of adversarial attacks to downstream tasks. We introduce a novel attack, targeting the structure of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)
MethodsLinear Layer · Softmax · Attention Is All You Need · Multi-Head Attention · Dense Connections · Residual Connection · Layer Normalization · Contrastive Language-Image Pre-training · Vision Transformer · self-DIstillation with NO labels
