Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment

Alvi Md Ishmam; Christopher Thomas

arXiv:2411.15673·cs.CV·November 26, 2024

TL;DR

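This paper introduces Semantic Shield, a defense for contrastively trained vision-language models against backdoor and poisoning attacks. It constrains the attention the model pays to visual regions to be proportional to how well those regions align with external knowledge extracted from a language model.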

Contribution

It proposes a novel knowledge alignment technique that enhances model security without affecting inference, addressing vulnerabilities in contrastively trained vision-language models.

Findings

01 Effective defense against backdooring and poisoning attacks

02 Maintains model utility while improving security

03 Works across multiple datasets and architectures

Abstract

In recent years there has been enormous interest in vision-language models trained using self-supervised objectives. However, the use of large-scale datasets scraped from the web for training also makes these models vulnerable to potential security threats, such as backdooring and poisoning attacks. In this paper, we propose a method for mitigating such attacks on contrastively trained vision-language models. Our approach leverages external knowledge extracted from a language model to prevent models from learning correlations between image regions which lack strong alignment with external knowledge. We do this by imposing constraints to enforce that attention paid by the model to visual regions is proportional to the alignment of those regions with external knowledge. We conduct extensive experiments using a variety of recent backdooring and poisoning attacks on multiple datasets and…
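The attention constraint described in the abstract can be illustrated with a toy sketch. This is not the authors' loss, which the abstract does not specify. It assumes that each image patch and each knowledge element has an embedding, that a patch's alignment is its best cosine match against any knowledge element, and that a KL divergence stands in for the proportionality constraint. The names `alignment_target` and `attention_alignment_penalty`, and the `temperature` parameter, are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def alignment_target(patch_embs, knowledge_embs, temperature=0.1):
    """Turn patch-to-knowledge alignment into a distribution over patches.

    Assumption: a patch's alignment score is its best cosine match
    against any knowledge element embedding.
    """
    scores = [
        max(cosine(p, k) for k in knowledge_embs) / temperature
        for p in patch_embs
    ]
    return softmax(scores)


def attention_alignment_penalty(attention, target, eps=1e-12):
    """KL(target || attention): zero when attention matches the target.

    Assumption: KL divergence is an illustrative stand-in for the
    proportionality constraint, not the paper's actual objective.
    """
    return sum(
        t * math.log((t + eps) / (a + eps))
        for t, a in zip(target, attention)
    )


# Toy data: three patches, one knowledge element aligned with patch 0.
patches = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
knowledge = [[1.0, 0.0]]

target = alignment_target(patches, knowledge)
trigger_attention = [0.05, 0.90, 0.05]  # attention fixed on the unaligned patch

penalty_matched = attention_alignment_penalty(target, target)
penalty_trigger = attention_alignment_penalty(trigger_attention, target)
```

In this toy setup, attention concentrated on the patch that has no support in the knowledge embeddings (as a backdoor trigger patch might) incurs a larger penalty than attention that follows the alignment distribution.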

Code & Models

Repositories

IshmamAlvi/Semantic-Shield
PyTorch · Official

Taxonomy

Topics: Multimodal Machine Learning Applications

Methods: Softmax · Attention Is All You Need