LAMP: Learning Universal Adversarial Perturbations for Multi-Image Tasks via Pre-trained Models

Alvi Md Ishmam; Najibul Haque Sarker; Zaber Ibn Abdul Hakim; Chris Thomas

arXiv:2601.21220·cs.CV·January 30, 2026

LAMP: Learning Universal Adversarial Perturbations for Multi-Image Tasks via Pre-trained Models

Alvi Md Ishmam, Najibul Haque Sarker, Zaber Ibn Abdul Hakim, Chris Thomas

PDF

Open Access

TL;DR

LAMP is a novel black-box attack method that learns universal adversarial perturbations to effectively compromise multi-image vision-language models by exploiting attention mechanisms and cross-image influences.

Contribution

The paper introduces LAMP, a new approach for generating universal adversarial perturbations targeting multi-image models, with novel constraints and loss functions for improved attack success.

Findings

01

LAMP outperforms state-of-the-art baselines in attack success rates.

02

LAMP effectively disrupts multi-image vision-language tasks.

03

LAMP demonstrates robustness across various models and tasks.

Abstract

Multimodal Large Language Models (MLLMs) have achieved remarkable performance across vision-language tasks. Recent advancements allow these models to process multiple images as inputs. However, the vulnerabilities of multi-image MLLMs remain unexplored. Existing adversarial attacks focus on single-image settings and often assume a white-box threat model, which is impractical in many real-world scenarios. This paper introduces LAMP, a black-box method for learning Universal Adversarial Perturbations (UAPs) targeting multi-image MLLMs. LAMP applies an attention-based constraint that prevents the model from effectively aggregating information across images. LAMP also introduces a novel cross-image contagious constraint that forces perturbed tokens to influence clean tokens, spreading adversarial effects without requiring all inputs to be modified. Additionally, an index-attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · COVID-19 diagnosis using AI · Generative Adversarial Networks and Image Synthesis