Adversarial Hubness in Multi-Modal Retrieval
Tingwei Zhang, Fnu Suya, Rishi Jha, Collin Zhang, and Vitaly Shmatikov

TL;DR
This paper explores how adversaries can exploit the hubness phenomenon in high-dimensional multi-modal retrieval systems to create universal or targeted adversarial content, revealing vulnerabilities in current mitigation techniques.
Contribution
It introduces a method for generating adversarial hubs in multi-modal retrieval systems and evaluates their effectiveness on benchmark datasets and real-world systems.
Findings
Adversarial hubs can be retrieved as top results for thousands of queries.
A single adversarial hub can dominate over 21,000 out of 25,000 queries.
Mitigation techniques for natural hubness are ineffective against targeted adversarial hubs.
Abstract
Hubness is a phenomenon in high-dimensional vector spaces where a point from the natural distribution is unusually close to many other points. This is a well-known problem in information retrieval that causes some items to accidentally (and incorrectly) appear relevant to many queries. In this paper, we investigate how attackers can exploit hubness to turn any image or audio input in a multi-modal retrieval system into an adversarial hub. Adversarial hubs can be used to inject universal adversarial content (e.g., spam) that will be retrieved in response to thousands of different queries, and also for targeted attacks on queries related to specific, attacker-chosen concepts. We present a method for creating adversarial hubs and evaluate the resulting hubs on benchmark multi-modal retrieval datasets and an image-to-image retrieval system implemented by Pinecone, a popular vector…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
