Adversarial Hubness Detector: Detecting Hubness Poisoning in Retrieval-Augmented Generation Systems
Idan Habler, Vineeth Sai Narajala, Stav Koren, Amy Chang, Tiffany Saade

TL;DR
This paper introduces hubscan, a comprehensive security tool for detecting harmful hubness in retrieval-augmented generation systems, enhancing safety and reliability in AI knowledge retrieval.
Contribution
We present hubscan, a multi-detector security scanner that identifies hubs in vector indices across various databases and retrieval methods, improving detection of adversarial content.
Findings
Achieves 90% recall at 0.2% alert rate
Detects adversarial hubs ranked above the 99.8th percentile
Successfully recovers 100% of targeted attacks in domain-scoped tests
Abstract
Retrieval-Augmented Generation (RAG) systems are essential to contemporary AI applications, allowing large language models to obtain external knowledge via vector similarity search. Nevertheless, these systems encounter a significant security flaw: hubness - items that frequently appear in the top- retrieval results for a disproportionately high number of varied queries. These hubs can be exploited to introduce harmful content, alter search rankings, bypass content filtering, and decrease system performance. We introduce hubscan, an open-source security scanner that evaluates vector indices and embeddings to identify hubs in RAG systems. Hubscan presents a multi-detector architecture that integrates: (1) robust statistical hubness detection utilizing median/Median Absolute Deviation (MAD)-based z-scores, (2) cluster spread analysis to assess cross-cluster retrieval patterns, (3)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Spam and Phishing Detection · Information Retrieval and Search Behavior
