Don't Be a Pot Stirrer! Authorized Vector Data Retrieval via Access-Aware Indexing
Shanshan Han, Vishal Chakraborty, Sharad Mehrotra

TL;DR
This paper introduces Veda and EffVeda, indexing strategies for vector databases that enforce role-based access control efficiently by partitioning data and using access-aware structures to optimize search performance.
Contribution
The paper proposes access-aware lattice-based indexing methods that improve search efficiency and storage management in role-restricted vector databases.
Findings
Veda and EffVeda achieve higher throughput at high recall.
The methods closely adhere to specified storage budgets.
Query processing is optimized by pruning impure nodes based on distance bounds.
Abstract
Vector databases increasingly enforce role-based access control, where each top-k approximate nearest neighbor query must return only vectors the querying role is authorized to access. Two extremes bracket the design space. A single global index built over all vectors avoids duplication but wastes search effort on unauthorized vectors and degrades recall, while an oracle index, built with all authorized vectors to the query roles, searches only authorized vectors but duplicates every shared vector between roles or queries. We present Veda and its efficient variant EffVeda, two indexing strategies built on an access-aware lattice to address access control in vector databases. The methods first partitions the dataset into disjoint data blocks by role combination, then leverage the structure of the access-aware lattice to apply copy and merge operations to group co-accessed blocks under a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
