TL;DR
This paper reveals how vector-store embeddings can be exploited for steganographic data exfiltration and introduces VectorPin, a cryptographic protocol to ensure embedding integrity and provenance.
Contribution
It demonstrates steganographic exfiltration techniques in vector stores and proposes VectorPin, a cryptographic method for embedding provenance and integrity verification.
Findings
Simple anomaly detectors often catch distribution-shifting perturbations.
Small-angle orthogonal rotation defeats distribution-based detection.
VectorPin effectively verifies embedding provenance and integrity.
Abstract
Modern retrieval-augmented generation (RAG) systems convert sensitive content into high-dimensional embeddings and store them in vector databases that treat the resulting numerical artifacts as opaque. Major vector-store products do not provide native controls for embedding integrity, ingestion-time distributional anomaly detection, or cryptographic provenance attestation. We show this opens a class of steganographic exfiltration attacks: an attacker with write access to the ingestion pipeline can hide payload data inside embeddings using simple post-embedding perturbations (noise injection, rotation, scaling, offset, fragmentation, and combinations thereof) while preserving the surface-level retrieval behavior the RAG system exposes to legitimate users. We evaluate these techniques across a synthetic-PII corpus on text-embedding-3-large, four locally hosted open embedding models, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
