VectorSmuggle: Steganographic Exfiltration in Embedding Stores and a Cryptographic Provenance Defense

Jascha Wanger

arXiv:2605.13764·cs.CR·May 14, 2026

VectorSmuggle: Steganographic Exfiltration in Embedding Stores and a Cryptographic Provenance Defense

Jascha Wanger

PDF

1 Repo

TL;DR

This paper reveals how vector-store embeddings can be exploited for steganographic data exfiltration and introduces VectorPin, a cryptographic protocol to ensure embedding integrity and provenance.

Contribution

It demonstrates steganographic exfiltration techniques in vector stores and proposes VectorPin, a cryptographic method for embedding provenance and integrity verification.

Findings

01

Simple anomaly detectors often catch distribution-shifting perturbations.

02

Small-angle orthogonal rotation defeats distribution-based detection.

03

VectorPin effectively verifies embedding provenance and integrity.

Abstract

Modern retrieval-augmented generation (RAG) systems convert sensitive content into high-dimensional embeddings and store them in vector databases that treat the resulting numerical artifacts as opaque. Major vector-store products do not provide native controls for embedding integrity, ingestion-time distributional anomaly detection, or cryptographic provenance attestation. We show this opens a class of steganographic exfiltration attacks: an attacker with write access to the ingestion pipeline can hide payload data inside embeddings using simple post-embedding perturbations (noise injection, rotation, scaling, offset, fragmentation, and combinations thereof) while preserving the surface-level retrieval behavior the RAG system exposes to legitimate users. We evaluate these techniques across a synthetic-PII corpus on text-embedding-3-large, four locally hosted open embedding models, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jaschadub/VectorSmuggle
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.