Semantic Leakage from Image Embeddings
Yiyi Chen, Qiongkai Xu, Desmond Elliott, Qiongxiu Li, Johannes Bjerva

TL;DR
This paper demonstrates that image embeddings can leak semantic information even without reconstructing images, revealing a privacy vulnerability by preserving local semantic neighborhoods.
Contribution
The authors introduce SLImE, a lightweight framework that uncovers semantic content from compressed image embeddings without task-specific training.
Findings
Semantic leakage occurs through neighborhood preservation.
SLImE effectively retrieves tags and descriptions.
Semantic information can propagate through lossy mappings.
Abstract
Image embeddings are generally assumed to pose limited privacy risk. We challenge this assumption by formalizing semantic leakage as the ability to recover semantic structures from compressed image embeddings. Surprisingly, we show that semantic leakage does not require exact reconstruction of the original image. Preserving local semantic neighborhoods under embedding alignment is sufficient to expose the intrinsic vulnerability of image embeddings. Crucially, this preserved neighborhood structure allows semantic information to propagate through a sequence of lossy mappings. Based on this conjecture, we propose Semantic Leakage from Image Embeddings (SLImE), a lightweight inference framework that reveals semantic information from standalone compressed image embeddings, incorporating a locally trained semantic retriever with off-the-shelf models, without training task-specific decoders.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning
