Semantic Leakage from Image Embeddings

Yiyi Chen; Qiongkai Xu; Desmond Elliott; Qiongxiu Li; Johannes Bjerva

arXiv:2601.22929·cs.CV·February 3, 2026

Semantic Leakage from Image Embeddings

Yiyi Chen, Qiongkai Xu, Desmond Elliott, Qiongxiu Li, Johannes Bjerva

PDF

Open Access

TL;DR

This paper demonstrates that image embeddings can leak semantic information even without reconstructing images, revealing a privacy vulnerability by preserving local semantic neighborhoods.

Contribution

The authors introduce SLImE, a lightweight framework that uncovers semantic content from compressed image embeddings without task-specific training.

Findings

01

Semantic leakage occurs through neighborhood preservation.

02

SLImE effectively retrieves tags and descriptions.

03

Semantic information can propagate through lossy mappings.

Abstract

Image embeddings are generally assumed to pose limited privacy risk. We challenge this assumption by formalizing semantic leakage as the ability to recover semantic structures from compressed image embeddings. Surprisingly, we show that semantic leakage does not require exact reconstruction of the original image. Preserving local semantic neighborhoods under embedding alignment is sufficient to expose the intrinsic vulnerability of image embeddings. Crucially, this preserved neighborhood structure allows semantic information to propagate through a sequence of lossy mappings. Based on this conjecture, we propose Semantic Leakage from Image Embeddings (SLImE), a lightweight inference framework that reveals semantic information from standalone compressed image embeddings, incorporating a locally trained semantic retriever with off-the-shelf models, without training task-specific decoders.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning