Transferable Embedding Inversion Attack: Uncovering Privacy Risks in   Text Embeddings without Model Queries

Yu-Hsiang Huang; Yuche Tsai; Hsiang Hsiao; Hong-Yi Lin; Shou-De Lin

arXiv:2406.10280·cs.CR·January 15, 2025

Transferable Embedding Inversion Attack: Uncovering Privacy Risks in Text Embeddings without Model Queries

Yu-Hsiang Huang, Yuche Tsai, Hsiang Hsiao, Hong-Yi Lin, Shou-De Lin

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper presents a transfer attack method that uncovers privacy vulnerabilities in text embeddings without direct access to the original model, highlighting significant security concerns.

Contribution

It introduces a novel transfer attack approach that infers sensitive information from text embeddings using a surrogate model, without requiring direct model queries.

Findings

01

Transfer attack outperforms traditional methods across models.

02

Demonstrates privacy risks in text embedding technologies.

03

Effective on clinical datasets.

Abstract

This study investigates the privacy risks associated with text embeddings, focusing on the scenario where attackers cannot access the original embedding model. Contrary to previous research requiring direct model access, we explore a more realistic threat model by developing a transfer attack method. This approach uses a surrogate model to mimic the victim model's behavior, allowing the attacker to infer sensitive information from text embeddings without direct access. Our experiments across various embedding models and a clinical dataset demonstrate that our transfer attack significantly outperforms traditional methods, revealing the potential privacy vulnerabilities in embedding technologies and emphasizing the need for enhanced security measures.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

coffree0123/TEIA
pytorchOfficial

Videos

Transferable Embedding Inversion Attack: Uncovering Privacy Risks in Text Embeddings without Model Queries· underline

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Privacy-Preserving Technologies in Data · Access Control and Trust