Proxy-Embedding as an Adversarial Teacher: An Embedding-Guided Bidirectional Attack for Referring Expression Segmentation Models

Xingbai Chen; Tingchao Fu; Renyang Liu; Wei Zhou; Chao Yi

arXiv:2506.16157·cs.CV·September 23, 2025

Proxy-Embedding as an Adversarial Teacher: An Embedding-Guided Bidirectional Attack for Referring Expression Segmentation Models

Xingbai Chen, Tingchao Fu, Renyang Liu, Wei Zhou, Chao Yi

PDF

Open Access

TL;DR

This paper introduces PEAT, a novel embedding-guided bidirectional adversarial attack method that exposes vulnerabilities in referring expression segmentation models, enhancing their robustness and security against diverse and sensitive inputs.

Contribution

The paper proposes PEAT, a new adversarial attack technique specifically designed for RES models, addressing their multimodal structure and generalization across varied expressions.

Findings

01

PEAT outperforms existing baselines across multiple RES architectures.

02

The attack reveals significant vulnerabilities in current RES models.

03

Experiments demonstrate improved robustness and security of RES systems.

Abstract

Referring Expression Segmentation (RES) enables precise object segmentation in images based on natural language descriptions, offering high flexibility and broad applicability in real-world vision tasks. Despite its impressive performance, the robustness of RES models against adversarial examples remains largely unexplored. While prior adversarial attack methods have explored adversarial robustness on conventional segmentation models, they perform poorly when directly applied to RES models, failing to expose vulnerabilities in its multimodal structure. In practical open-world scenarios, users typically issue multiple, diverse referring expressions to interact with the same image, highlighting the need for adversarial examples that generalize across varied textual inputs. Furthermore, from the perspective of privacy protection, ensuring that RES models do not segment sensitive content…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Hate Speech and Cyberbullying Detection · Topic Modeling