SynRES: Towards Referring Expression Segmentation in the Wild via Synthetic Data

Dong-Hee Kim; Hyunjee Song; Donghyun Kim

arXiv:2505.17695·cs.LG·May 26, 2025

SynRES: Towards Referring Expression Segmentation in the Wild via Synthetic Data

Dong-Hee Kim, Hyunjee Song, Donghyun Kim

PDF

TL;DR

This paper introduces SynRES, a synthetic data generation pipeline that enhances referring expression segmentation models, enabling better generalization and performance on complex, real-world benchmarks like WildRES.

Contribution

The paper presents SynRES, a novel automated synthetic data generation method that improves RES model training and evaluation in diverse, complex scenarios.

Findings

01

Models trained with SynRES outperform previous methods on WildRES benchmarks.

02

SynRES improves gIoU by 2.0% on WildRES-ID and 3.8% on WildRES-DS.

03

WildRES reveals current RES models struggle with complex reasoning in real-world settings.

Abstract

Despite the advances in Referring Expression Segmentation (RES) benchmarks, their evaluation protocols remain constrained, primarily focusing on either single targets with short queries (containing minimal attributes) or multiple targets from distinctly different queries on a single domain. This limitation significantly hinders the assessment of more complex reasoning capabilities in RES models. We introduce WildRES, a novel benchmark that incorporates long queries with diverse attributes and non-distinctive queries for multiple targets. This benchmark spans diverse application domains, including autonomous driving environments and robotic manipulation scenarios, thus enabling more rigorous evaluation of complex reasoning capabilities in real-world settings. Our analysis reveals that current RES models demonstrate substantial performance deterioration when evaluated on WildRES. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.