TALENT: Target-aware Efficient Tuning for Referring Image Segmentation

Shuo Jin; Siyue Yu; Bingfeng Zhang; Chao Yao; Meiqin Liu; Jimin Xiao

arXiv:2604.00609·cs.CV·April 2, 2026


TL;DR

TALENT introduces a target-aware efficient tuning framework for referring image segmentation, effectively addressing non-target activation issues and improving target localization accuracy.

Contribution

The paper proposes a framework that combines a Rectified Cost Aggregator (RCA) with a Target-aware Learning Mechanism (TLM) to focus visual features on the referred target, and reports outperforming existing methods.

Findings

01 Achieves a 2.5% mIoU improvement on the G-Ref validation set.

02 Effectively suppresses activation of unrelated objects.

03 Enhances target localization accuracy.
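For readers unfamiliar with the metric behind the first finding, here is a minimal sketch of mIoU as it is commonly computed for referring segmentation: the intersection-over-union of each predicted mask against its ground-truth mask, averaged over samples. The function names `mask_iou` and `mean_iou`, the nested-list mask representation, and the empty-mask convention are assumptions made for illustration; the paper's own evaluation script may differ.

```python
from typing import Sequence

# A binary mask as rows of 0/1 values (illustrative representation only).
Mask = Sequence[Sequence[int]]


def mask_iou(pred: Mask, target: Mask) -> float:
    """Intersection-over-union of two binary masks of the same shape."""
    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            intersection += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    # Assumed convention: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


def mean_iou(preds: Sequence[Mask], targets: Sequence[Mask]) -> float:
    """Average the per-sample IoU over all (prediction, ground truth) pairs."""
    if len(preds) != len(targets):
        raise ValueError("preds and targets must have the same length")
    if not preds:
        raise ValueError("at least one sample is required")
    return sum(mask_iou(p, t) for p, t in zip(preds, targets)) / len(preds)


if __name__ == "__main__":
    preds = [[[1, 1], [0, 0]], [[0, 1], [0, 1]]]
    targets = [[[1, 0], [0, 0]], [[0, 1], [0, 1]]]
    print(mean_iou(preds, targets))
```

Averaging per sample weights every expression equally regardless of object size, which is why a predicted mask that also covers an unrelated object of the same category lowers the score noticeably.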

Abstract

Referring image segmentation (RIS) aims to segment a specific target described by a natural-language expression. Recently, parameter-efficient tuning (PET) has emerged as a promising paradigm. However, in existing PET-based methods the visual features often fail to emphasize the text-referred target instance and instead activate unrelated objects of the same category. We analyze and quantify this problem, terming it the 'non-target activation' (NTA) issue. To address this, we propose a novel framework, TALENT, which utilizes target-aware efficient tuning for PET-based RIS. Specifically, we first propose a Rectified Cost Aggregator (RCA) to efficiently aggregate text-referred features. Then, to calibrate NTA into accurate target activation, we adopt a Target-aware Learning Mechanism (TLM), including contextual pairwise consistency learning and target-centric contrastive learning. The former uses…
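The abstract is cut off before it describes either learning objective, so the following is not the paper's formulation. As general background on contrastive learning, this is a sketch of a generic InfoNCE-style loss that pulls an anchor embedding toward one positive and pushes it away from negatives. The function names, the cosine similarity, the temperature value, and the reading of "anchor" as a text embedding and "negatives" as features of non-target objects are all assumptions made for illustration.

```python
import math
from typing import Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for zero vectors")
    return dot / (norm_a * norm_b)


def info_nce(anchor: Vector, positive: Vector,
             negatives: Sequence[Vector], temperature: float = 0.1) -> float:
    """Generic InfoNCE loss: negative log-softmax of the positive pair."""
    logits = [cosine(anchor, positive) / temperature]
    logits += [cosine(anchor, n) / temperature for n in negatives]
    # Log-sum-exp with the maximum subtracted for numerical stability.
    peak = max(logits)
    log_sum_exp = peak + math.log(sum(math.exp(v - peak) for v in logits))
    return log_sum_exp - logits[0]


if __name__ == "__main__":
    text = [1.0, 0.0]        # hypothetical text embedding (anchor)
    target = [0.9, 0.1]      # hypothetical target-region feature (positive)
    distractor = [0.1, 0.9]  # hypothetical non-target feature (negative)
    print(info_nce(text, target, [distractor]))
```

The loss is small when the anchor is closer to the positive than to every negative and grows when a negative is the better match, which is the behaviour one would want when discouraging activation on non-target objects.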


Code & Models

Repositories

Kimsure/TALENT (GitHub)

Models

Kimsure99/TALENT (Hugging Face model)
