INT: Instance-Specific Negative Mining for Task-Generic Promptable   Segmentation

Jian Hu; Zixu Cheng; Shaogang Gong

arXiv:2501.18753·cs.CV·February 3, 2025

INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation

Jian Hu, Zixu Cheng, Shaogang Gong

PDF

Open Access

TL;DR

This paper introduces INT, a method that improves task-generic promptable segmentation by adaptively mining negative examples to refine instance-specific prompts, enhancing segmentation accuracy across diverse datasets.

Contribution

INT presents a novel negative mining approach that adaptively filters irrelevant information to generate more accurate instance-specific prompts for segmentation.

Findings

01

Effective across six diverse datasets.

02

Improves segmentation robustness and scalability.

03

Outperforms existing prompt-based segmentation methods.

Abstract

Task-generic promptable image segmentation aims to achieve segmentation of diverse samples under a single task description by utilizing only one task-generic prompt. Current methods leverage the generalization capabilities of Vision-Language Models (VLMs) to infer instance-specific prompts from these task-generic prompts in order to guide the segmentation process. However, when VLMs struggle to generalise to some image instances, predicting instance-specific prompts becomes poor. To solve this problem, we introduce \textbf{I}nstance-specific \textbf{N}egative Mining for \textbf{T}ask-Generic Promptable Segmentation (\textbf{INT}). The key idea of INT is to adaptively reduce the influence of irrelevant (negative) prior knowledge whilst to increase the use the most plausible prior knowledge, selected by negative mining with higher contrast, in order to optimise instance-specific prompts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Human Pose and Action Recognition