INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation
Jian Hu, Zixu Cheng, Shaogang Gong

TL;DR
This paper introduces INT, a method that improves task-generic promptable segmentation by adaptively mining negative examples to refine instance-specific prompts, enhancing segmentation accuracy across diverse datasets.
Contribution
INT presents a novel negative mining approach that adaptively filters irrelevant information to generate more accurate instance-specific prompts for segmentation.
Findings
Effective across six diverse datasets.
Improves segmentation robustness and scalability.
Outperforms existing prompt-based segmentation methods.
Abstract
Task-generic promptable image segmentation aims to achieve segmentation of diverse samples under a single task description by utilizing only one task-generic prompt. Current methods leverage the generalization capabilities of Vision-Language Models (VLMs) to infer instance-specific prompts from these task-generic prompts in order to guide the segmentation process. However, when VLMs struggle to generalise to some image instances, predicting instance-specific prompts becomes poor. To solve this problem, we introduce \textbf{I}nstance-specific \textbf{N}egative Mining for \textbf{T}ask-Generic Promptable Segmentation (\textbf{INT}). The key idea of INT is to adaptively reduce the influence of irrelevant (negative) prior knowledge whilst to increase the use the most plausible prior knowledge, selected by negative mining with higher contrast, in order to optimise instance-specific prompts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Human Pose and Action Recognition
