A Simple Image Segmentation Framework via In-Context Examples

Yang Liu; Chenchen Jing; Hengtao Li; Muzhi Zhu; Hao Chen; Xinlong; Wang; Chunhua Shen

arXiv:2410.04842·cs.CV·October 10, 2024·2 cites

A Simple Image Segmentation Framework via In-Context Examples

Yang Liu, Chenchen Jing, Hengtao Li, Muzhi Zhu, Hao Chen, Xinlong, Wang, Chunhua Shen

PDF

Open Access 1 Repo 1 Video

TL;DR

SINE is a straightforward image segmentation framework that uses in-context examples and a Transformer-based architecture to reduce task ambiguity and improve multi-task segmentation performance.

Contribution

The paper introduces SINE, a novel in-context segmentation method with modules for interaction and matching, enhancing task understanding and accuracy.

Findings

01

Effective across various segmentation tasks

02

Reduces task ambiguity in in-context learning

03

Improves segmentation accuracy with proposed modules

Abstract

Recently, there have been explorations of generalist segmentation models that can effectively tackle a variety of image segmentation tasks within a unified in-context learning framework. However, these methods still struggle with task ambiguity in in-context segmentation, as not all in-context examples can accurately convey the task information. In order to address this issue, we present SINE, a simple image Segmentation framework utilizing in-context examples. Our approach leverages a Transformer encoder-decoder structure, where the encoder provides high-quality image representations, and the decoder is designed to yield multiple task-specific output masks to effectively eliminate task ambiguity. Specifically, we introduce an In-context Interaction module to complement in-context information and produce correlations between the target image and the in-context example and a Matching…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aim-uofa/sine
pytorchOfficial

Videos

A Simple Image Segmentation Framework via In-Context Examples· slideslive

Taxonomy

TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Medical Image Segmentation Techniques

MethodsAttention Is All You Need · Dense Connections · Adam · Linear Layer · Residual Connection · Position-Wise Feed-Forward Layer · Label Smoothing · Dropout · Byte Pair Encoding · Absolute Position Encodings