SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
Savinay Nagendra, Kashif Rashid, Chaopeng Shen, Daniel Kifer

TL;DR
SAMIC is a compact network that leverages vision foundation models to efficiently perform few-shot segmentation across various domains, significantly reducing model size and training data requirements while maintaining competitive performance.
Contribution
The paper introduces SAMIC, a small network that prompts vision foundation models for few-shot segmentation, enabling domain-specific applications with minimal training data and model size.
Findings
SAMIC achieves state-of-the-art results on multiple segmentation benchmarks.
SAMIC is 94% smaller than leading models with comparable performance.
It performs well even with only 1/5th of the training data.
Abstract
Few-shot segmentation is the problem of learning to identify specific types of objects (e.g., airplanes) in images from a small set of labeled reference images. The current state of the art is driven by resource-intensive construction of models for every new domain-specific application. Such models must be trained on enormous labeled datasets of unrelated objects (e.g., cars, trains, animals) so that their ``knowledge'' can be transferred to new types of objects. In this paper, we show how to leverage existing vision foundation models (VFMs) to reduce the incremental cost of creating few-shot segmentation models for new domains. Specifically, we introduce SAMIC, a small network that learns how to prompt VFMs in order to segment new types of objects in domain-specific applications. SAMIC enables any task to be approached as a few-shot learning problem. At 2.6 million parameters, it is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBIM and Construction Integration
MethodsAverage Pooling · Kaiming Initialization · Global Average Pooling · Sparse Evolutionary Training · Max Pooling · Convolution
