OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box Prompts
Zhen Zhou, Junfeng Fan, Yunkai Ma, Sihan Zhao, Fengshui Jing, Min Tan

TL;DR
OBSeg is a novel instance segmentation framework for remote sensing images that leverages oriented bounding box prompts and segmentation foundation models to achieve high accuracy and speed, reducing reliance on bounding box detection performance.
Contribution
The paper introduces OBSeg, a new framework that uses OBB prompts with foundation models and a novel OBB prompt encoder, improving accuracy and efficiency in instance segmentation.
Findings
Outperforms existing methods in accuracy on multiple datasets.
Achieves competitive inference speed.
Effectively handles OBB prompts with a new encoder and distillation techniques.
Abstract
Instance segmentation in remote sensing images is a long-standing challenge. Since horizontal bounding boxes introduce many interference objects, oriented bounding boxes (OBBs) are usually used for instance identification. However, based on ``segmentation within bounding box'' paradigm, current instance segmentation methods using OBBs are overly dependent on bounding box detection performance. To tackle this problem, this paper proposes OBSeg, an accurate and fast instance segmentation framework using OBBs. OBSeg is based on box prompt-based segmentation foundation models (BSMs), e.g., Segment Anything Model. Specifically, OBSeg first detects OBBs to distinguish instances and provide coarse localization information. Then, it predicts OBB prompt-related masks for fine segmentation. Since OBBs only serve as prompts, OBSeg alleviates the over-dependence on bounding box detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Advanced X-ray and CT Imaging
MethodsLabel Smoothing · Knowledge Distillation
