O2Former:Direction-Aware and Multi-Scale Query Enhancement for SAR Ship Instance Segmentation
F. Gao, Y Li, X He, J Sun, J Wang

TL;DR
O2Former is a novel SAR ship instance segmentation framework that leverages multi-scale and direction-aware features, significantly improving accuracy and efficiency over existing methods.
Contribution
The paper introduces O2Former, incorporating the Optimized Query Generator and Orientation-Aware Embedding Module to better handle SAR image challenges.
Findings
Outperforms state-of-the-art segmentation methods on SAR datasets.
Enhances directional sensitivity and multi-scale feature interaction.
Improves convergence efficiency and structural detail capture.
Abstract
Instance segmentation of ships in synthetic aperture radar (SAR) imagery is critical for applications such as maritime monitoring, environmental analysis, and national security. SAR ship images present challenges including scale variation, object density, and fuzzy target boundary, which are often overlooked in existing methods, leading to suboptimal performance. In this work, we propose O2Former, a tailored instance segmentation framework that extends Mask2Former by fully leveraging the structural characteristics of SAR imagery. We introduce two key components. The first is the Optimized Query Generator(OQG). It enables multi-scale feature interaction by jointly encoding shallow positional cues and high-level semantic information. This improves query quality and convergence efficiency. The second component is the Orientation-Aware Embedding Module(OAEM). It enhances directional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced SAR Imaging Techniques · Underwater Acoustics Research · Synthetic Aperture Radar (SAR) Applications and Techniques
