P2RBox: Point Prompt Oriented Object Detection with SAM
Guangming Cao, Xuehui Yu, Wenwen Yu, Xumeng Han, Xue Yang, Guorong Li,, Jianbin Jiao, Zhenjun Han

TL;DR
P2RBox introduces a novel point prompt-based method utilizing SAM for high-quality mask proposals and refined oriented object detection, significantly improving performance with minimal annotation effort.
Contribution
The paper presents P2RBox, a new approach that employs point prompts and SAM to generate accurate rotated boxes for oriented object detection, outperforming existing point-annotated methods.
Findings
Outperforms state-of-the-art point-annotated methods by 29% mAP.
Effectively integrates with multiple detectors to enhance detection accuracy.
Demonstrates practical application potential with minimal annotation effort.
Abstract
Single-point annotation in oriented object detection of remote sensing scenarios is gaining increasing attention due to its cost-effectiveness. However, due to the granularity ambiguity of points, there is a significant performance gap between previous methods and those with fully supervision. In this study, we introduce P2RBox, which employs point prompt to generate rotated box (RBox) annotation for oriented object detection. P2RBox employs the SAM model to generate high-quality mask proposals. These proposals are then refined using the semantic and spatial information from annotation points. The best masks are converted into oriented boxes based on the feature directions suggested by the model. P2RBox incorporates two advanced guidance cues: Boundary Sensitive Mask guidance, which leverages semantic information, and Centrality guidance, which utilizes spatial information to reduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Remote-Sensing Image Classification
MethodsSegment Anything Model · Focal Loss · Convolution · 1x1 Convolution · Feature Pyramid Network · Non Maximum Suppression · FCOS · RetinaNet
