PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point   Supervised Oriented Object Detection

Botao Ren; Xue Yang; Yi Yu; Junwei Luo; Zhidong Deng

arXiv:2410.08210·cs.CV·October 11, 2024·5 cites

PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection

Botao Ren, Xue Yang, Yi Yu, Junwei Luo, Zhidong Deng

PDF

Open Access 1 Repo

TL;DR

PointOBB-v2 introduces a simplified, faster, and more accurate method for single point supervised oriented object detection by generating pseudo rotated boxes from points without prior models, significantly improving speed and accuracy.

Contribution

The paper presents PointOBB-v2, a novel approach that generates pseudo rotated boxes using class probability maps and PCA, eliminating reliance on prior models and enhancing detection speed and accuracy.

Findings

01

15.58x faster training speed compared to previous methods

02

11.60%/25.15%/21.19% accuracy improvements on DOTA datasets

03

Effective in high-density object scenarios

Abstract

Single point supervised oriented object detection has gained attention and made initial progress within the community. Diverse from those approaches relying on one-shot samples or powerful pretrained models (e.g. SAM), PointOBB has shown promise due to its prior-free feature. In this paper, we propose PointOBB-v2, a simpler, faster, and stronger method to generate pseudo rotated boxes from points without relying on any other prior. Specifically, we first generate a Class Probability Map (CPM) by training the network with non-uniform positive and negative sampling. We show that the CPM is able to learn the approximate object regions and their contours. Then, Principal Component Analysis (PCA) is applied to accurately estimate the orientation and the boundary of objects. By further incorporating a separation mechanism, we resolve the confusion caused by the overlapping on the CPM,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

luo-z13/pointobb
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Image and Object Detection Techniques

MethodsSoftmax · Attention Is All You Need · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings