SOOD: Towards Semi-Supervised Oriented Object Detection
Wei Hua, Dingkang Liang, Jingyu Li, Xiaolong Liu, Zhikang Zou,, Xiaoqing Ye, Xiang Bai

TL;DR
This paper introduces SOOD, a semi-supervised oriented object detection model that improves detection of multi-oriented objects in aerial images by using novel loss functions for better supervision and consistency.
Contribution
The paper proposes a new semi-supervised oriented object detection framework with two innovative loss functions for orientation and layout consistency, enhancing detection performance.
Findings
SOOD outperforms state-of-the-art SSOD methods on DOTA-v1.5.
The proposed losses improve orientation and layout consistency in detection.
SOOD achieves significant accuracy gains in aerial image object detection.
Abstract
Semi-Supervised Object Detection (SSOD), aiming to explore unlabeled data for boosting object detectors, has become an active task in recent years. However, existing SSOD approaches mainly focus on horizontal objects, leaving multi-oriented objects that are common in aerial images unexplored. This paper proposes a novel Semi-supervised Oriented Object Detection model, termed SOOD, built upon the mainstream pseudo-labeling framework. Towards oriented objects in aerial scenes, we design two loss functions to provide better supervision. Focusing on the orientations of objects, the first loss regularizes the consistency between each pseudo-label-prediction pair (includes a prediction and its corresponding pseudo label) with adaptive weights based on their orientation gap. Focusing on the layout of an image, the second loss regularizes the similarity and explicitly builds the many-to-many…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
