SPWOOD: Sparse Partial Weakly-Supervised Oriented Object Detection
Wei Zhang, Xiang Liu, Ningjing Liu, Mingxin Liu, Wei Liao, Chunyan Xu, Xue Yang

TL;DR
This paper introduces SPWOOD, a novel framework for oriented object detection that efficiently utilizes sparse weak annotations and unlabeled data, significantly reducing labeling costs while maintaining high detection performance.
Contribution
The paper presents the first Sparse Partial Weakly-Supervised Oriented Object Detection framework with innovative models and strategies to leverage minimal annotations effectively.
Findings
Significant performance improvements over traditional methods.
Effective handling of sparse annotations and unlabeled data.
Validated on DOTA and DIOR datasets.
Abstract
A consistent trend throughout the research of oriented object detection has been the pursuit of maintaining comparable performance with fewer and weaker annotations. This is particularly crucial in the remote sensing domain, where the dense object distribution and a wide variety of categories contribute to prohibitively high costs. Based on the supervision level, existing oriented object detection algorithms can be broadly grouped into fully supervised, semi-supervised, and weakly supervised methods. Within the scope of this work, we further categorize them to include sparsely supervised and partially weakly-supervised methods. To address the challenges of large-scale labeling, we introduce the first Sparse Partial Weakly-Supervised Oriented Object Detection framework, designed to efficiently leverage only a few sparse weakly-labeled data and plenty of unlabeled data. Our framework…
Peer Reviews
Decision·ICLR 2026 Poster
1. The paper introduces a novel Sparse Partial Weakly-supervised Oriented Object Detection (SPWOOD) framework designed to significantly reduce annotation costs for oriented object detection in remote sensing. This task setting is more challenging and can further reduce annotation costs. 2. SPWOOD effectively leverages sparsely annotated data, weakly annotated data, and a large volume of unlabeled data, supporting various annotation formats (RBox, HBox, Point) or their combinations. This flexibi
1. While attempting to classify different methods and their cost-effectiveness, it is overly dense. 2. The paper claims the Overall Sparse Method addresses the bias of the Single Sparse Method. However, Table 6 on page 9 shows that the Single Sparse Method sometimes yields higher annotation counts and AP performance for several categories, which appears contradictory to the paper's claims. While the paper attempts to explain this by stating the Single Sparse Method "tends to retain more annota
The primary contributions are: (1) defining this new, complex problem setting; (2) proposing a unified teacher-student framework that leverages sparse, weak (HBox/Point), and unlabeled data simultaneously; (3) introducing several components, including a Sparse-annotation-Orientation-and-Scale-aware Student (SOS-Student) and a Multi-level Pseudo-labels Filtering (MPF) mechanism. The experiments conducted on DOTA datasets demonstrate that the proposed framework achieves state-of-the-art performanc
Despite the impressive results and the important problem formulation, I have several major concerns regarding the methodology's novelty and the paper's internal consistency. 1. Questionable Novelty of the Core Supervised Model (SOS-Student) The paper presents the SOS-Student as a key innovation, but its core components for learning orientation and scale appear to be directly adopted from prior work. • Orientation Learning (Sec 3.2.2): This is explicitly stated to be a symmetry-aware approach fro
1.The paper investigates a new and practical Sparse Partial Weakly-supervised Oriented Object Detection problem that reduces annotation effort in remote sensing. 2.The proposed SOS-Student effectively integrates orientation and scale learning under sparse and weak supervision. 3.The writing and structure are clear, logical, and easy to follow, with well-presented figures and tables. 4.The framework’s generality across multiple annotation types (RBox, HBox, Point) increases its potential pra
1.The related work lacks discussion of similar sparse partial supervision paradigms explored in other domains, such as 3D detection (e.g., HINTED), which share comparable annotation reduction strategies and technical challenges. 2.The experimental validation is confined to DOTA datasets, limiting the evidence of generality; evaluation on other datasets or domains would strengthen the claims. 3.The baseline selection is relatively weak, with no comparison to stronger or more recent sparse parti
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification
