Multi-clue Consistency Learning to Bridge Gaps Between General and   Oriented Object in Semi-supervised Detection

Chenxu Wang; Chunyan Xu; Ziqi Gu; Zhen Cui

arXiv:2407.05909·cs.CV·July 9, 2024

Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection

Chenxu Wang, Chunyan Xu, Ziqi Gu, Zhen Cui

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel semi-supervised detection framework called Multi-clue Consistency Learning (MCL) that effectively addresses the unique challenges of oriented object detection in aerial images, achieving state-of-the-art results.

Contribution

The paper proposes specific strategies like Gaussian Center Assignment, Scale-aware Label Assignment, and Consistent Confidence Soft Label to bridge gaps between general and oriented object detection in semi-supervised learning.

Findings

01

Achieves state-of-the-art performance on DOTA benchmarks.

02

Effectively handles oriented objects with various scales and aspect ratios.

03

Reduces noise and improves pseudo-label quality in semi-supervised detection.

Abstract

While existing semi-supervised object detection (SSOD) methods perform well in general scenes, they encounter challenges in handling oriented objects in aerial images. We experimentally find three gaps between general and oriented object detection in semi-supervised learning: 1) Sampling inconsistency: the common center sampling is not suitable for oriented objects with larger aspect ratios when selecting positive labels from labeled data. 2) Assignment inconsistency: balancing the precision and localization quality of oriented pseudo-boxes poses greater challenges which introduces more noise when selecting positive labels from unlabeled data. 3) Confidence inconsistency: there exists more mismatch between the predicted classification and localization qualities when considering oriented objects, affecting the selection of pseudo-labels. Therefore, we propose a Multi-clue Consistency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facias914/sood-mcl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Text and Document Classification Technologies