TL;DR
This paper introduces TinySet-9M, a large-scale dataset for small object detection, and proposes a novel point-prompted detection paradigm called P2SOD, leading to a scalable framework that significantly improves detection performance.
Contribution
It presents the first large-scale multi-domain dataset for small objects and a new point-prompted detection paradigm, advancing label-efficient and semantic-aware small object detection.
Findings
TinySet-9M enables effective evaluation of small object detection methods.
Weak visual cues significantly impact label-efficient detection performance.
DEAL achieves 31.4% relative improvement over baselines with a single inference click.
Abstract
Small object detection (SOD) remains challenging due to extremely limited pixels and ambiguous object boundaries. These characteristics lead to challenging annotation, limited availability of large-scale high-quality datasets, and inherently weak semantic representations for small objects. In this work, we first address the data limitation by introducing TinySet-9M, the first large-scale, multi-domain dataset for small object detection. Beyond filling the gap in large-scale datasets, we establish a benchmark to evaluate the effectiveness of existing label-efficient detection methods for small objects. Our evaluation reveals that weak visual cues further exacerbate the performance degradation of label-efficient methods in small object detection, highlighting a critical challenge in label-efficient SOD. Secondly, to tackle the limitation of insufficient semantic representation, we move…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
