Generalized Small Object Detection: A Point-Prompted Paradigm and Benchmark

Haoran Zhu; Wen Yang; Guangyou Yang; Chang Xu; Ruixiang Zhang; Fang Xu; Haijian Zhang; Gui-Song Xia

arXiv:2604.02773 · cs.CV · April 6, 2026

TL;DR

This paper introduces TinySet-9M, a large-scale, multi-domain dataset for small object detection, and proposes a point-prompted detection paradigm called P2SOD, leading to a scalable framework that improves detection performance over existing baselines.

Contribution

The paper presents TinySet-9M, the first large-scale, multi-domain dataset for small object detection, and P2SOD, a new point-prompted detection paradigm, advancing label-efficient and semantic-aware small object detection.

Findings

01 TinySet-9M serves as a benchmark for evaluating existing label-efficient detection methods on small objects.

02 Weak visual cues further degrade the performance of label-efficient detection methods on small objects.

03 DEAL achieves a 31.4% relative improvement over baselines with a single click at inference.

Abstract

Small object detection (SOD) remains challenging due to extremely limited pixels and ambiguous object boundaries. These characteristics lead to challenging annotation, limited availability of large-scale high-quality datasets, and inherently weak semantic representations for small objects. In this work, we first address the data limitation by introducing TinySet-9M, the first large-scale, multi-domain dataset for small object detection. Beyond filling the gap in large-scale datasets, we establish a benchmark to evaluate the effectiveness of existing label-efficient detection methods for small objects. Our evaluation reveals that weak visual cues further exacerbate the performance degradation of label-efficient methods in small object detection, highlighting a critical challenge in label-efficient SOD. Secondly, to tackle the limitation of insufficient semantic representation, we move…

Code & Models

Repositories

https://zhuhaoraneis.github.io/TinySet-9M