TL;DR
This paper introduces a large-scale remote sensing object counting dataset and a novel neural network method with attention, scale pyramid, and deformable convolution modules to improve counting accuracy in complex aerial images.
Contribution
The paper presents the first large-scale remote sensing object counting dataset and a new neural network architecture tailored to address scale, clutter, and orientation challenges.
Findings
The dataset includes four key geographic object categories.
The proposed method outperforms existing state-of-the-art approaches.
Extensive experiments validate the effectiveness of the new dataset and model.
Abstract
Object counting, whose aim is to estimate the number of objects from a given image, is an important and challenging computation task. Significant efforts have been devoted to addressing this problem and achieved great progress, yet counting the number of ground objects from remote sensing images is barely studied. In this paper, we are interested in counting dense objects from remote sensing images. Compared with object counting in a natural scene, this task is challenging in the following factors: large scale variation, complex cluttered background, and orientation arbitrariness. More importantly, the scarcity of data severely limits the development of research in this field. To address these issues, we first construct a large-scale object counting dataset with remote sensing images, which contains four important geographic objects: buildings, crowded ships in harbors, large-vehicles…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution · Deformable Convolution
