TL;DR
This paper introduces DEA-Net, a novel training sample generator for robust small object detection in aerial images, leveraging model interaction and multi-task training to improve accuracy and efficiency.
Contribution
The paper proposes a dynamic enhancement anchor network that uses a sample discriminator for interactive sample screening, improving small object detection in aerial imagery.
Findings
Achieves state-of-the-art accuracy on DOTA and HRSC2016 benchmarks.
Surpasses previous methods by 0.40% mAP for oriented detection.
Maintains moderate inference speed and computational overhead.
Abstract
Object detection has made tremendous strides in computer vision. Small object detection with appearance degradation is a prominent challenge, especially for aerial observations. To collect sufficient positive/negative samples for heuristic training, most object detectors preset region anchors in order to calculate Intersection-over-Union (IoU) against the ground-truthed data. In this case, small objects are frequently abandoned or mislabeled. In this paper, we present an effective Dynamic Enhancement Anchor (DEA) network to construct a novel training sample generator. Different from the other state-of-the-art techniques, the proposed network leverages a sample discriminator to realize interactive sample screening between an anchor-based unit and an anchor-free unit to generate eligible samples. Besides, multi-task joint training with a conservative anchor-based inference scheme enhances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
