Anchor Retouching via Model Interaction for Robust Object Detection in   Aerial Images

Dong Liang; Qixiang Geng; Zongqi Wei; Dmitry A. Vorontsov; Ekaterina; L. Kim; Mingqiang Wei; Huiyu Zhou

arXiv:2112.06701·cs.CV·May 4, 2022

Anchor Retouching via Model Interaction for Robust Object Detection in Aerial Images

Dong Liang, Qixiang Geng, Zongqi Wei, Dmitry A. Vorontsov, Ekaterina, L. Kim, Mingqiang Wei, Huiyu Zhou

PDF

1 Repo

TL;DR

This paper introduces DEA-Net, a novel training sample generator for robust small object detection in aerial images, leveraging model interaction and multi-task training to improve accuracy and efficiency.

Contribution

The paper proposes a dynamic enhancement anchor network that uses a sample discriminator for interactive sample screening, improving small object detection in aerial imagery.

Findings

01

Achieves state-of-the-art accuracy on DOTA and HRSC2016 benchmarks.

02

Surpasses previous methods by 0.40% mAP for oriented detection.

03

Maintains moderate inference speed and computational overhead.

Abstract

Object detection has made tremendous strides in computer vision. Small object detection with appearance degradation is a prominent challenge, especially for aerial observations. To collect sufficient positive/negative samples for heuristic training, most object detectors preset region anchors in order to calculate Intersection-over-Union (IoU) against the ground-truthed data. In this case, small objects are frequently abandoned or mislabeled. In this paper, we present an effective Dynamic Enhancement Anchor (DEA) network to construct a novel training sample generator. Different from the other state-of-the-art techniques, the proposed network leverages a sample discriminator to realize interactive sample screening between an anchor-based unit and an anchor-free unit to generate eligible samples. Besides, multi-task joint training with a conservative anchor-based inference scheme enhances…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qxgeng/dea-net
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings