Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery
Seyed Majid Azimi, Eleonora Vig, Reza Bahmanyar, Marco K\"orner, Peter, Reinartz

TL;DR
This paper introduces a novel multi-scale, rotation-aware object detection method for remote sensing images, significantly improving accuracy on challenging datasets and generalizing well across different datasets.
Contribution
The work proposes a joint image cascade and feature pyramid network with multi-size kernels, combined with rotation-based detection and a new loss for oriented boxes, advancing remote sensing object detection.
Findings
Achieves 68.16% mAP on horizontal bounding boxes on DOTA
Achieves 72.45% mAP on oriented bounding boxes on DOTA
Outperforms all published methods by large margins
Abstract
Automatic multi-class object detection in remote sensing images in unconstrained scenarios is of high interest for several applications including traffic monitoring and disaster management. The huge variation in object scale, orientation, category, and complex backgrounds, as well as the different camera sensors pose great challenges for current algorithms. In this work, we propose a new method consisting of a novel joint image cascade and feature pyramid network with multi-size convolution kernels to extract multi-scale strong and weak semantic features. These features are fed into rotation-based region proposal and region of interest networks to produce object detections. Finally, rotational non-maximum suppression is applied to remove redundant detections. During training, we minimize joint horizontal and oriented bounding box loss functions, as well as a novel loss that enforces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Remote-Sensing Image Classification
MethodsConvolution
