LR-CNN: Local-aware Region CNN for Vehicle Detection in Aerial Imagery

Wentong Liao; Xiang Chen; Jingfeng Yang; Stefan Roth; Michael Goesele,; Michael Ying Yang; Bodo Rosenhahn

arXiv:2005.14264·cs.CV·June 1, 2020

LR-CNN: Local-aware Region CNN for Vehicle Detection in Aerial Imagery

Wentong Liao, Xiang Chen, Jingfeng Yang, Stefan Roth, Michael Goesele,, Michael Ying Yang, Bodo Rosenhahn

PDF

TL;DR

LR-CNN is a novel two-stage vehicle detection method in aerial images that improves accuracy for dense, small, and arbitrarily oriented targets by enhancing local feature invariance and addressing boundary quantization issues.

Contribution

The paper introduces LR-CNN, which enhances translation and local feature invariance for better detection of dense, small, and arbitrarily oriented vehicles in aerial imagery, overcoming limitations of existing methods.

Findings

01

Significant improvement over state-of-the-art on VEDAI and DOTA datasets.

02

Effective detection of dense, small, and arbitrarily oriented vehicles.

03

Good generalization demonstrated on the DLR 3K dataset.

Abstract

State-of-the-art object detection approaches such as Fast/Faster R-CNN, SSD, or YOLO have difficulties detecting dense, small targets with arbitrary orientation in large aerial images. The main reason is that using interpolation to align RoI features can result in a lack of accuracy or even loss of location information. We present the Local-aware Region Convolutional Neural Network (LR-CNN), a novel two-stage approach for vehicle detection in aerial imagery. We enhance translation invariance to detect dense vehicles and address the boundary quantization issue amongst dense vehicles by aggregating the high-precision RoIs' features. Moreover, we resample high-level semantic pooled features, making them regain location information from the features of a shallower convolutional block. This strengthens the local feature invariance for the resampled features and enables detecting vehicles in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocal Loss