R$^3$-Net: A Deep Network for Multi-oriented Vehicle Detection in Aerial Images and Videos
Qingpeng Li, Lichao Mou, Qizhi Xu, Yun Zhang, Xiao Xiang, Zhu

TL;DR
R$^3$-Net is a novel deep learning framework that accurately detects multi-oriented vehicles in aerial images and videos, providing orientation information crucial for trajectory and motion analysis.
Contribution
The paper introduces R$^3$-Net, combining rotatable region proposal and detection networks with a new pooling strategy, enabling joint training for multi-oriented vehicle detection.
Findings
High precision and robustness demonstrated on two open datasets.
Effective generalization to aerial videos for vehicle tracking.
Novel rotatable pooling preserves orientation information.
Abstract
Vehicle detection is a significant and challenging task in aerial remote sensing applications. Most existing methods detect vehicles with regular rectangle boxes and fail to offer the orientation of vehicles. However, the orientation information is crucial for several practical applications, such as the trajectory and motion estimation of vehicles. In this paper, we propose a novel deep network, called rotatable region-based residual network (R-Net), to detect multi-oriented vehicles in aerial images and videos. More specially, R-Net is utilized to generate rotatable rectangular target boxes in a half coordinate system. First, we use a rotatable region proposal network (R-RPN) to generate rotatable region of interests (R-RoIs) from feature maps produced by a deep convolutional neural network. Here, a proposed batch averaging rotatable anchor (BAR anchor) strategy is applied to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
