Aerial Monocular 3D Object Detection
Yue Hu, Shaoheng Fang, Weidi Xie, Siheng Chen

TL;DR
This paper introduces DVDET, a novel dual-view system for monocular 3D object detection from drones, featuring a geo-deformable transformation to handle view distortion, and provides new datasets for training and evaluation.
Contribution
The work presents a new aerial monocular 3D detection system with a trainable deformation module and introduces large-scale simulation and real-world datasets for this task.
Findings
Aerial monocular 3D detection is feasible with the proposed method.
Pre-training on simulation data improves real-world detection performance.
DVDET enhances monocular 3D detection for both aerial views and cars.
Abstract
Drones equipped with cameras can significantly enhance human ability to perceive the world because of their remarkable maneuverability in 3D space. Ironically, object detection for drones has always been conducted in the 2D image space, which fundamentally limits their ability to understand 3D scenes. Furthermore, existing 3D object detection methods developed for autonomous driving cannot be directly applied to drones due to the lack of deformation modeling, which is essential for the distant aerial perspective with sensitive distortion and small objects. To fill the gap, this work proposes a dual-view detection system named DVDET to achieve aerial monocular object detection in both the 2D image space and the 3D physical space. To address the severe view deformation issue, we propose a novel trainable geo-deformable transformation module that can properly warp information from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Video Surveillance and Tracking Methods
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
