Towards Real-time 3D Reconstruction using Consumer UAVs
Qiaosong Wang

TL;DR
This paper introduces a real-time 3D reconstruction system using consumer UAVs with a single camera, leveraging virtual stereo views and GPU acceleration to produce dense point clouds at 25 fps.
Contribution
It simplifies multi-view stereo into two-view problems, enabling real-time 3D mapping with consumer UAVs without specialized hardware.
Findings
Achieves 25 fps dense point cloud output
Validates effectiveness on real-world UAV datasets
Allows flexible baseline adjustment for improved accuracy
Abstract
We present a near real-time solution for 3D reconstruction from aerial images captured by consumer UAVs. Our core idea is to simplify the multi-view stereo problem into a series of two-view stereo matching problems. Our method applies to UAVs equipped with only one camera and does not require special stereo capturing setups. We found that the neighboring two video frames taken by UAVs flying at a mid-to-high cruising altitude can be approximated as left and right views from a virtual stereo camera. Leveraging GPU-accelerated real-time stereo estimation and efficient PnP correspondence solving algorithms, our system simultaneously predicts scene geometry and camera position/orientation from the virtual stereo cameras. Also, this method allows user-selection of varying baseline lengths, which provides more flexibility given the trade-off between camera resolution, effective measuring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
