UAV-MM3D: A Large-Scale Synthetic Benchmark for 3D Perception of Unmanned Aerial Vehicles with Multi-Modal Data
Longkun Zou, Jiale Wang, Rongqin Liang, Hai Wu, Ke Chen, Yaowei Wang

TL;DR
UAV-MM3D is a comprehensive synthetic dataset with multimodal data for UAV perception, enabling improved 3D detection, tracking, and trajectory prediction in diverse environments.
Contribution
The paper introduces UAV-MM3D, a large-scale synthetic dataset with multimodal data and annotations for UAV perception, addressing real-world data collection challenges.
Findings
Provides 400K synchronized multimodal frames across diverse scenarios.
Includes baseline models for 3D detection, pose estimation, and trajectory prediction.
Facilitates benchmarking and development of UAV perception algorithms.
Abstract
Accurate perception of UAVs in complex low-altitude environments is critical for airspace security and related intelligent systems. Developing reliable solutions requires large-scale, accurately annotated, and multimodal data. However, real-world UAV data collection faces inherent constraints due to airspace regulations, privacy concerns, and environmental variability, while manual annotation of 3D poses and cross-modal correspondences is time-consuming and costly. To overcome these challenges, we introduce UAV-MM3D, a high-fidelity multimodal synthetic dataset for low-altitude UAV perception and motion understanding. It comprises 400K synchronized frames across diverse scenes (urban areas, suburbs, forests, coastal regions) and weather conditions (clear, cloudy, rainy, foggy), featuring multiple UAV models (micro, small, medium-sized) and five modalities - RGB, IR, LiDAR, Radar, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Air Traffic Management and Optimization · Robotics and Sensor-Based Localization
