UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object   Detection

Yuchao Wang; Peirui Cheng; Pengju Tian; Ziyang Yuan; Liangjin Zhao,; Jing Tian; Wensheng Wang; Zhirui Wang; Xian Sun

arXiv:2406.04647·cs.CV·June 10, 2024

UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection

Yuchao Wang, Peirui Cheng, Pengju Tian, Ziyang Yuan, Liangjin Zhao,, Jing Tian, Wensheng Wang, Zhirui Wang, Xian Sun

PDF

Open Access

TL;DR

UVCPNet is a novel UAV-ground collaborative perception framework that enhances 3D object detection accuracy by addressing cross-domain disparities and depth estimation challenges through specialized modules and a virtual dataset.

Contribution

The paper introduces a new framework with the CDCA and CDO modules for improved aerial-ground perception and provides a virtual dataset for training and evaluation.

Findings

01

Improves detection accuracy by 6.1% on V2U-COO dataset.

02

Enhances detection accuracy by 2.7% on DAIR-V2X dataset.

03

Validates effectiveness through extensive experiments.

Abstract

With the advancement of collaborative perception, the role of aerial-ground collaborative perception, a crucial component, is becoming increasingly important. The demand for collaborative perception across different perspectives to construct more comprehensive perceptual information is growing. However, challenges arise due to the disparities in the field of view between cross-domain agents and their varying sensitivity to information in images. Additionally, when we transform image features into Bird's Eye View (BEV) features for collaboration, we need accurate depth information. To address these issues, we propose a framework specifically designed for aerial-ground collaboration. First, to mitigate the lack of datasets for aerial-ground collaboration, we develop a virtual dataset named V2U-COO for our research. Second, we design a Cross-Domain Cross-Adaptation (CDCA) module to align…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques

MethodsALIGN