Practical Collaborative Perception: A Framework for Asynchronous and Multi-Agent 3D Object Detection

Minh-Quan Dao; Julie Stephany Berrio; Vincent Frémont; Mao Shan; Elwan Héry; Stewart Worrall

arXiv:2307.01462 · cs.RO · September 20, 2023

TL;DR

This paper introduces a simple, effective collaborative perception framework for multi-agent 3D object detection that outperforms existing methods in bandwidth efficiency and robustness to synchronization issues.

Contribution

Proposes a novel collaboration method that improves bandwidth-performance tradeoff with minimal modifications to existing models and relaxed synchronization assumptions.

Findings

01. Achieves 98% of early-collaboration performance
02. Consumes bandwidth equivalent to late-collaboration methods
03. Outperforms prior state-of-the-art in efficiency and robustness

Abstract

Occlusion is a major challenge for LiDAR-based object detection methods. This challenge becomes safety-critical in urban traffic, where the ego vehicle must detect objects reliably to avoid collisions while its field of view is severely reduced by the large number of surrounding road users. Collaborative perception via Vehicle-to-Everything (V2X) communication, which leverages the diverse perspectives of connected agents at multiple locations to form a complete scene representation, is an appealing solution. State-of-the-art V2X methods resolve the performance-bandwidth tradeoff using a mid-collaboration approach, in which Bird's-Eye View images of point clouds are exchanged so that bandwidth consumption is lower than communicating point clouds as in early collaboration, and detection performance is higher than in late collaboration, which…

Code & Models

Repositories

quan-dao/practical-collab-perception (PyTorch, official)
