SSC3OD: Sparsely Supervised Collaborative 3D Object Detection from LiDAR   Point Clouds

Yushan Han; Hui Zhang; Honglei Zhang; Yidong Li

arXiv:2307.00717·cs.CV·July 4, 2023·1 cites

SSC3OD: Sparsely Supervised Collaborative 3D Object Detection from LiDAR Point Clouds

Yushan Han, Hui Zhang, Honglei Zhang, Yidong Li

PDF

Open Access

TL;DR

This paper introduces SSC3OD, a novel framework for collaborative 3D object detection from LiDAR point clouds that requires only minimal sparse labels per scene, utilizing self-supervised and pseudo-labeling techniques to improve performance.

Contribution

The paper proposes SSC3OD, a sparsely supervised collaborative 3D detection framework with Pillar-MAE and instance mining modules, reducing annotation effort while maintaining high accuracy.

Findings

01

Effective performance improvement on large-scale datasets

02

Reduces annotation effort with only one labeled object per scene

03

Outperforms existing methods in sparse supervision scenarios

Abstract

Collaborative 3D object detection, with its improved interaction advantage among multiple agents, has been widely explored in autonomous driving. However, existing collaborative 3D object detectors in a fully supervised paradigm heavily rely on large-scale annotated 3D bounding boxes, which is labor-intensive and time-consuming. To tackle this issue, we propose a sparsely supervised collaborative 3D object detection framework SSC3OD, which only requires each agent to randomly label one object in the scene. Specifically, this model consists of two novel components, i.e., the pillar-based masked autoencoder (Pillar-MAE) and the instance mining module. The Pillar-MAE module aims to reason over high-level semantics in a self-supervised manner, and the instance mining module generates high-quality pseudo labels for collaborative detectors online. By introducing these simple yet effective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · 3D Shape Modeling and Analysis · Visual Attention and Saliency Detection